Results and Apendix Section

RESULTS SECTION
Provided below is the analysis of the questions in the assignment:
1. Read in the data file and give VAR007 to VAR009 more meaningful names.
Solution:
Provided below are the questions pertaining to the research objectives of the
questionnaire to be prepared:
• Please share your agreement with this statement: “I have supportive classmates"
• Please share your agreement with this statement: “My teacher has fair rules for the
class and is extremely impartial.”
• Please share your agreement with this statement: “I had a satisfying overall
academic experience in university program"
• Please share your agreement with this statement: “I often procrastinate about that
status of your personal goals in life"
• Please share your agreement with this statement: “Sports and other extracurricular
activities motivates me to learn more"
• Please share your agreement with this statement: “I think that the school provides
you with adequate sports facilities"
Likewise, the following meaningful names were allotted to all the variables as below:
Variable Name Revised Name
VAR001 Supportive_Classmates,
VAR002 Fair_Teachers
VAR003 Satisfying_Overall_Academic_Experience
VAR004 Personal_Goals_Procrastination
VAR005 Motivation_of_Extracurricular_Activities
VAR006 Adequate_Sports_Facilities
VAR007 Attainment_at_University
VAR008 Respondent_Gender
VAR009 Respondent_ID
2. Make a plot showing the distribution of responses on the dependent variable.
Solution:
Provided below is the plot of Attainment at University with respect to the Supportive
classmates:
From the above plot, it can be said that the students agreeing for supportive classmates
are having highest amount of attainment at university and students disagreeing for
supportive classmates are having lowest amount of attainment at university
respectively.
Provided below is the plot of Attainment at University with respect to the teachers being
fair:
From the above plot, it can be said that the students agreeing for fair and unbiased
faculties are having highest amount of attainment at university and students strongly
agreeing for the same are having lowest amount of attainment at university respectively.
Provided below is the plot of Attainment at University with respect to the overall
academic experience being satisfying:
From the above plot, it can be said that the students being neutral for satisfying overall
academic experience are having highest amount of attainment at university and students
strongly disagreeing for the same are having lowest amount of attainment at university
respectively.
Provided below is the plot of Attainment at University with respect to the

procrastination created in the status of personal goals:
From the above plot, it can be said that the students being neutral for procrastination of
their personal goals are having highest amount of attainment at university and students
strongly disagreeing for the same are having lowest amount of attainment at university
respectively.
Provided below is the plot of Attainment at University with respect to the motivation
gained with the extracurricular activities:
From the above plot, it can be said that the students agreeing for the motivation gained
from sports and extracurricular activities are having highest amount of attainment at
university and students strongly disagreeing for the same are having lowest amount of
attainment at university respectively.
Provided below is the plot of Attainment at University with respect to the adequate
sports facilities being provided:
From the above plot, it can be said that the students strongly disagreeing for the
adequate sports facilities are having highest amount of attainment at university and
students strongly agreeing for the same are having lowest amount of attainment at
university respectively.
3. Recode VAR001 to VAR006 from text strings to the corresponding values 1 to 5.
Solution:
The variables VAR001 to VAR006 were recoded in the pattern where 1 (Strongly
Disagree) to 5 (Strongly Agree) and the following snapshot represents the output from
the creation of the new variables with recorded values:
Original values:
Recoded values:
4. Compute a summary score for each respondent that combines all of the six
questions equally.
Solution:
Taking average as a summary score of all 6 responses at a respondent level, we get the
following 92 values for each respondent:
These values have been stored in a column named summary_score for further analysis.
5. Make an appropriately labelled plot showing the relationship between the
summary score and the dependent variable, with lines of different colour for Males
and Females
Solution:
Provided below is the scatter plot obtained for denoting the relationship between the
summary score and the dependent variable (Attainment at university), the different dots
of males and females have been used here for differentiation.
From the above plot, we can say that the proportion of males is higher than that of
females. Also, there is a extremely low rate of direct relationship seen between these
variables. This means that with an increase in summary score, there is a very mild
increase in the attainment at university.
6. Find the correlation between the summary score and the dependent variable
Solution:
From the cor() function in R, the correlation obtained between summary score and the
dependent variable is 0.04191044 respectively. This means there is very low form of
positive correlation between summary_scores obtained and the attainment at university.
We can also say that there is neutral relation between these 2 variables.
7. Obtain the Bayes Factor of this correlation
Solution:
Consider the below snapshot for the R output for Bayes Factor:
From the output, we can conclude the Bayes factor for the correlation to be 0.333
respectively.
8. Run a regression to predict the dependent variable from the summary score alone.
Solution:
Here,
Dependent variable: Attainment at university
Independent variable: Summary Score
Null hypotheses: There is significant relationship between summary score and

attainment at university.
Alternate hypotheses: There is an insignificant relationship between summary score and
Provided below is the summary of the regression model fitted between these 2
variables:
Provided below are the insights generated from the output:
The regression line fitted is:
Attainment at university = 49.9834 + 0.7759 * (Summary Score)

Here,
Probability value corresponding to the summary score: 0.692
As the probability value is more than 0.05, we do not reject null hypotheses at 5%
level of significance.
Hence, we can conclude that the summary scores of the respondents do not affect
the attainment at university significantly.
9. Run a regression to predict the dependent variable from the summary score and
sex.
Solution:
Here,
Dependent variable: Attainment at university
Independent variable: Summary Score and Respondent Sex
For respondent sex as the independent variable:

Null hypotheses: There is significant relationship between respondent sex and
Alternate hypotheses: There is an insignificant relationship between respondent sex and
For summary score as the independent variable:

Null hypotheses: There is significant relationship between summary score, respondent
and attainment at university.
Alternate hypotheses: There is an insignificant relationship between summary score and
Provided below is the summary of the regression model fitted between these 3
variables:
Provided below are the insights generated from the output:
The regression line fitted is:
Attainment at university = 47.999 + 2.079 * (Summary Score) – 6.331 * (Male

category of Respondent Gender)
Here,
Probability value corresponding to the summary score: 0.27
Probability value corresponding to the respondent sex (Male): 0.00274
As the probability value for summary score is more than 0.05, we do not reject null
hypotheses at 5% level of significance.
Hence, we can conclude that the summary scores of the respondents do not affect
the attainment at university significantly.
As the probability value for male respondents is less than 0.05, we reject null
hypotheses at 5% level of significance.
Hence, we can conclude that the mail respondents significantly impact the
APPENDIX
Presented below are all the codes used for this assignment:
Data = read.csv("ReferralData2020.csv")
#Provided below are the questions pertaining to the research objectives of the questionnaire to
be prepared:
#Please share your agreement with this statement: “I have supportive classmtes"
#Please share your agreement with this statement: “My teacher has fair rules for the class and
is extremely impartial.”
#Please share your agreement with this statement: “I had a satisfying overall academic
experience in university program"
#Please share your agreement with this statement: “I often procrastinate about that status of
your personal goals in life"
#Please share your agreement with this statement: “Sports and other extracurricular activities
motivates me to learn more"
#Please share your agreement with this statement: “I think that the school provides you with
adequate sports facilities"
##Q1.
#Re-naming and creating of more meaninnful naes for variables VAR001 to VAR006
colnames(Data) <- c("Supportive_Classmates",
"Fair_Teachers",
"Satisfying_Overall_Academic_Experience",
"Personal_Goals_Procrastination",
"Motivation_of_Extracurricular_Activities",
"Adequate_Sports_Facilities",
"Attainment_at_University",
"Respondent_Gender",
"Respondent_ID")
##Q2.
#Plot showing the distribution of responses on the dependent variable
Data$Supportive_Classmates = as.factor(Data$Supportive_Classmates)
Data$Fair_Teachers = as.factor(Data$Fair_Teachers)
Data$Satisfying_Overall_Academic_Experience =
as.factor(Data$Satisfying_Overall_Academic_Experience)
Data$Personal_Goals_Procrastination = as.factor(Data$Personal_Goals_Procrastination)
Data$Motivation_of_Extracurricular_Activities =
as.factor(Data$Motivation_of_Extracurricular_Activities)
Data$Adequate_Sports_Facilities = as.factor(Data$Adequate_Sports_Facilities)
Data$Attainment_at_University = as.numeric(Data$Attainment_at_University)
library(ggplot2)
#Plot of Attainment at University with respect to the Supportive classmates
ggplot(Data, aes(x = Supportive_Classmates, y = Attainment_at_University)) +
geom_bar(stat="identity", fill="steelblue")
#Plot of Attainment at University with respect to the teachers being fair

ggplot(Data, aes(x =Fair_Teachers , y = Attainment_at_University)) +
geom_bar(stat="identity", fill="yellow")
#Plot of Attainment at University with respect to the overall academic experience being
satisfying
ggplot(Data, aes(x =Satisfying_Overall_Academic_Experience , y =
Attainment_at_University)) +
geom_bar(stat="identity", fill="red")
#Plot of Attainment at University with respect to the procrastination created in the status of
personal goals
ggplot(Data, aes(x =Personal_Goals_Procrastination , y = Attainment_at_University)) +
geom_bar(stat="identity", fill="pink")
#Plot of Attainment at University with respect to the motivation gained with the extracurricular
activities
ggplot(Data, aes(x =Motivation_of_Extracurricular_Activities , y =
Attainment_at_University)) +
geom_bar(stat="identity", fill="purple")
#Plot of Attainment at University with respect to the adequate sports facilities being provided
ggplot(Data, aes(x =Adequate_Sports_Facilities , y = Attainment_at_University)) +
geom_bar(stat="identity", fill="green")
##Q3.
#Recode of VAR001 to VAR006 from text strings to the corresponding values 1 to 5
library(dplyr)
Data$Supportive_Classmates.n <- recode(Data$Supportive_Classmates,"Strongly Disagree" =
1, "Disagree" = 2, "Neither" = 3, "Agree" = 4, "Strongly Agree" = 5)
Data$Supportive_Classmates.n <- as.numeric(Data$Supportive_Classmates.n)
Data$Fair_Teachers.n <- recode(Data$Fair_Teachers,"Strongly Disagree" = 1, "Disagree" = 2,
"Neither" = 3, "Agree" = 4, "Strongly Agree" = 5)
Data$Fair_Teachers.n <- as.numeric(Data$Fair_Teachers.n)
Data$Satisfying_Overall_Academic_Experience.n <-
recode(Data$Satisfying_Overall_Academic_Experience,"Strongly Disagree" = 1, "Disagree"
= 2, "Neither" = 3, "Agree" = 4, "Strongly Agree" = 5)
Data$Satisfying_Overall_Academic_Experience.n <-
as.numeric(Data$Satisfying_Overall_Academic_Experience.n)
Data$Personal_Goals_Procrastination.n <-
recode(Data$Personal_Goals_Procrastination,"Strongly Disagree" = 1, "Disagree" = 2,
"Neither" = 3, "Agree" = 4, "Strongly Agree" = 5)
Data$Personal_Goals_Procrastination.n <- as.numeric(Data$Personal_Goals_Procrastination)
Data$Motivation_of_Extracurricular_Activities.n <-
recode(Data$Motivation_of_Extracurricular_Activities,"Strongly Disagree" = 1, "Disagree" =
2, "Neither" = 3, "Agree" = 4, "Strongly Agree" = 5)
Data$Motivation_of_Extracurricular_Activities.n <-
as.numeric(Data$Motivation_of_Extracurricular_Activities.n)
Data$Adequate_Sports_Facilities.n <- recode(Data$Adequate_Sports_Facilities,"Strongly
Disagree" = 1, "Disagree" = 2, "Neither" = 3, "Agree" = 4, "Strongly Agree" = 5)
Data$Adequate_Sports_Facilities.n <- as.numeric(Data$Adequate_Sports_Facilities.n)
##Q4.
#Summary score for each respondent that combines all of the six questions equally.
#Taking average of all 6 responses at a respondent level
Data$Summary_Score = rowMeans(Data[,c('Supportive_Classmates.n', 'Fair_Teachers.n',
'Satisfying_Overall_Academic_Experience.n',
'Personal_Goals_Procrastination.n','Motivation_of_Extracurricular_Activities.n','Adequate_S
ports_Facilities.n')])
##Q5.
#Plot showing the relationship between the summary score and the dependent variable, with
lines of different colour for Males and Females
library(ggplot2)
p <- ggplot(Data, aes(Summary_Score, Attainment_at_University))
p + geom_point(aes(colour = Respondent_Gender), size = 2)+
labs(y= "Summary Score", x = "Attainment at University", title="Scatterplot of Summary
Scores vs Attainment at University")
##Q6.
#Correlation between the summary score and the dependent variable
cor(Data$Summary_Score, Data$Attainment_at_University)
#0.04191044
##Q7.
#Bayes Factor of this correlation
#install.packages("BayesFactor")
#install.packages("Rcpp")
library(Rcpp)
library(BayesFactor)
correlationBF(Data$Summary_Score, Data$Attainment_at_University)
#Bayes factor analysis
#--------------
# [1] Alt., r=0.333 : 0.2576632 ±0%
#
#Against denominator:
# Null, rho = 0
#---
# Bayes factor type: BFcorrelation, Jeffreys-beta*
##Q8.
#Regression to predict the dependent variable from the summary score alone
Model1 = lm(Attainment_at_University~Summary_Score, data = Data)
summary(Model1)
##Q9.
#Regression to predict the dependent variable from the summary score alone
Model2 = lm(Attainment_at_University~Summary_Score+Respondent_Gender, data = Data)
summary(Model2)

Results and Apendix Section

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Results and Apendix Section

Uploaded by

Copyright:

Available Formats

RESULTS SECTION

Provided below is the analysis of the questions in the assignment:

2. Make a plot showing the distribution of responses on the dependent variable.

Provided below is the plot of Attainment at University with respect to the

3. Recode VAR001 to VAR006 from text strings to the corresponding values 1 to 5.

7. Obtain the Bayes Factor of this correlation

Null hypotheses: There is significant relationship between summary score and

Provided below are the insights generated from the output:

The regression line fitted is:

Attainment at university = 49.9834 + 0.7759 * (Summary Score)

For respondent sex as the independent variable:

For summary score as the independent variable:

Provided below are the insights generated from the output:

The regression line fitted is:

Attainment at university = 47.999 + 2.079 * (Summary Score) – 6.331 * (Male

#Plot of Attainment at University with respect to the teachers being fair

You might also like