Professional Documents
Culture Documents
Case Study Analysis: Academic Performance of University Students
Case Study Analysis: Academic Performance of University Students
Group members:
1. Nguyễn Thị Lan Anh 1604010005
2.Nguyễn Tuấn Phong 1704040093
3. Nguyễn Gia Phương Anh 1704040005
4. Lê Thị Bảo Ngọc 1704040084
5. Nguyễn Việt Hoa 1704040043
6. Nguyễn Thị Nhung 1704040093
7. Đặng Ngọc Quỳnh 1704040100
TABLE OF CONTENTS
I. Introduction...........................................................................................................1
II. Answering questions..............................................................................................1
Question 1...................................................................................................1
Question 2...................................................................................................1
Question 3...................................................................................................5
Question 4...................................................................................................7
Question 5...................................................................................................8
Question 6.................................................................................................10
I. INTRODUCTION
The objective of the survey in this case is to test for any significant interaction between
Classroom seating positions and Gender and to test for any significant difference in academic
performance (GPA) due to seat preference and gender. We can easily notice that the suitable
inference technique should be used for this study is Two-way ANOVA model. Two-way
ANOVA compares the mean differences among groups that have been split into 2 independent
factors, each with several levels. In particular, it is clear that respondents were asked to specify
one of three levels of seat preference: “front” , “middle” and “back”. Therefore, seating
positions become the first factor which including 3 levels. The second factor is gender with 2
levels of male and female. From utilizing two factors, two-way ANOVA will expose the
interaction between these two factors. Each combination of the factors is named a cell.
Therefore, total combinations of seats and genders results in 6 cells.
2. Produce descriptive statistics for the dataset. You are expected to generate as many
relevant descriptive statistics as possible using ALL the relevant tools introduced in the
labs of this course. Remember to provide appropriate interpretations for the descriptive
We can get the mean of the GPA and find the standard deviation of two other variables but we
have to convert variable Gender and Seat into factors. Using “Factor” function, then use “By”
function to get the mean for two groups at the same time.
Convert variable Gender and Seat into factors and Crosstabulation table between
Gender and Seat variables:
❖ StudentSurvey$Gender <- factor(StudentSurvey$Gender, levels=c("Male","Female"))
❖ StudentSurvey$Seat <- factor(StudentSurvey$Seat, levels=c("Back","Front","Middle"))
❖ table(StudentSurvey$Seat,StudentSurvey$Gender)
From this output, it is clearly seen that the highest standard deviation is the combination of back
seat and male gender at 0.4958685 and the lowest one is 0.3795011 examined from the group of
front seat and female gender.
❖ Graphical description
➢ install.packages("gplots")
➢ library(gplots)
➢ plotmeans(GPA ~ interaction(Seat,Gender), data = StudentSurvey, xlab = "Seat and
Gender", ylab = "expected GPA", main="Mean Plot with 95% CI")
Mean plot provides the difference between mean GPA of each combination and standard
deviation of them. Plot in front seat combined with female gender stands at the highest GPA with
more than 3.3 , followed by “Back.Female” at nearly 3.2, and the lowest one is the “Back.Male”
with only 3.0.
3. Check all the assumptions of the inference technique you suggest in Question 1. Are the
assumptions satisfied? Explain.
There are 3 assumptions required to use two – way ANOVA:
Male Female
Back 50 50
Front 50 50
Middle 50 50
As can be seen, the total sample size of this survey is 300 observations provided in the
accompanying file named StudentSurvey.csv, consists of six groups: Back-Male, Back-Female,
Front-Male, Front-Female, Middle-Male, Middle-Female. Since there is not any information on
how respondents are selected, the group thinks that they are chosen randomly. Each response
came from a different person, and his/her answer is not affected by another. Therefore, the
samples are independent, and are randomly selected.
install.packages("car")
library(car)
leveneTest(StudentSurvey$GPA,interaction(StudentSurvey$Seat,StudentSurvey$Gender)
, center=mean)
qqPlot(lm(GPA ~ Gender + Seat + Gender*Seat, data=StudentSurvey), simulate=T,
main="Q-Q Plot", labels=F)
The outcome:
It is clearly seen from the Q-Q plot that all outliers lie within the confidence envelop,
which obviously demonstrates that all populations are normally distributed.
4. Perform the inference technique you suggest in Question 1. Remember to provide all the
necessary steps. What are your interpretations and conclusions? Explain.
ANOVA test 2-way factors:
Step 1: Identify null and alternative hypothesis:
Ho: There is not a significant interaction between seat preference and gender in GPA.
Ha: There is significant interaction between seat preference and gender in GPA.
Step 2: Test statistic and p-value:
We used Rstudio to calculate and had the output as following:
> StudentSurvey.result<-aov(GPA ~ Gender*Seat, data = StudentSurvey)
> summary(StudentSurvey.result)
Df Sum Sq Mean Sq F value Pr(>F)
Gender 1 1.40 1.4008 7.108 0.0081 **
Seat 2 0.93 0.4673 2.371 0.0951 .
Gender:Seat 2 1.35 0.6745 3.423 0.0339 *
Residuals 294 57.94 0.1971
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Reject Ho. The effect of interaction between seat preference and gender is significant.
Step 5: Conclusion:
We have enough statistical evidence to conclude that there are significant differences in GPA
due to seat preference and gender.
As we can see from the interaction plot, the male and female student groups record a significant
difference among the ones who sit in the front, middle and back. Looking at the details, the
female group who sit in the front scores the highest GPA with over 3.3 while the male group
who also sit at the same spot has 3.1. The female sitting in the middle has approximately 3.1 and
the male group has a bit higher GPA. The female group who sits in the back shows a similarity
with the ones who sit in the middle but the male has the lowest GPA (less than 3.0). From this
interaction, we can conclude that the ones who sit from the middle to the front has the tendency
of having higher GPA. Yet, the female group who sits in the back also has remarkable result.
An intersection among seat lines can be observed in the above interaction plot. This indicates
that there is a connection between genders and the seat position. The female students sitting in
the front and the back of the class have better performance than the male students and the
contrary can be seen in the middle seat group.
III. Conclusion
Two-way ANOVA which is used to address this case is satisfied. It brings us to the conclusion
that it is significant about the change in academic performance due to the relationship classroom
seating positions and academic performance (GPA) for both female and male students.
APPENDIX
Please fill out this form to perform evaluation of your group members. Discuss with all members
and agree on the final evaluations.
Please evaluate each member out of a scale of 100%. Allocation should be based upon group
opinions regarding how satisfactorily the member fulfilled his/her assigned tasks within the
group’s case study. For example, a 100% rating should be given to members who fulfilled
satisfactorily the tasks assigned by the group.
Group members should ask themselves the following questions before assigning the percentages
to others.
1. Did he/she do his/her fair share of the work on schedule and to the group’s satisfaction?
2. Did he/she cooperate with other group members?
3. Did he/she participate in, contribute to and share ideas in all relevant discussions?
4. Did he/she attend group meetings when required?
5. Did he/she relate and communicate to other group members?
Note: Your final mark for the case study will be equal to Your group result * Your peer
rating.