Team5 Project
I. Introduction
1. Problem
Describe the height distribution of DUT students, using first-year FAST students as the sample.
Run a two-sample test on the mean entrance math scores of the two student groups.
2. Parameter
Population parameters of interest (from 21ES/ECE students)
Solution: We need to convert the data to a uniform format. We assume that the erroneous entries come from
students omitting or adding digits, so we correct them by judgment.
#Comment:
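In histogram 1, the heights are most densely concentrated in the interval [160, 170] cm, and
the interval [170, 175] cm has the lowest frequency.
In histogram 2, the distribution appears approximately normal. Note that the Central Limit Theorem
concerns the sample mean rather than the raw data: it implies that when the
sample size is larger than about 40, the sample mean is approximately normally distributed. Here n = 38, slightly below that rule of thumb.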
b. Weight in two groups.
#Comment:
For the weights in group 1, the lower half of the sample clearly spans a narrower range than the upper half,
and there are no outliers.
For the weights in group 2, the distribution is roughly symmetric about the median.
Again, there are no outliers.
c. Total Score.
#Comment:
-The sample values range from slightly below 21 to 27.
-The interval from the median to the 75th percentile is the narrowest.
-There is a mild outlier in this sample.
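As a sketch of what "mild outlier" usually means: under a common textbook convention (assumed here, since the report does not state its rule), a value is a mild outlier if it lies more than 1.5 IQR but at most 3 IQR beyond the nearest quartile, and an extreme outlier beyond 3 IQR. The scores below are invented purely for illustration and are not the survey data.

```r
# Hypothetical total scores, invented for illustration (NOT the survey data)
scores <- c(20.8, 23.5, 24.0, 24.5, 24.8, 25.0, 25.2, 25.4, 25.6, 26.0, 26.5, 27.0)

# Quartiles and interquartile range
q1  <- unname(quantile(scores, 0.25))
q3  <- unname(quantile(scores, 0.75))
iqr <- q3 - q1

# Distance of each value beyond the nearest quartile (0 if between Q1 and Q3)
dist <- pmax(q1 - scores, scores - q3, 0)

# Assumed convention: mild = (1.5 IQR, 3 IQR], extreme = beyond 3 IQR
mild    <- scores[dist > 1.5 * iqr & dist <= 3 * iqr]
extreme <- scores[dist > 3 * iqr]

print(mild)
print(extreme)
```

Note that R's built-in boxplot uses hinges rather than `quantile()` for its box, so its fences can differ slightly from this calculation.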
2. Inferential statistics.
Problem: Estimate the mean height of FAST students from group 1 and group 2.
a. Confidence Interval.
Group1.
Let the confidence level be 95%. Given n = 22, x̄ = 170.0818 and s = 8.015143.
Because the sample size is only 22, it is too small to rely on the CLT, so we must use the t-distribution.
-Use R code:
-Result:
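The original R code and output did not survive in this copy. A minimal sketch of the calculation, assuming only the summary statistics quoted above (the variable names are illustrative), is the t interval x̄ ± t(α/2, n−1) · s/√n:

```r
# Summary statistics reported in the text for Group 1
n    <- 22
xbar <- 170.0818
s    <- 8.015143
conf <- 0.95

# Critical value of the t distribution with n - 1 degrees of freedom
t_crit <- qt(1 - (1 - conf) / 2, df = n - 1)

# Half-width of the interval
margin <- t_crit * s / sqrt(n)

ci_group1 <- c(lower = xbar - margin, upper = xbar + margin)
print(round(ci_group1, 4))
```

With the raw height vector available, `t.test(heights, conf.level = 0.95)$conf.int` should give the same interval; `heights` is a hypothetical name for that vector.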
Group2.
Similarly, the confidence level is 95%. Given n = 38, x̄ = 170.6184 and s = 6.20531. The
sample size n = 38 is only slightly below 40, so we treat the CLT as approximately applicable and use the large-sample (normal) interval.
Result:
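The output is missing from this copy. As a sketch under the assumption that the large-sample interval x̄ ± z(α/2) · s/√n was used for group 2, the calculation from the quoted summary statistics would look like this (variable names are illustrative):

```r
# Summary statistics reported in the text for Group 2
n    <- 38
xbar <- 170.6184
s    <- 6.20531
conf <- 0.95

# Critical value of the standard normal distribution
z_crit <- qnorm(1 - (1 - conf) / 2)

# Half-width of the large-sample interval
margin <- z_crit * s / sqrt(n)

ci_group2 <- c(lower = xbar - margin, upper = xbar + margin)
print(round(ci_group2, 4))
```

Using `qt(..., df = n - 1)` instead of `qnorm()` would give a slightly wider interval, which is the more cautious choice when n is below the usual threshold.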
Group1
Because the sample size is only 22, it is too small to rely on the CLT, so we must use the t-distribution.
Moreover, this is a two-tailed test, so the rejection region lies in both tails.
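A sketch of this two-tailed t test from the group 1 summary statistics follows. The null value 167 cm is taken from the Conclusion and is assumed to be the H0 tested here; the variable names are illustrative.

```r
# Group 1 summary statistics from the confidence-interval section
n     <- 22
xbar  <- 170.0818
s     <- 8.015143
mu0   <- 167     # assumed null value (H0: mean height = 167 cm)
alpha <- 0.05

# Test statistic, critical value and two-sided p-value
t_stat  <- (xbar - mu0) / (s / sqrt(n))
t_crit  <- qt(1 - alpha / 2, df = n - 1)
p_value <- 2 * pt(-abs(t_stat), df = n - 1)

# Two-tailed rule: reject H0 when |t| is at least the critical value
reject_h0_t <- abs(t_stat) >= t_crit

cat("t =", round(t_stat, 4),
    " critical value =", round(t_crit, 4),
    " p-value =", round(p_value, 4),
    " reject H0:", reject_h0_t, "\n")
```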
This case calls for a large-sample test with unknown population standard deviation,
so the test statistic is different:
We then reject H0 if either z ≥ z_{α/2} or z ≤ −z_{α/2}.
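The rejection rule can be sketched as follows, assuming the large-sample statistic is z = (x̄ − μ0) / (s/√n), that it is applied to the group 2 data, and that H0 is a mean height of 167 cm as stated in the Conclusion. None of these three points is confirmed by the surviving text, and the variable names are illustrative.

```r
# Group 2 summary statistics from the confidence-interval section
n     <- 38
xbar  <- 170.6184
s     <- 6.20531
mu0   <- 167     # assumed null value (H0: mean height = 167 cm)
alpha <- 0.05

# Large-sample test statistic, with s standing in for the unknown sigma
z_stat <- (xbar - mu0) / (s / sqrt(n))

# Two-tailed critical value z_{alpha/2}
z_crit <- qnorm(1 - alpha / 2)

# Reject H0 if z >= z_{alpha/2} or z <= -z_{alpha/2}
reject_h0_z <- (z_stat >= z_crit) || (z_stat <= -z_crit)

cat("z =", round(z_stat, 4),
    " critical value =", round(z_crit, 4),
    " reject H0:", reject_h0_z, "\n")
```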
Using RStudio.
Result:
Comment: At a significance level of 0.05, we can barely reject the null hypothesis in favor
of the alternative hypothesis.
IV. Conclusion
1. At the 95% confidence level, the confidence interval for group 1 is [166.5281, 173.6355], and
for group 2 it is [168.6455, 172.5914]. Therefore, based on group 1 we cannot reject the null hypothesis
that the average height is 167 cm, so this value can represent the average height of DUT students. Based on
group 2, however, the null hypothesis is rejected, so it cannot be used there.
2. There is no significant difference in the true average entrance math score between the two groups of FAST students.