Professional Documents
Culture Documents
b) Give a short paragraph (40 to 100 words) to explain why these variables are related
Quantitative and qualitative data provide different outcomes, and are often used together to
get a full picture of a population. For example, the data above is collected on:
- The age (quantitative), the degree of education (qualitative) could also be gathered to
get more detail on the age for each type of education level.
- The quantity of study devices (quantitative), learning devices data (qualitative) could
also be gathered to get more detail on the quantity of study devices for each type of
learning device.
c) For the qualitative variable, construct a table including frequency distributions and percent
frequency. Give a short description for the table.
(Choose types of learning devices as the qualitative variable)
Categories Frequency Percentage
Samsung 5 16.6667
iPhone 7 23.3333
iPad 5 16.6667
Macbook 13 43.3333
-> Looking at the frequency distribution table reveals that 5 out of 30 students have Samsung,
7 have iPhone, 5 have iPad and 13 have Macbook.
d) For the quantitative variable, compute the descriptive statistics including mean, mode,
median, variance, standard deviation, and range (you may use Data Analysis tool pack, or
statistical functions in MS Excel). Give a short description for this variable.
(Choose the quantity of learning devices as the quantitative variable)
60
- Mean: x= =2
30
- Mode: the most frequent value=2
- Median¿ 2
N
- Variance
∑ ( x i−x )2 20 20
i=1
¿ = = ≈ 0.6897
n−1 30−1 29
√
N
∑ ( x i− x)2
√
- Standard deviation i=1 20
: s= =
≈ 0.8305
n−1 30−1
- Range: largest number−smallest number=4−1=3
-> The mean tells that in average, each student has 2 devices or 2 devices is the expected
value for each student.
-> The median reveals that the “central” quantity of device(s) is 2.
-> The mode illustrates the majority of students have 2 learning devices.
-> Range shows 3 devices in a set vary and the difference between highest (4 devices) and
lowest values (1 device).
-> Variance represents an actual value of 0.6897 in a data set that vary from the mean and
the spread between quantities in a data set.
-> Standard deviation measures how far apart quantities are in a data set, which is 0.8305.
Count of Column
Quantity Labels
Row Labels 1 2 3 4 Grand Total
iPad 1 2 1 1 5
iPhone 4 3 7
Macbook 7 6 13
Samsung 4 1 5
Grand Total 9 13 7 1 30
√
N
∑ ( x i− x)2
√
- Sample standard deviation: i=1 654
s= = ≈ 9.6658
n−1 8−1
b. Construct a 90% confidence interval estimate of the population mean cost for
prescription of Zocor
1−α=0.9 => α =0.1 ; d f =n−1=7
t α =t 0.05,7 =1.895 => E=t × s =1.895 × 9.6658 ≈ 6.4759
α
2
,df
2
, df √n √8
- Lower limit: x−E=108−6.4759=101.5241
- Upper limit: x + E=108+ 6.4759=1 14.4759
101.5241< μ<114.4759
c. Construct a 95% confidence interval estimate of the population mean cost for
prescription of Zocor
1−α=0.95 => α =0. 05 ; d f =n−1=7
t α =t 0.02 5,7 =2.365 => E=t × s =2.365 × 9.6658 ≈ 8.0821
α
2
,df
2
, df √n √8
- Lower limit: x−E=108−8.0821=99.9179
- Upper limit: x + E=108+ 8.0821=11 6.0821
99.9179< μ<116.0821
d. Discuss why the 90% and 95% confidence intervals are different.
Different at 90% and 95% -> different t-value at 90% and 95% -> different
confidence intervals.
e. State the assumption about the population when construct the confidence interval in
part (b) and (c)
Level of significance is a statistical term for how willing you are to be wrong:
- With a 95% confidence interval, you have a 5% chance of being wrong.
- With a 90% confidence interval, you have a 10% chance of being wrong.
{
H 0 : μ=3
H 1 : μ≠ 3
b. Compute the standard error of the mean. Construct a 95% confidence interval for
population mean. Using the confidence interval, test whether or not the mean of the
population significantly differs from $3.
28
- Sample mean: x= =2.8
10
√
N
∑ (x i− x)2
√
- Standard deviation: i=1 4.417
s= = ≈ 0.7006
n−1 10−1
s
- Standard error of the mean ¿ ≈ 0.2215
√n
- Critical value: t α =t 0.025 ,9 =2.262
,df
2
s 0.7006
=> E=t α , df × =2 .262 × ≈ 0.5011
2 √n √10
- Lower limit: x−E=2. 8−0.5011=2.2988
- Upper limit: x + E=2.8+ 0.5011=3.3011
=> 2.2988< μ<3.3011 since the mean of population doesn’t differ from $3
c. Determine the test statistic and at 95% confidence, test whether or not the mean of the
population significantly differs from $3.
x−μ 2.8−3
- t-statistic: t= = ≈−0.9027
s / √ n 0.7006/ √ 10
- Critical value: t α ,df =t 0.025,9=2.262
2
s 0.7006
=> E=t α , df × =2.262 × ≈ 0.5011
2 √n √ 10
- Lower limit: x−E=2.8−0.5011=2.2988
- Upper limit: x + E=2.8+ 0.5011=3.3011
=> 2.2988< μ<3.3011 since the mean of population doesn’t differ from $3
{
H 0 : μ ≥ 25,000
H 1 : μ<25,000
b. Calculate the p-value and at 95% confidence, test the hypothesis and give the
conclusion?
- 1−α=0.95 => α =0.05
213000
- Sample mean: x= ≈ 23666.6667
9
√
N
∑ (x i− x)2
√
- Sample standard deviation: i=1 50000000
s= = =2500
n−1 9−1
x−μ 23666.6667−2500 0
- t-statistic: t= = =−1.6
s/√n 2500/ √ 9
p-value < 0.00001
Since p < 0.00001, p < = 0.05
Reject H0
There is no evidence to conclude the average income of households in the area is
at least $25,000