Professional Documents
Culture Documents
Ex. “Based on GSS data, we’re 95% confident that the population
mean of the variable LONELY (no. of days in past week you felt
lonely, y = 1.5, s = 2.2) falls between 1.4 and 1.6.
n
• Sample std. dev. estimates population std. dev. s
sˆ s i
( y y ) 2
n 1
s ˆ s / n (1 ) / n
itself depends on the unknown parameter!
In practice, we estimate
ˆ 1 ˆ
(1 )
s by se
ˆ n n
ˆ z(se) with se ˆ (1 ˆ ) / n
z-value such that prob. for a normal dist within z standard errors of
mean equals confidence level
(e.g., z=1.96 for 95% confidence, z=2.58 for 99% conf.)
se ˆ (1 ˆ ) / n 0.0(1.0) / 20 0.000
95% CI for population proportion is 0.0 ± 1.96(0.0), or (0.0, 0.0)
s
se
n
95% confidence interval for m :
s
y 1.96( se), which is y 1.96
n
Confidence Level
90% 95% 98% 99%
df t.050 t.025 t.010 t.005
1 6.314 12.706 31.821 63.657
10 1.812 2.228 2.764 3.169
16 1.746 2.120 2.583 2.921
30 1.697 2.042 2.457 2.750
100 1.660 1.984 2.364 2.626
infinity 1.645 1.960 2.326 2.576
y = 11.4, 11.0, 5.5, 9.4, 13.6, -2.9, -0.1, 7.4, 21.5, -5.3, -3.8,
13.4, 13.1, 9.0, 3.9, 5.7, 10.7
SPSS: Analyze -> compare means -> 1-sample t test
---------------------------------------------------------------------------------------
Variable N Mean Std.Dev. Std. Error Mean
weight_change 17 7.265 7.157 1.736
----------------------------------------------------------------------------------------
GSS asks “On average day, how many hours do you personally
watch TV?”
4268(0.20)(0.80) 683
(instead of n = 1067)
• Better to use approx value for rather than 0.50 unless you
have no idea about its value.
Choosing the Sample Size
• Determine parameter of interest (population mean
or population proportion)
2
z
Proportion (to be “safe,” set = 0.50): n (1 )
M
2
Mean (need a guess for value of s): z
n s
2
M
Example: n for estimating mean
2 2
z 2 1.96
n s 7
2
47
M 2
Possible samples ( yi y )2 ( yi y ) 2 ( yi m ) 2
(equally likely) n n 1 n
(0, 0) 0 0 1
(0, 2) 1 2 1
(2, 0) 1 2 1
(2, 2) 0 0 1