Professional Documents
Culture Documents
Lecture 7-1 - Stratified Sampling
Lecture 7-1 - Stratified Sampling
Reading Assignment:!
Scheaffer, Mendenhall, Ott, & Gerow!
2/1/13
Chapter 5! 1
Goals for this Lecture!
2/1/13
2
Stratified Sampling!
2/1/13
3
Reasons for Stratified Sampling!
• More precision!
– Assuming strata are relatively homogeneous, can
reduce the variance in the sample statistic(s)!
– So, get better information for same sample size!
• Estimates (of a particular precision) needed
for subgroups!
– May be necessary to meet survey’s objectives!
– May have to oversample to get sufficient
observations from small strata !
• Cost!
– May be able to reduce cost per observation if
population stratified into relevant groupings!
2/1/13
4
An Example!
2/1/13
5
Source: Survey Methodology, 1st ed., Groves, et al, 2004.
Strata!
2/1/13
6
Some Notation!
2/1/13
7
Mean Estimation!
st ∑
N N i =1
τˆi = ∑ Ni yi
N i =1
2/1/13
8
Mean Estimation Summary!
1 L
• Estimator for the mean:! yst = ∑ Ni yi
N i =1
• Variance of yst :!
⎛ 1 L
⎞
Var ( yst ) = Var ⎜ ∑ N i yi ⎟
⎝N i=1 ⎠
1 L 2
= 2 ∑ N i Var ( yi )
N i=1
1 L 2⎛ ni ⎞ ⎛ si2 ⎞
! = 2 ∑ N i ⎜ 1− ⎟ ⎜ ⎟
N i=1 ⎝ N i ⎠ ⎝ ni ⎠
y
• Bound on the error of estimation:! 2 Var ( st )
2/1/13
9
Example: TV Viewing Time (1)!
• Survey to estimate
average TV viewing time!
• Population consists of two
urban areas and rural
residents!
• Reasons to stratify:!
– Similarity of viewing
habits by towns and rural
regions!
– Cost: perhaps cheaper to
survey each region
separately (?)!
• Sample allocated
proportional to strata size!
2/1/13
10
Source: Elementary Survey Sampling, 7th ed., Scheaffer, Mendenhall, Ott, and Gerow, 2012.
Example: TV Viewing Time (2)!
2/1/13
11
Source: Elementary Survey Sampling, 7th ed., Scheaffer, Mendenhall, Ott, and Gerow, 2012.
Example: TV Viewing Time (3)!
2/1/13
12
Source: Elementary Survey Sampling, 7th ed., Scheaffer, Mendenhall, Ott, and Gerow, 2012.
Example: TV Viewing Time (4)!
• Correct analysis:!
yst = 27.7, σˆ yst = 1.4, 95% CI = (24.9, 30.5)
2/1/13
13
Design Effect!
L
• Estimator for the total:!τˆst = Nyst = ∑ Ni yi
τˆ
• Variance of st :!
i =1
⎛ L
⎞
Var (τˆst ) = Var ⎜ ∑ N i yi ⎟
⎝ i=1 ⎠
L
=∑ N i2 Var ( yi )
i=1
L ⎛ n ⎞ ⎛ s 2
⎞
! =∑ N i ⎜ 1− ⎟ ⎜ ⎟
2 i i
i=1 ⎝ N i ⎠ ⎝ ni ⎠
τˆ
• Bound on the error of estimation:! 2 Var ( st )
2/1/13
15
Proportion Estimation Summary!
1 L
• Proportion estimator:! pˆ st = ∑ N i pˆ i
N i =1
• Variance of yst :!
⎛ 1 L
⎞
Var ( p̂st ) = Var ⎜ ∑ N i p̂i ⎟
⎝N i=1 ⎠
1 L 2
= 2 ∑ N i Var ( p̂i )
N i=1
1 L 2⎛ ni ⎞ ⎛ p̂i (1− p̂i ) ⎞
= 2 ∑ N i ⎜ 1− ⎟ ⎜ ⎟
N i=1 ⎝ N i⎠⎝
ni ⎠
!
p̂
• Bound on the error of estimation:! 2 Var ( st )
2/1/13
16
Sample Size Determination & Allocation!
2/1/13
17
Sample Size Calculation for
Estimating the Mean!
∑ i σ i ai
N 2 2
n= i=1
L
N 2 B 2 4 + ∑ N i2σ i2
i=1
2/1/13
18
Proportionate Allocation to Strata!
∑N σ i
2 2
i
ai i
2 2
i i
N) ∑Nσ i
2
i
n= i=1
L
= i=1
L
= i=1
1 L
N 2 B 2 4 + ∑ N i2σ i2 N 2 B 2 4 + ∑ N i2σ i2 NB 2 4 + ∑ N iσ i2
i=1 i=1 N i=1
2/1/13
19
Example!
Ni
Ni/N
ni
ni/Ni
yi si2
2/1/13
20
Source: Survey Methodology, 1st ed., Groves, et al, 2004.
Other Allocation Schemes!
2/1/13
21
Poststratification!
2/1/13
22
What We Have Covered!
2/1/13
23