0% found this document useful (0 votes)

245 views43 pages

Stratified Sampling Explained

The document discusses stratified sampling. It begins by explaining that stratified sampling aims to reduce heterogeneity in a population by dividing it into homogeneous subgroups called strata. Samples are then randomly selected from each stratum. The document provides an example of stratifying students by grade level to estimate average student height. It then defines the key terms and notation used in stratified sampling and outlines the procedure. Finally, it discusses how to estimate the population mean and variance from a stratified sample.

Uploaded by

Shourjyo biswas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

245 views43 pages

Stratified Sampling Explained

Uploaded by

Shourjyo biswas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Chapter 4

Stratified Sampling
An important objective in any estimation problem is to obtain an estimator of a population parameter
which can take care of the salient features of the population. If the population is homogeneous with
respect to the characteristic under study, then the method of simple random sampling will yield a
homogeneous sample, and in turn, the sample mean will serve as a good estimator of the population
mean. Thus, if the population is homogeneous with respect to the characteristic under study, then the
sample drawn through simple random sampling is expected to provide a representative sample.
Moreover, the variance of the sample mean not only depends on the sample size and sampling fraction
but also on the population variance. In order to increase the precision of an estimator, we need to use a
sampling scheme which can reduce the heterogeneity in the population. If the population is
heterogeneous with respect to the characteristic under study, then one such sampling procedure is a
stratified sampling.

The basic idea behind the stratified sampling is to

 divide the whole heterogeneous population into smaller groups or subpopulations, such that the
sampling units are homogeneous with respect to the characteristic under study within the
subpopulation and
 heterogeneous with respect to the characteristic under study between/among the subpopulations.
Such subpopulations are termed as strata.
 Treat each subpopulation as a separate population and draw a sample by SRS from each stratum.
[Note: ‘Stratum’ is singular and ‘strata’ is plural].

Example: In order to find the average height of the students in a school of class 1 to class 12, the
height varies a lot as the students in class 1 are of age around 6 years, and students in class 10 are of age
around 16 years. So one can divide all the students into different subpopulations or strata such as
Students of class 1, 2 and 3: Stratum 1
Students of class 4, 5 and 6: Stratum 2
Students of class 7, 8 and 9: Stratum 3
Students of class 10, 11 and 12: Stratum 4
Now draw the samples by SRS from each of the strata 1, 2, 3 and 4. All the drawn samples combined
together will constitute the final stratified sample for further analysis.

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 1
Notations:
We use the following symbols and notations:
N : Population size
k : Number of strata
Ni : Number of sampling units in ith strata
k
N   Ni
i 1

ni : Number of sampling units to be drawn from ith stratum.

n  k ni : Total sample size
i 1

Population (N units)

Stratum 1 Stratum 2 Stratum k Nk units

N1 units N2 units
………

Sample 1 Sample 2 Sample

n1 units n2 units ……… k nk

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 2
Procedure of stratified sampling
Divide the population of N units into k strata. Let the ith stratum has N1,i  1, 2,..., number of units.
k

 Strata are constructed such that they are non-overlapping and homogeneous with respect to the
characteristic under study such that  N  N.
k
i1
i

 Draw a sample of size ni

from ith ( i  1, 2,..., k stratum using SRS (preferably WOR)
)
independently from each stratum.
 All the sampling units drawn from each stratum will constitute a stratified sample of size
n  k ni .
i1

Difference between stratified and cluster sampling schemes

In stratified sampling, the strata are constructed such that they are
 within homogeneous and
 among heterogeneous.

In cluster sampling, the clusters are constructed such that they are
 within heterogeneous and
 among homogeneous.
[Note: We discuss the cluster sampling later.]

Issues in the estimation of parameters in stratified sampling

Divide the population of N units in k strata. Let the i th stratum
has Ni , i 1, 2,..., number of units.
k
Note that there are k independent samples drawn through SRS of sizes
n1, n2 ,..., from each of the
strata. So, one can have k estimators of a parameter based on the sizes nk
respectively. Our
n1, n2 ,...,
nk

interest is not to have k different estimators of the parameters, but the ultimate goal is to have a single
estimator. In this case, an important issue is how to combine the different sample information together
Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur
Page 3
into one estimator, which is good enough to provide information about the parameter.

We now consider the estimation of population mean and population variance from a stratified sample.

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 4
Estimation of population mean and its variance
Let
Y : characteristic under study,
yij : value of jth unit in ith stratum j = 1,2,…,ni, i = 1,2,...,k,

1 N i

Yi  
Ni j 1
yij : population mean of ith stratum

ni
1
yi  y ij : sample mean from ith stratum
n j 1

1
i
 N Y   w Y : population mean where
k i i i i N
 Ni .
Y  wi
N
i1 i1

Estimation of population mean:

First, we discuss the estimation of the population mean.
Note that the population mean is defined as the weighted arithmetic mean of stratum means in the case
of stratified sampling where the weights are provided in terms of strata sizes.
1 k
one may choose the sample mean
Based on the expression Y   NYi i
,
N i1
1 k

y
n i1 ni yi

as a possible estimator of Y .

Since the sample in each stratum is drawn by SRS, so

E( yi )  Yi ,
thus
1 k

E( y ) 
n n E( y )
i1
i i

1 k

n n Y
i1
i i

Y
and y turns out to be a biased estimator of Y . Based on this, one can modify y so as to obtain an

unbiased estimator of Y . Consider the stratum mean which is defined as the weighted arithmetic mean
of strata sample means with strata sizes as weights given by
1 k
Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur
Page 5
yst  Ny.
N i1
i i

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 6
Now

E( yst ) 1 k N E( y )
N i1
i i

  k
NY
1 i1 i i
N

Y
Thus is an unbiased estimator of Y .
yst

Variance of
yst
k k ni

Var( )   w2 Var( y ) 
y
 ww Cov( y , y ).
st i i i j i
i1 i ( j )1 j 1

Since all the samples have been drawn independently from each of the strata by SRSWOR so
Cov( yi , y j )  0,i  j
N  ni 2
Var( y )  i S
i i
Ni ni
where
1 N
 Y i )2.
S2 
i
(Y i
ij
Ni 1 j1
Thus
k 2
Ni  ni 2

Var( yst )   Si
wi
i1 Ni ni
 n  S
2
2
k

 i w
 .
i i
1
i1  Ni  ni
2
Observe that Var( yst ) is small when S is small. This observation suggests how to construct the strata.
i
If S is small for all i = 1,2,...,k, then Var( y ) will also be small. That is why it was mentioned earlier
2
i st

that the strata are to be constructed such that they are within homogeneous, i.e., S 2 is small and among
i

heterogeneous.

For example, the units in geographical proximity will tend to be more closer. The consumption pattern
in the households will be similar within a lower income group housing society and within a higher
Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur
Page 7
income group housing society, whereas they will differ a lot between the two housing societies based on
income.

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 8
Estimate of Variance
Since the samples have been drawn by SRSWOR, so
E(s 2 )  S 2
i
1
where s2  (  y )2
y
ni
i
ij i
ni 1 j
1

and N i  ni
V‸ar( y )  s2
i
Ni ni i

so V‸ar(
k
2
y st

) i1 w
i i

2  Ni  ni  2
k

 w s
 i 
N n
 i
i1
 i i 

Note: If SRSWR is used instead of SRSWOR for drawing the samples from each stratum, then in this
case
yst  k wi yi
i1

E( yst )  Y k
 N 1  2
2  i
2 k
2
wi yst )  
Var( i
Si   wi
i1  Nn  ni
i i i1

 
2 2
k
w s
V‸ar( )   ni i
y s
i1 i
N

2
where   ( i  y )2.
1 y
i ni j 1 ij i

Advantages of stratified sampling

1. Data of known precision may be required for certain parts of the population.
This can be accomplished with a more careful investigation to a few strata.
Example: In order to know the direct impact of the hike in petrol prices, the population can be
divided into strata like lower income group, middle-income group and higher income group.
Obviously, the higher income group is more affected than the lower-income group. So more
Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur
Page 9
careful investigation can be made in the higher income group strata.
2. Sampling problems may differ in different parts of the population.
Example: To study the consumption pattern of households, the people living in houses, hotels,
hospitals, prison etc. are to be treated differently.

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 10
3. Administrative convenience can be exercised in stratified sampling.
Example: In taking a sample of villages from a big state, it is more administratively convenient
to consider the districts as strata so that the administrative set up at district level may be used for
this purpose. Such administrative convenience and the convenience in the organization of
fieldwork are important aspects in national level surveys.
4. Full cross-section of the population can be obtained through stratified sampling. It may be
possible in SRS that some large part of the population may remain unrepresented. Stratified
sampling enables one to draw a sample representing different segments of the population to any
desired extent. The desired degree of representation of some specified parts of the population is
also possible.
5. Substantial gain in efficiency is achieved if the strata are formed intelligently.
6. In the case of skewed population, use of stratification is of importance since larger weight may
have to be given for the few extremely large units, which in turn reduces the sampling
variability.
7. When estimates are required not only for the population but also for the subpopulations, then the
stratified sampling is helpful.
8. When the sampling frame for subpopulations is more easily available than the sampling frame
for the whole population, then stratified sampling is helpful.
9. If the population is large, then it is convenient to sample separately from the strata rather than
the entire population.
10. The population mean or population total can be estimated with higher precision by suitably
providing the weights to the estimates obtained from each stratum.

Allocation problem and choice of sample sizes is different strata

Question: How to choose the sample sizes n1, n2 ,..., so that the available resources are used in an
nk
effective way?
There are two aspects of choosing the sample sizes:
(i) Minimize the cost of survey for a specified precision.
(ii) Maximize the precision for a given cost.

Note: The sample size cannot be determined by minimizing both the cost and variability
simultaneously. The cost function is directly proportional to the sample size, whereas variability is
Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur
Page 11
inversely proportional to the sample size.
Based on different ideas, some allocation procedures are as follows:

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 12
1. Equal allocation
Choose the sample size ni to be the same for all the strata.

Draw samples of equal size from each stratum.

Let n be the sample size and k be the number of strata, then
n
n  for all i  1, 2,..., k.
i
k

2. Proportional allocation
For fixed k, select ni
such that it is proportional to stratum size Ni , i.e.,

ni  Ni
or ni   Ni

where  is the constant of proportionality.

 n 
i1
k
i

or n   N
n
  .
N
Thus n
n  N.
i   i
 N
Such allocation arises from considerations like operational convenience.

3. Neyman or optimum allocation

This allocation considers the size of strata as well as variability
ni  Ni Si
*
n  NS
i i i

*
where  is the constant of proportionality.
k k
 n  i
*
NS
i i
i1 i1
k

or n  
*
NS
i 1
i

or  
*
n
.
 Ni Si
k

i 1

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 13
Thus n 
nNi Si
i k .
Ni Si
i1

k
This allocation arises when the Var  yst  is minimized subject to the constraint (prespecified).
 ni i1

There are some limitations to the optimum allocation. The knowledge of Si (i 1, 2,..., k) is needed to

know ni . If there are more than one characteristics, then they may lead to conflicting allocation.

Choice of sample size based on the cost of survey and variability

The cost of the survey depends upon the nature of the survey. A simple choice of the cost function is
C  C0  k C n
i i
i1

where
C : total cost
C0 : overhead cost, e.g., setting up the office, training people etc

Ci : cost per unit in the ith stratum

 C n : total cost within the sample.

k
i1
i i

To find ni
under this cost function, consider the Lagrangian function with a Lagrangian
multiplier  as
  Var( y )   2 (C  C )
2 1 1  2
k st k
2

 wi  n   Si   Ci ni
N 
i1  i i i1
2 2
k
wi Si k k
w2iS 2i
    Cn 
2
i1 i i
i1 i1 Ni
ni  wS 2

 k  C n   terms independent of n .

i i i i  i
ni
i1  

Thus  is minimum when

wi Si
 
ni Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur
Sampling
Page 14
Ci ni for all i

or ni  wi Si
1 .
Ci

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 15
How to determine  ?
There are two ways to determine  .
(i) Minimize variability for a fixed cost.
(ii) Minimize cost for given variability.
We consider both cases.

(i) Minimize variability for fixed cost

Let C  C* be the pre-specified cost which is fixed.
0

So  Cn k
i i
 C*
0
i1

k wi
or  C*
Si 0

 i1
Ci  Ci

 Ci wi Si
k

or   i1
.
C0 *
1 wi
Substituting  in the expression for ni  , the optimum ni is obtained as
C i
Si

 
* wS  C*
ni  i i  k 0
.
Ci Ci wi Si 

 
 i1
The required sample size to estimate Y such that the variance is minimum for the given cost C  C* is
0
k
*
n i1 i

(ii) Minimize cost for a given variability

Let V  V0 be the pre-specified variance. Now determine such that
ni
k
1 1 
 w2S 2  V
 n N
 i i 0
i1  i i 
k
wi2 Si 2 k
w2 S 2
or   V0
 i i


 Ci
i1 ni
Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur
Page 16
k 2 2
i1
Ni S w
or  wS  
2
V
2
 i i
0
i1 wi Si i
i1 Ni

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 17
k
w2 S 2
V0   i i
Ni 1 wi Si
or   i1 (after substituting ni  ).
Ci
 wi Si Ci
k

i1

Thus the optimum ni is


wS
wSCk
i i i 
n~i   i 1 
w2S 2  .
i i

Ci 
V0   i i
k

 i1 Ni 

So the required sample size to estimate Y such that cost C is minimum for a
prespecified variance V0 is n  
k n~ .
i
i1

Sample size under proportional allocation for fixed cost and for fixed variance
(i) If cost C  C0 is fixed then C0  C n
k
i1
i i .

Under proportional allocation, ni  nN  nw

i i
N
k
So C0  n  wiCi
i1

C0
or n  .
w C k
i i
i1

Thus ni  Co wi .
 wiCi
The required sample size to estimate Y k
in this case is n   ni .
i1

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 18
(ii) If variance = V0 is fixed, then
k
1 1  2 2
 wS V
 n N  i i 0
i1  i i
k
wi2 Si 2 k
w2 S 2
or   
V 0
 i i
i1 ni i1 Ni
2 2
k
w S k
w2 S 2
or  i i  V0   i i (using ni  nwi )
i1
nwi i1 Ni
k

wS
2 2
i
or n  i1
w Sk 2 2

V0  i i
N i1 i
k

wS
2 2
i1
or ni  wi i
2 2
.
w S k

V0  i i
N i1 i

This is known as Bowley’s allocation.

Variances under different allocations

Now we derive the variance of y under proportional and optimum allocations.
s
t

(i) Proportional allocation

Under proportional allocation

ni  n Ni
N
and
Var( y)
k
 Ni  ni  2 2
st
  wi Si
i1  N n
i i

 n Ni  2
N   N 
Var )( y  
k i

 nN   i  Si
2
prop st
i1  N N   N
 i 
N
i
Nn k NS2

Nn 
i i

i1 N
Nn
Nn k i1


 i 4 | Stratified Sampling | Shalabh, IIT Kanpur
Sampling Theory| Chapter
Page 19
w S 2.

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 20
(ii) Optimum allocation
Under optimum allocation
nNi Si
ni  k

i1
N S i i

V
 (y ) k
1 1 
 w2 S 2
opt st
 n N
 i i
i 1  i i 
k
w2 S 2 k
w2 S 2
 i i
 i i

i1 ni i1 Ni
  k 
k    Ni Si  k w2 S 2
   w S  i 1   i i
2 2

i1  i i nN S
  i i  i1 Ni
  
1
 k N i Sk i   k w
i i 
i2 i2

.   NS
i1  n N i1  i1 Ni
2

1 k
NS  2 1 k
2
 1 k
wi S i 2k
  i i

2
  N n
 wS    wS 2
.
n N  i i
N
i i

 i1  i1 i  i1  i1

Comparison of variances of the sample mean under SRS with stratified

mean under proportional and optimal allocation:
(a) Proportional allocation:
Nn
VSR ( y)  S2
S Nn
k  n
N NS2
Vp r op ( yst )   ii.
Nn i1 N
In order to compare V
( y) and ( y ) first we attempt to express S 2 as a function of S 2.
V ,
SRS prop st i

Consider
k Ni

(N 1)S   2
 Y )2
(Y i1 j 1
i

Ni 2
k

   (Yij  Yi )  (Yi  Y )

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur
Page 21
i1 j
k 1 k Ni
Ni
 (Y  Y )   (Y  Y )
ij i
2 2

i1 j i1 j 1
1
k k
 (N
i1 1)S 2i 
i  N (Y
i1
i
 Y )2

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 22
N k 1 2 Ni
S  N i k1
 
2
S 
(Y  Y )2.

N i1 N i i1 N i

For simplification, we assume that

Ni is large enough to permit the approximation
Ni 1
 1 and N 1
Ni N 1.
Thus
Ni k
Ni
S 2  k S 2  (Y  Y ) 2
i1 N i1 N
i i

k Nn 2 Nn Ni N-n

or S 
 Ni k 2 N  n
S 
(Y  Y )2 (Premultiply on both sides)
 by
Nn Nn i1 N i Nn i1 N i Nn
Nn
Vark (Y )  V ( y ) 
w (Y  Y )2
 SRS prop st
Nn i1
i

Since  w (Y  Y )
i1
i
2
 0,

 Varprop ( yst )  VarSRS ( y ).

A larger gain in the difference is achieved when Yi differs from Y more.

(b) Optimum allocation

 1
V
y ( ) 
1 

wS
2
 wS 2
.
k k
opt st  i i  i i
n i1  N i1
Consider
 N  n  k  1 2

k  1 k
y ( ) V ( y )  
V   wS 2     wS    w S 2  

prop st opt st i i
 Nn  i1 i i   i i
 N i 1 
n
1  k
 k i1
2

  wS 2   wi Si 
n i

 i1  i1  
1 2
1 k wi S i  S
2

  ni1 n
w here
Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur
Page 23
1 k )2
n i1
wi (Si  S
k
S   wi Si
i1

 Varprop ( yst ) Varopt ( yst ) 

0 or Varopt ( yst )  Varprop ( yst
).

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 24
The larger gain in efficiency is achieved when Si differs from S more.
Combining the results in (a) and (b), we have
Varopt ( yst )  Varprop ( yst )  VarSRS ( y)

Estimate of variance and confidence intervals

Under SRSWOR, an unbiased estimate of S 2 for the ith stratum (i = 1,2,...,k) is
i
1
s2  ( ni
 y )2.
y
i
ij i
ni 1 j
1

In stratified sampling,
2 Ni  ni
k
2

Var( yst )   Ni ni Si .
wi
i1

So, an unbiased estimate of Var(yst ) is

‸ 2 Ni  ni 2
k
Var( yst )  wi N n si
 i1
i i
k
2
k 2 w
s w 2 s2
 
i1
i i
 i i

Ni
ni i1

  k
w2 s2 1 k

N
i i 2

ni w
i1 i1

The second term in this expression represents the reduction due to finite population correction.

The confidence limits of Y can be obtained as

yst  t V‸ar( yst )

assuming y is normally distributed and is well determined so that t can be read from normal
s
t V‸ar( y )
st
distribution tables. If only few degrees of freedom are provided by each stratum, then t values are
obtained from the table of student’s t-distribution.

The distribution of V‸ar ( y )is generally complex. An approximate method of assigning an effective
st

k 2
 
ii 2

number of degrees of freedom (n ) to 

is n   i1 
Sampling Theory| Chapter 4 | Stratified
e Sampling | Shalabh, IIT Kanpur
Page 25
k
V‸ar ( y )
st e 2 4
gi i

i1 ni
1
where g ) Ni (Ni  ni and Min(n 1)  n  k(n 1) assuming y are normally distributed.
i i e i ij
ni i1

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 26
Modification of optimal allocation
Sometimes in the optimal allocation, the size of subsample exceeds the stratum size. In such a case,
replace ni by Ni

and recompute the rest of ni ' s by the revised allocation.

For example, if n1 
then take the revised ni ' s as
N1, n~1  N1
and
(n  N1 )wi Si i  2, 3,..., k
;n~ 
i k

w S
i2
i i

provided n~i 
for all i = 2,3,…,k.
Ni

Suppose in revised allocation, we find that n~2  N2 then the revised allocation would be

n~1  N1
n~2  N 2
(n  N1  N 2 )wi Si
n~  ; i  3, 4,..., k.
 wi Si
i

i3

provided n~i 
for all i  3, 4,..., k.
Ni

We continue this process until every n~i  Ni .

In such cases, the formula for the minimum variance of

ys need to be modified as
t

Min Var( yst )  i

* 2
( wS )
  w Si
* 2

 n* N
where  * denotes the summation over the strata in which n~  and n* is the revised total sample
N
i i

size in the strata.

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 27
Stratified sampling for proportions
If the characteristic under study is qualitative in nature, then its values will fall into one of the two
mutually exclusive complimentary classes C and C’. Ideally, only two strata are needed in which all the
units can be divided depending on whether they belong to C or its complement C’. Thus is difficult to
achieve in practice. So the strata are constructed such that the proportion in C varies as much as possible
among strata.
Let

Pi Ai
 N :Proportion of units in C in the i stratum
th

ai
p  : Proportion of units in C in the sample from the ith stratum
i
ni
An estimate of population proportion based on the stratified sampling is

pst 
k
Ni pi
N .
 i1

which is based on the indicator variable

1
jth unit belongs totheith stratum is in C
Y when
ij 
0 otherwise
an yst  pst .
d

Her Ni
e S2  Pi Qi
i Ni 1

where Qi  1 Pi .

Also Var( ) Ni  ni
y w2 S 2 .

k

st i i
i1 Ni ni
1 k N
(N
2
 n ) PQ
So Var( pst ) 
2 N  i
N i1 i
ni i .
i1 i i

If the finite population correction can be ignored, then

Var( ) PiQi
p w2 .
st
 k
i
i1 ni
Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur
Page 28
If the proportional allocation is used for ni
, then the variance of pst is
Nn 1 k
N 2PQ
Var (p )

prop st
 i i i

N Nn i1 Ni 1
Nn k
 Nn i1 wi PiQi


Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 29
and its estimate is
‸ Nnk
wi piqi .
Var prop ( pst ) Nn
 n 
i1 i 1

The best choice of ni such that it minimizes the variance for fixed total sample size is

n  N Ni PiQi Ni 1
i i

 Ni PiQi

Thus n  n
Ni PiQi
i k .
 Ni PiQi
i1

Similarly, the best choice of ni k

such that the variance is minimum for fixed cost C  C0   Ci ni is
i1

PiQi
nN
iC
i
ni  k
.
PQ
N
i i
i
Ci
i1

Estimation of the gain in precision due to stratification

An obvious question crops up that what is the advantage of stratifying a population in the sense that
instead of using SRS, the population is divided into various strata? This is answered by estimating the
variance of estimators of population mean under SRS (without stratification) and stratified sampling by
evaluating
V‸ar SRS ( y) V‸ar( y )

V‸ar( y ) st
.
s
This gives an idea about the gain in efficiency due to stratification.
Nn 2
Since VarSRS ( y )  S , so there is a need to express in terms of S 2 . How to estimate S based
2
S2
Nn i

on a stratified sample?

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 30
Consider
k Ni
(N 1)S 2
  (Y i
 Y )2
i1 j 1
2
k Ni

   (Yij  Yi )  (Yi  Y ) 
i1 j 1
k Ni k
  (Y  Y )2   N (Y  Y )2
ij i i
i1 j 1 i1
k k
  (N 1)S 2 
i1  N (Y  Y )2 i1

 kk (N 1)S  N
 2
wY2 2

 
Y
i
i i 
i i1
i1

In order to estimate S 2 , we need to estimates of S i2 , Y 2 and Y 2. We consider their estimation one by
one.

(I) For an estimate of S 2 , we have

E(s2 )  S 2
i

So Sˆ  s 2 .
2
i

(II) For estimate of Y 2i , we know

Var( y )  E( y 2 ) [E( y )]2

i i
 E( y 2 )  Y 2
i
or Y 2  E( y 2 ) Var ( y ).
i i

An unbiased estimate of Y 2 is
ˆ
Y 2  y 2 V‸ar( y )
i

 Ni  ni  2
i
 y 2
i
 s.
i   i
 Ni ni 

(III) For the estimation of Y 2, we know

Var( y )  E( y 2 ) [E( 2
y )]
st st st

 E( y ) Y 2 2
s
Y 2  E( y 2 ) Var( y )
st st

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 31
So, an estimate of Y 2 is
ˆ
Y 2  y 2 V‸ar( y )
st st
 y2  k
 Ni  ni  2 2
st  wi si .
i1  N n
i i 

Substituting these estimates in the expression (n 1)S 2 as follows, the estimate of S

2 is obtained as

(N 1)S 2  
 (N
 k1)S  N
2
w Y 2Y 2


ˆ2 1 k
i i i i
i1 
N 
i1
k
ˆ 2
k
ˆ2 ˆ 2 
as   ( Ni   w
S 1)S
Ni NY 1 Y 
i1 i i  
1  k

 2  N  k  2  Ni  ni    2
k Ni  ni 2 2 

  Ni 1 si   N 1   wi  yi  2 wi si 
N n1   si     yst  
n  N
 i1    i1  i i  i1 i i 
  

1  k
2 N  k k
Ni  ni 2 
 2

yst )    Ni 1 si    wi ( yi   wi (1  wi ) si  .
N 1  i 1  N 1 i1 n i1 N
i i 
Thus
N  n ˆ2
V‸ar SRS ( y )  S
Nn
Nn N  ni 2 

 k  N ( N  n)
2 w ( y  y )2 k w (1 w ) i s
( i 1)s   nN (N 1)
N (N 1)n
k 

i 
i

 i1
   i1
i i st

i1
i i
Nn
i i 
and k
‸ ni Ni  2 2

Var( yst )  wi si .
Ni ni
 i1

Substituting these expressions in

V‸ar SRS ( y) V‸ar( y )

V‸ar( y ) st
,
s
the gain in efficiency due to stratification can be obtained.
Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur
Page 32
If any other particular allocation is used, then substituting the appropriate ni under that allocation, such

gain can be estimated.

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 33
Interpenetrating subsampling
Suppose a sample consists of two or more subsamples which are drawn according to the same sampling
scheme. The samples are such that each subsample yields an estimate of the parameter. Such subsamples
are called interpenetrating subsamples.

The subsamples need not necessarily be independent. The assumption of independent subsamples helps
in obtaining an unbiased estimate of the variance of the composite estimator. This is even helpful if the
sample design is complicated and the expression for variance of the composite estimator is complex.

Let there be g independent interpenetrating subsamples and t ,t ,...,t be g unbiased estimators of

1 2 g

parameter  where t j ( j  1, 2,..., g) is based on jth interpenetrating subsample.

Then an unbiased estimator of  is given by

ˆ 1 g
   t , say.
g j 1 t

Then

E(ˆ)  E(t )  
and
‸ ˆ ‸
g
1 2

Var( )  Var(t ) 
g(g  1)
 (t
j j t) .

Note that
‸  
1  g 2 2

(t    g(t  ) 
 g(g)  1)  j
E Var(t )  E
 

1  g
g(g 1) 
  Var(t j )  g Var(t )

1 
 (g 2  g)Var(t )  Var(t )
g(g 1)
If the distribution of each estimator tj is symmetric about  , then the confidence interval of  can be
obtained by
g 1
1
P  Min(t1 , t2 ,..., t g )    Max(t1 , t2 ,..., t g )  1   .
  2
Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur
Page 34
Implementation of interpenetrating subsamples in stratified sampling

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 35
Consider the set up of stratified sampling. Suppose that each stratum provides an independent
interpenetrating subsample. So based on each stratum, there are L independent interpenetrating
subsamples drawn according to the same sampling scheme.

ˆ
Le ij (tot
)
be an unbiased estimator of the total of jth stratum based on the ith subsample ,

i = 1,2,...,L; j = 1,2,...,k.

An unbiased estimator of the jth stratum total is given by

ˆ J ˆ
Yj) (tot  1  Yij (tot )
L i1

and an unbiased estimator of the variance of Yˆ

is given by
j(t

‸ˆ
j (tot ) 1 L ˆ ˆ 2
Var )  L(L 1)  ( ij) (tot  j) (tot ) .
i1

Thus an unbiased estimator of population total Ytot is

L
ˆ
ˆ 1
  ˆ
k k
Ytot  Yj (tot )  Yij (tot )
j i1 j
1
k 1

And an unbiased estimator of its variance is given by

V‸ar(Yˆ )  k V‸ar(Yˆ )
tot j (tot )
j 1

1 L
 L(L 1) i1  (ˆij (tot  ˆj (tot )2.
k

 ) j )
1

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 36
Post Stratifications

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 37
Sometimes the stratum to which a unit belongs may be known after the field survey only. For example,
the age of persons, their educational qualifications etc. can not be known in advance. In such cases, we
adopt the post-stratification procedure to increase the precision of the estimates.

Note: This topic is to be read after the next module on ratio method of estimation. Since it is related to
the stratification, so it is given here.

In post-stratification,
 draw a sample by simple random sampling from the population and carry out the survey.
 After the completion of the survey, stratify the sampling units to increase the precision of the
estimates.
Assume that the stratum size Ni is fairly accurately known. Let

mi : number of sampling units from ith stratum, i = 1,2,...,k.

 m  n.
i1
i

Note that mi is a random variable (and that is why we are not using the symbol ni as earlier).

Assume n is large enough or the stratification is such that the probability that some mi  0 is negligibly

small. In case, mi  0 for some strata, two or more strata can be combined to make the sample size non-
zero before evaluating the final estimates.

A post stratified estimator of the population mean Y is

ypost 1 k

N i1
N y i i .

Now
k
1  
E( y ) E 
N E( y m , m ,..., m )
 1 k 
N i
1k i1 
post

 E 
 NY


N  i1 i i 

Y

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 38
Var( y post )  E Var( m1 , m2 ,..., mk )  Var  E( m1 , m2 ,..., mk )
y post y post
2  1 1  2
 k


E  wi   
m  Si   Var(Y )
N
 i1  i i  
k
1   1 
 w  E
2 2

i     Si (Since Var(Y )  0).

N  mi   i 
i1  
1
To
find  1  , proceed as follows :
E
 
 mi  Ni
Consider the estimate of ratio based on ratio method of estimation as
n N

y  yj
Y Y j
Rˆ  j 1 R  j 1
 n
, N
.

x X
x
j 1
j X
j 1
j

We know that

E(Rˆ )  R
N  RS X  SXY
2

n . .
Nn X 2

1 if th unit belongs to ith stratum

Let  j
x
j

0 otherwise
and
y j 1for all j = 1,2,...,N.

Then R, Rˆ and S 2 reduces

x
to

y n
j

Rˆ j 1 n

 x n
j n
i

j 1

Y N
j

j 1 N

R X N
j
i
N
j 1

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 39
1  N
 1 N2 1  N2
S 2  N 1  X 2  NX 2  N 1   N i 2  N 1   Ni
x  Ni N Ni
 j 1
j
    
1 N  1  Ni N 
Sxy   X jYj  NXY   Ni   0.
N 1 N 1  N 2 
 j 1 

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 40
Using these values in E(Rˆ )  R, we have
n
E (Rˆ )  R  E
N  N (N  n)(N  Ni )
 .
  N nN 2 (N 1)
 ni  i i

Thus
N
1   N (N  n)(N  Ni ) 1
1E  
  N nN n2 N 2 (N 1) N
ni  i i i i
 N 1 
(N  n)N 1  .
 
n(N 1)N Nn n
i  i 
Replacing mi in place of ni , we obtain
 1   N
1E    (N  n)N 1 1 
m N  n(N 1)N N n n
 i i i  i 
Now substitute this in the expression of Var( ypost ) as
 1  1 2
Var( y ) k 2
S
w
posti  E  m  N  i
i1
  i i 
k
 N  n N N 1 
 w i S i  (N 1)n . N  1 nN  n 
2 2

i1  i  i 

N  n k w2S 2 1 1 1 1 
   
n(N 1) i i  w nw n
i1
 i i 
k N  n  1 
 2

n (N 1)
2

wi Si n 1 
i1 w
 i 

Nn k
 n2 (N 1) i1 (nw i1 w )S i
2

k Nn
 Nn (1 w )S 2.
 wS 
k
2


i i i i
n(N 1) i1 n2 (N i1
1)
Assuming N 1  N.

V(y Nn  n n N (1 w )S 2
n
 wS 
2
)

post
Nn i1
i
n2 i1
i i
N

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 41
 Vprop ( Nn n

(1 w )S .
2
yst )  Nn2 i1
i

The second term is the contribution to the variance of

ypost due to mi ' s not being proportionately
distributed.

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 42
If S 2  S 2 say for all i, then the last term in the expression is
, i w

Nn k Nn 2 k
Nn2  (1 w )S S (k 1)  w  1)
2

i1  i w Nn2 (Since

w
i1
i

 k 1  N  n w 2
  n  Nn S
  
k 1
 Var( y ).
st
n
n
The increase in the variance over Var (y is small if the average sample size n  per stratum is
)
prop st
2
reasonably large.

Thus a post-stratification with a large sample produces an estimator which is almost as precise as an
estimator in the stratified sampling with proportional allocation.

Sampling Theory| Chapter 4 | Stratified Sampling | Shalabh, IIT Kanpur

Page 43

Allocation of Sample Size
No ratings yet
Allocation of Sample Size
26 pages
Biostatistics for Health Professionals
No ratings yet
Biostatistics for Health Professionals
30 pages
Chi Square Statistics
No ratings yet
Chi Square Statistics
7 pages
Principles of Sampling
100% (1)
Principles of Sampling
14 pages
Measuring The Occurrence of Disease: Dr. Elijah Kakande MBCHB, MPH Department of Public Health
No ratings yet
Measuring The Occurrence of Disease: Dr. Elijah Kakande MBCHB, MPH Department of Public Health
25 pages
Sampling Techniques
No ratings yet
Sampling Techniques
6 pages
Data Management and Statistical Analysis Using SPSS
No ratings yet
Data Management and Statistical Analysis Using SPSS
4 pages
Sampling Techniques and Distributions
No ratings yet
Sampling Techniques and Distributions
125 pages
Estimation and Hypothesis Testing Guide
100% (2)
Estimation and Hypothesis Testing Guide
32 pages
Statistical Estimation Techniques Explained
No ratings yet
Statistical Estimation Techniques Explained
22 pages
Understanding Cluster Sampling Methods
No ratings yet
Understanding Cluster Sampling Methods
13 pages
One-Sample Hypothesis Testing Basics
No ratings yet
One-Sample Hypothesis Testing Basics
9 pages
Non-Parametric vs Parametric Tests
No ratings yet
Non-Parametric vs Parametric Tests
13 pages
Introduction to Statistics Overview
No ratings yet
Introduction to Statistics Overview
14 pages
ANOVA: Statistical Analysis Guide
No ratings yet
ANOVA: Statistical Analysis Guide
51 pages
Biostatistics Introduction and Variables
No ratings yet
Biostatistics Introduction and Variables
3 pages
Hypothesis Testing in Education
100% (2)
Hypothesis Testing in Education
47 pages
Statistics for Educators & Analysts
100% (1)
Statistics for Educators & Analysts
5 pages
Biostatistics for Medical Students
100% (1)
Biostatistics for Medical Students
19 pages
Chapter 3 Sampling Methods
No ratings yet
Chapter 3 Sampling Methods
46 pages
MST005 Solved
No ratings yet
MST005 Solved
41 pages
EpiData 3.1 Setup and Usage Guide
No ratings yet
EpiData 3.1 Setup and Usage Guide
36 pages
Understanding Basic Statistics Concepts
No ratings yet
Understanding Basic Statistics Concepts
58 pages
Sampling Methods and Data Presentation
No ratings yet
Sampling Methods and Data Presentation
19 pages
Week 2 Bayesian Statistics Quiz
33% (3)
Week 2 Bayesian Statistics Quiz
3 pages
Data Arrangement and Presentation Methods
No ratings yet
Data Arrangement and Presentation Methods
55 pages
Biostatistics for Nursing Students
100% (1)
Biostatistics for Nursing Students
40 pages
MCQs - Variables and Data Types
100% (1)
MCQs - Variables and Data Types
2 pages
Understanding Logistic Regression Basics
100% (1)
Understanding Logistic Regression Basics
37 pages
Biostatistics Lecture Notes Overview
50% (4)
Biostatistics Lecture Notes Overview
36 pages
12-Module 4 - Epidemiology Study Designs and Measures - Introduction and Motivation-04!09!2023
No ratings yet
12-Module 4 - Epidemiology Study Designs and Measures - Introduction and Motivation-04!09!2023
6 pages
Statistical Inference: Estimation & Hypothesis Testing
No ratings yet
Statistical Inference: Estimation & Hypothesis Testing
14 pages
Advanced Statistical Distributions
No ratings yet
Advanced Statistical Distributions
13 pages
Statistical Methods for Environmental Science
No ratings yet
Statistical Methods for Environmental Science
15 pages
Z Test and T Test
No ratings yet
Z Test and T Test
15 pages
Understanding Sampling Distributions in Statistics
No ratings yet
Understanding Sampling Distributions in Statistics
24 pages
Bayes' Law and Probability Concepts
No ratings yet
Bayes' Law and Probability Concepts
7 pages
L 7estimating Risk
100% (1)
L 7estimating Risk
63 pages
My Notes This Is Note For 2da3
100% (1)
My Notes This Is Note For 2da3
83 pages
Session Six Evidence Evaluation & Causality
No ratings yet
Session Six Evidence Evaluation & Causality
73 pages
Introduction To Probability and Statistics (IPS) : Endterm
No ratings yet
Introduction To Probability and Statistics (IPS) : Endterm
16 pages
Correlation of Fat Intake and Cholesterol
No ratings yet
Correlation of Fat Intake and Cholesterol
3 pages
Assignment5 - Regression Analysis - Solutions
No ratings yet
Assignment5 - Regression Analysis - Solutions
5 pages
Understanding Sample Surveys and Statistics
No ratings yet
Understanding Sample Surveys and Statistics
199 pages
Understanding Biological Variation and Statistics
No ratings yet
Understanding Biological Variation and Statistics
9 pages
Understanding Selection and Information Bias
No ratings yet
Understanding Selection and Information Bias
62 pages
Confidence Interval Estimation
No ratings yet
Confidence Interval Estimation
62 pages
Ss Notes
No ratings yet
Ss Notes
34 pages
Statistical Significance & Association
No ratings yet
Statistical Significance & Association
21 pages
Programmed Statistics by B L-Agarwal-Part1 PDF
No ratings yet
Programmed Statistics by B L-Agarwal-Part1 PDF
311 pages
New ND Manual PDF
No ratings yet
New ND Manual PDF
36 pages
SPSS and MINITAB Guide for Statistics
No ratings yet
SPSS and MINITAB Guide for Statistics
110 pages
Probit and Logit Analysis in Statistics
No ratings yet
Probit and Logit Analysis in Statistics
35 pages
Analyzing Grouped Data Statistics
No ratings yet
Analyzing Grouped Data Statistics
51 pages
Statistical Tests, P Values, Confidence Intervals, and Power, A Guide To Misinterpretations.
No ratings yet
Statistical Tests, P Values, Confidence Intervals, and Power, A Guide To Misinterpretations.
15 pages
Epid 600 Class 4 Measures of Association
No ratings yet
Epid 600 Class 4 Measures of Association
59 pages
Standard Deviation in Tree Height Study
100% (2)
Standard Deviation in Tree Height Study
2 pages
Chapter4 Sampling Stratified Sampling 1
No ratings yet
Chapter4 Sampling Stratified Sampling 1
27 pages
Stratified Sampling Explained
No ratings yet
Stratified Sampling Explained
26 pages
Understanding Stratified Random Sampling
No ratings yet
Understanding Stratified Random Sampling
16 pages
Whistleblowers Fight Over Award - Complaint
No ratings yet
Whistleblowers Fight Over Award - Complaint
15 pages
LFJ Format
No ratings yet
LFJ Format
3 pages
Distribution Transformer: Bureau of Energy Efficiency
No ratings yet
Distribution Transformer: Bureau of Energy Efficiency
51 pages
Divergence, Gauss Law in Differential Form
No ratings yet
Divergence, Gauss Law in Differential Form
22 pages
Chemical Bonding Seminar Notes
No ratings yet
Chemical Bonding Seminar Notes
2 pages
Pert and CPM
No ratings yet
Pert and CPM
137 pages
Ni Hms 418619
No ratings yet
Ni Hms 418619
15 pages
Arnis Muscle Endurance, Resistance Exercises
No ratings yet
Arnis Muscle Endurance, Resistance Exercises
14 pages
Transfer CFT UsersGuide AllOS en
No ratings yet
Transfer CFT UsersGuide AllOS en
2,506 pages
"All About That Bass" Lyrics Activity
No ratings yet
"All About That Bass" Lyrics Activity
2 pages
IIC Event List
No ratings yet
IIC Event List
6 pages
722.4 Transmission Catalogue: 722.400, 722-400, 722400, MERC124
No ratings yet
722.4 Transmission Catalogue: 722.400, 722-400, 722400, MERC124
3 pages
Understanding Volatile Oils and Their Uses
No ratings yet
Understanding Volatile Oils and Their Uses
60 pages
Vocab - Unit 3 - Data Communication
No ratings yet
Vocab - Unit 3 - Data Communication
6 pages
Anatomy Exam Questions for MBBCh Students
No ratings yet
Anatomy Exam Questions for MBBCh Students
2 pages
Cornelissen Et Al., 2016 - Flame Curtain Kiln Emissions
No ratings yet
Cornelissen Et Al., 2016 - Flame Curtain Kiln Emissions
17 pages
Is RG Jah Ss 2622023
No ratings yet
Is RG Jah Ss 2622023
9 pages
Passing Off in Intellectual Property Law
No ratings yet
Passing Off in Intellectual Property Law
31 pages
NorthRidge Church May 2012 Newsletter
No ratings yet
NorthRidge Church May 2012 Newsletter
15 pages
DSC 6 Notes
No ratings yet
DSC 6 Notes
14 pages
Chapter Vii
No ratings yet
Chapter Vii
2 pages
Design Guide for Unsupported Steel Beams
No ratings yet
Design Guide for Unsupported Steel Beams
57 pages
Nandi 2009
No ratings yet
Nandi 2009
8 pages
BZWBK24 Mobile Color Status Bar Update
100% (1)
BZWBK24 Mobile Color Status Bar Update
5 pages
Financial Literacy Advocacy Module
No ratings yet
Financial Literacy Advocacy Module
12 pages
Paper19 Set1 Sol
No ratings yet
Paper19 Set1 Sol
14 pages
Introduction to Yoga and Patanjali's Teachings
No ratings yet
Introduction to Yoga and Patanjali's Teachings
25 pages
Three: Georges Perec
No ratings yet
Three: Georges Perec
196 pages
1 Pgcil DT Ee 8 Feb 2024 Sol Part 1 PDF
No ratings yet
1 Pgcil DT Ee 8 Feb 2024 Sol Part 1 PDF
62 pages
Gas Compressor Efficiency Analysis
No ratings yet
Gas Compressor Efficiency Analysis
22 pages