Journal of The American Statistical Association


Publisher: Taylor & Francis
Minimum Variance Stratification
Tore Dalenius (University of Stockholm) & Joseph L. Hodges Jr (University of California, Berkeley)
Published online: 11 Apr 2012.
To cite this article: Tore Dalenius & Joseph L. Hodges Jr (1959) Minimum Variance
Stratification, Journal of the American Statistical Association, 54:285, 88-101
To link to this article: http://dx.doi.org/10.1080/01621459.1959.10501501
MINIMUM VARIANCE STRATIFICATION
TORE DALENIUS
University of Stockholm
AND
JOSEPH L. HODGES, JR.
University of California (Berkeley)
1. Introduction
2. An approximation to the exact solution
3. Application I: $f(x) = e^{-x}$
4. Application II: $f(x) = x e^{-x}$
5. Application III: $f(x) = 2\varphi(x)$
6. Application IV: $f(x) = 2(1 - x)$
7. Comparisons of different computing techniques
8. Comparisons of different rules for stratification
9. Acknowledgments
References
When estimating the mean value of a quantity x, in a population to
be divided into L strata according to the value of a quantity closely
correlated with x, it is necessary to choose the L - 1 points of stratification.
Nearly optimum points are obtained if they are chosen to equalize
the integrals over the various strata of the square root of the population
density. A simple method for the iterative improvement of the points is
given and illustrated on several examples.
1. INTRODUCTION
1.1 Four specific design operations.

The use of stratified sampling involves four specific design operations:
(a) the choice of a stratification variable;
(b) the choice of the number L of strata;
(c) the determination of the way in which the population is to be stratified;
and
(d) the choice of the size $n_h$ of the sample to be taken from the hth stratum.
In the present paper, we will discuss problem (c). The emphasis is on theory
only; in par. 1.2.1 we indicate some aspects of great practical importance but
not dealt with here.
1.2 Review of different rules for stratification.

1.2.1 An exact solution. Dalenius [2] used an approach which provided an
exact solution to the problem as treated. This approach will be reviewed here,
since it is used in par. 2 of the present paper.
Dalenius considers a density $f(x)$ with mean
\[ \mu = \int_{x_0}^{x_L} t f(t)\,dt \tag{1} \]
The range $(x_0, x_L)$ of the estimation variable $x$ is cut up into L parts at points
$x_1 < \dots < x_h < \dots < x_{L-1}$. Each such part corresponds to a stratum; i.e.
the estimation variable $x$ is used as stratification variable as well.$^1$ For the hth
stratum
\[ W_h = \int_{x_{h-1}}^{x_h} f(t)\,dt \tag{2} \]
\[ W_h \mu_h = \int_{x_{h-1}}^{x_h} t f(t)\,dt \tag{3} \]
and
\[ \sigma_h^2 = \frac{\int_{x_{h-1}}^{x_h} t^2 f(t)\,dt}{W_h} - \mu_h^2 \tag{4} \]
Obviously
\[ \mu = \sum_h W_h \mu_h \tag{5} \]
A sample of $n = \sum_h n_h$ observations is selected from $f(x)$ and $\mu$ is estimated by
\[ \bar{x} = \sum_h W_h \bar{x}_h \tag{6} \]
where $\bar{x}_h$ is the mean of the $n_h$ observations from the hth stratum.
This estimate has variance
\[ \sigma^2(\bar{x}) = \sum_h \frac{W_h^2 \sigma_h^2}{n_h} \tag{7} \]
It is well known that the variance is minimum when using the Tschuprow-Neyman
allocation, i.e. when $n_h \propto W_h \sigma_h$. Then the variance equals
\[ \sigma^2_{\min}(\bar{x}) = \frac{1}{n} \Big( \sum_h W_h \sigma_h \Big)^2 \tag{8} \]
This variance is a function of the points $x_h$ of stratification. Dalenius [2]
demonstrated that the set $[x_h]$ of cutting points satisfying the relation
\[ \frac{\sigma_h^2 + (x_h - \mu_h)^2}{\sigma_h} = \frac{\sigma_{h+1}^2 + (x_h - \mu_{h+1})^2}{\sigma_{h+1}}, \qquad h = 1, \dots, L-1 \tag{9} \]
corresponds to minimum variance stratification, MVS.
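As a rough numerical illustration of Eqs. (2) to (8), the following Python sketch evaluates the stratum weights, means and standard deviations by Simpson's rule and then the minimum variance of Eq. (8). The density f(x) = e^{-x}, the cut point 1.39, the finite upper limit 40 standing in for infinity, and all function names are assumptions chosen for the example only; they are not part of the derivation above.

```python
import math

def simpson(g, a, b, n=2000):
    """Composite Simpson rule for the integral of g over [a, b]; n must be even."""
    h = (b - a) / n
    total = g(a) + g(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * g(a + k * h)
    return total * h / 3

def stratum_moments(f, cuts):
    """Return (W, mu, sigma): per-stratum weight, mean and std. dev., Eqs. (2)-(4)."""
    W, mu, sigma = [], [], []
    for a, b in zip(cuts[:-1], cuts[1:]):
        w = simpson(f, a, b)
        m1 = simpson(lambda t: t * f(t), a, b)
        m2 = simpson(lambda t: t * t * f(t), a, b)
        W.append(w)
        mu.append(m1 / w)
        sigma.append(math.sqrt(m2 / w - (m1 / w) ** 2))
    return W, mu, sigma

def neyman_allocation(W, sigma, n):
    """Tschuprow-Neyman allocation: n_h proportional to W_h * sigma_h (not rounded)."""
    total = sum(w * s for w, s in zip(W, sigma))
    return [n * w * s / total for w, s in zip(W, sigma)]

def min_variance(W, sigma, n):
    """Eq. (8): variance of the estimate under Tschuprow-Neyman allocation."""
    return sum(w * s for w, s in zip(W, sigma)) ** 2 / n

f = lambda t: math.exp(-t)      # illustrative density (assumption)
cuts = [0.0, 1.39, 40.0]        # 40 stands in for the infinite upper limit (assumption)
W, mu, sigma = stratum_moments(f, cuts)
print("W     =", W)
print("mu    =", mu)
print("sigma =", sigma)
print("n_h   =", neyman_allocation(W, sigma, 100))
print("n * min variance =", min_variance(W, sigma, 1))
```

With n = 1 the last quantity is $(\sum_h W_h \sigma_h)^2$, which is the scale on which the variances of the later tables appear to be reported.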

We wish to stress the fact that the approach discussed here is effective in
the sense of mathematical theory. In practical sample design work, one does not
have a nice density f(x) with which to work, but is restricted rather to some
measure of size from the past. Moreover, in practice, such considerations as
costs and the breadth of the optimum arise.
An interesting application of Eq. (9) is discussed by Strecker [9, Sec.
223.122].
1 Dalenius and Gurney [4] discuss the case with a specific stratification variable.
The solution given by Eq. (9) presents computational difficulties when applied
in a general case. For the special case where f(x) is the normal density
$\varphi(x)$, Zindler [10] gives a useful computational procedure. In par. 2 of this
paper, a simple, generally applicable method of approximating the exact set
$[x_h]$ is presented.
1.2.2 A conjecture. Dalenius and Gurney [4] conjectured that
\[ W_h \sigma_h = \text{constant} \tag{10} \]
would serve as an approximation to Eq. (9), when L is "large."$^2$
1.2.3 A rule of thumb. The observation that for many populations, and for
reasonable locations of the stratum boundaries, the relative variance does not
vary much from stratum to stratum, leads from Eq. (10) to
\[ W_h \mu_h' = \text{constant} \tag{11a} \]
where $\mu_h'$ refers to a measure variable which is assumed to be highly correlated
with the estimation variable, as discussed by Hansen et al. [6, p. 219]. For the
special case when these variables are identical, the rule is
\[ W_h \mu_h = \text{constant} \tag{11b} \]
Mahalanobis [8, p. 4, footnote] proposes a similar rule: "... an optimum
or nearly optimum solution would be obtained when the expected contribution
of each stratum to the total aggregate value of x is made equal for all strata."
Kitagawa [7, pp. 27-30] analyzes this "principle of equipartition" in order to
justify it and presents some related results.
1.2.4 Equidistant stratification. A common first step when setting up strata
is to make
\[ x_{h+1} - x_h = \text{constant} \tag{12} \]
Aoyama [1] derived this result by applying the mean value theorem to
Eq. (9), and assuming that the variation of the density is small in each stratum.
Dalenius and Hodges [5] analyze this rule in some detail.
2. AN APPROXIMATION TO THE EXACT SOLUTION
2.1 The approach of this paper

This paper will make use of the same approach as the one used by Dalenius
[2]; the crucial point is that the population is represented by a density f(x).
As a consequence, the results arrived at are of the same theoretical nature as
those discussed in par. 1.
2.2 First approximation
We introduce the transformation
\[ y(u) = \int_{-\infty}^{u} \sqrt{f(t)}\,dt \tag{13} \]
When $u \to \infty$, $y(u)$ approaches an upper bound $H$. The roots $x_1', \dots, x_h', \dots, x_{L-1}'$
of the following equations
\[ y(u) = \frac{h}{L} H, \qquad h = 1, \dots, L-1 \tag{14} \]
are taken as the (first) approximations, for large L, to the points $x_1, \dots, x_h, \dots, x_{L-1}$
satisfying Eq. (9).
2 In practical sample design, it is important to realize that the big gain from the use of one-stage stratified
sampling is obtained in going from L = 1 to L = 2. This point is illustrated by Dalenius [3, Ch. 8].
2.3 Justification
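This approximation may be derived by the following heuristic argument.
When L is large, the strata will be narrow, and each will have an approximately
rectangular distribution, so that $\sqrt{12}\,\sigma_h \approx x_h - x_{h-1}$. Then, by the mean value
theorem there exists a value $f_h$ of $f$ in the hth stratum, such that $W_h = f_h (x_h - x_{h-1})$, and hence
\[ \sqrt{12} \sum_h W_h \sigma_h \approx \sum_h f_h (x_h - x_{h-1})^2 = \sum_h \big[ \sqrt{f_h}\,(x_h - x_{h-1}) \big]^2 \approx \sum_h (y_h - y_{h-1})^2 \tag{15} \]
where $y_h = y(x_h)$. Since $\sum_h (y_h - y_{h-1}) = H$ is fixed, the last sum is minimized by
making $y_h - y_{h-1}$ = constant. A rigorous proof has
been given by Dalenius and Hodges [5].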
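The first approximation of par. 2.2 can be sketched numerically for an arbitrary density. The Python fragment below accumulates the integral of the square root of f by the trapezoidal rule on a fine grid and reads off the points where it reaches (h/L)H. The function name, the grid size and the finite range (lo, hi) replacing an infinite one are assumptions of this sketch, not part of the method as stated.

```python
import math
from bisect import bisect_left

def cum_sqrt_f_points(f, lo, hi, L, n=20000):
    """First-approximation cut points: solve y(x_h) = (h / L) * H on a grid."""
    step = (hi - lo) / n
    xs = [lo + k * step for k in range(n + 1)]
    root = [math.sqrt(f(x)) for x in xs]
    y = [0.0]
    for k in range(1, n + 1):
        # trapezoidal cumulative integral of sqrt(f)
        y.append(y[-1] + 0.5 * (root[k - 1] + root[k]) * step)
    H = y[-1]
    cuts = []
    for h in range(1, L):
        target = h * H / L
        k = bisect_left(y, target)               # first grid index with y[k] >= target
        frac = (target - y[k - 1]) / (y[k] - y[k - 1])
        cuts.append(xs[k - 1] + frac * step)     # linear interpolation inside the cell
    return cuts

# Illustrative use with f(x) = e^{-x}; 60 stands in for infinity (assumption).
for L in (2, 3, 4, 5):
    print(L, [round(x, 2) for x in cum_sqrt_f_points(lambda t: math.exp(-t), 0.0, 60.0, L)])
```

A grid is used here only because it works for any density that can be evaluated pointwise; when y(u) has a closed form, solving Eq. (14) directly is simpler.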
2.4 Adaption to numerical calculations

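We assume that the density f(x) is stratified into L strata. Two consecutive
strata are specified by $x_{h-1}$, $x_h$ and $x_{h+1}$. In order to simplify the formulas, we
will denote these points of stratification by $x_{h-1} = x_g$, $x_h = x_h$ and $x_{h+1} = x_i$.
The interval $(x_g, x_h)$ corresponds to the hth stratum, and $(x_h, x_i)$ to the ith
stratum.
We define
\[ I_p(u) = \int_{-\infty}^{u} t^p f(t)\,dt \tag{16} \]
The conditional means $\mu_h$, $\mu_i$ and variances $\sigma_h^2$, $\sigma_i^2$ of the two strata are easily
expressed in terms of $I_p(u)$:
\[ \mu_h = \frac{\int_{x_g}^{x_h} t f(t)\,dt}{\int_{x_g}^{x_h} f(t)\,dt} = \frac{I_1(x_h) - I_1(x_g)}{I_0(x_h) - I_0(x_g)} \tag{17} \]
\[ \sigma_h^2 = \frac{I_2(x_h) - I_2(x_g)}{I_0(x_h) - I_0(x_g)} - \mu_h^2 \tag{18} \]
with analogous expressions for the ith stratum. Moreover, we define
\[ J_{ph} = I_p(x_h) - I_p(x_g), \qquad J_{pi} = I_p(x_i) - I_p(x_h) \tag{19} \]
Thus
\[ \mu_h = \frac{J_{1h}}{J_{0h}} \tag{20} \]
\[ \sigma_h^2 = \frac{J_{2h}}{J_{0h}} - \frac{J_{1h}^2}{J_{0h}^2} = \frac{J_{0h} J_{2h} - J_{1h}^2}{J_{0h}^2} \tag{21} \]
The condition given by Eq. (9) may now be expressed in terms of $J_{ph}$ and
$J_{pi}$ as follows
\[ \frac{J_{2h} - 2 x_h J_{1h} + x_h^2 J_{0h}}{\sqrt{J_{0h} J_{2h} - J_{1h}^2}} = \frac{J_{2i} - 2 x_h J_{1i} + x_h^2 J_{0i}}{\sqrt{J_{0i} J_{2i} - J_{1i}^2}} \tag{22} \]
For simplicity, this expression will be written
\[ A_h - B_h = \Delta_h = 0 \tag{23} \]
The set $[x_h]$ satisfying Eq. (23) corresponds to MVS. If we substitute any
other values, say $x_h'$, we will denote the left and right side of Eq. (22) by $A_h'$
and $B_h'$ respectively and the difference by $\Delta_h'$.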
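The quantities $A_h$, $B_h$ and $\Delta_h$ of Eqs. (22) and (23) are straightforward to evaluate once the functions $I_p(u)$ are available. The following Python sketch does so for any list of cut points. The density f(x) = e^{-x} with the closed forms of Eq. (36) and the trial point 1.39 are borrowed from par. 3 purely as an illustration, and the function names are our own.

```python
import math

def I_exp(p, u):
    """I_p(u), p = 0, 1, 2, for the illustrative density f(x) = e^{-x}, x >= 0."""
    if math.isinf(u):
        return (1.0, 1.0, 2.0)[p]
    e = math.exp(-u)
    return (1 - e, 1 - e - u * e, 2 - e * (u * u + 2 * u + 2))[p]

def side(J0, J1, J2, x):
    """One side of Eq. (22): (J2 - 2 x J1 + x^2 J0) / sqrt(J0 J2 - J1^2)."""
    return (J2 - 2 * x * J1 + x * x * J0) / math.sqrt(J0 * J2 - J1 * J1)

def deltas(cuts, I):
    """Return [(A_h, B_h, Delta_h)] for every interior cut point of `cuts`."""
    out = []
    for h in range(1, len(cuts) - 1):
        xg, xh, xi = cuts[h - 1], cuts[h], cuts[h + 1]
        Jh = [I(p, xh) - I(p, xg) for p in range(3)]   # stratum (x_g, x_h)
        Ji = [I(p, xi) - I(p, xh) for p in range(3)]   # stratum (x_h, x_i)
        A, B = side(*Jh, xh), side(*Ji, xh)
        out.append((A, B, A - B))
    return out

A1, B1, D1 = deltas([0.0, 1.39, math.inf], I_exp)[0]
print(round(A1, 4), round(B1, 4), round(D1, 4))
```

Working in unrounded arithmetic, the values obtained this way agree with those of Table 95 to within a few units in the third decimal place.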
2.5 Second approximation

In general, we may not expect the set $[x_h']$ derived from Eq. (14) to satisfy
Eq. (23). Thus, there is need for some method for adjusting the initial set
$[x_h']$ into a set $[x_h'']$ which then can be checked in Eq. (23) etc. We will present
such a method here.
Consider a rectangular distribution f(x) = 1. With no loss of generality, we
may assume $0 \le x \le b$. For this distribution we have:
\[ J_0 = b, \qquad J_1 = \frac{b^2}{2}, \qquad J_2 = \frac{b^3}{3} \tag{24} \]
Substituting these values in
\[ A = \frac{J_2 - 2 b J_1 + b^2 J_0}{\sqrt{J_0 J_2 - J_1^2}} \tag{25} \]
which is $A_h$ for h = 1, we get
\[ A = \frac{2}{\sqrt{3}}\, b \approx 1.15\, b \tag{26} \]
i.e. A changes at about 1.15 times the rate at which the interval length (0, b)
changes.
As the next step, consider that this rectangular distribution is divided into
L = 2 strata, at a point $x_1'$, $0 < x_1' < b$, arbitrarily chosen. This point $x_1'$ specifies
$A_1'$ and $B_1'$. Applying the result quoted above, we realize that changing $x_1'$ by
one unit will change $A_1'$ and $B_1'$ by approximately 1.15 units each and the
difference $\Delta_1'$ by approximately 2.3 units. It seems reasonable to determine a
second point $x_1''$ of stratification by setting $\Delta_1''$ equal to zero, where
\[ \Delta_1'' = \Delta_1' + (x_1'' - x_1') \frac{4}{\sqrt{3}} \tag{27} \]
The solution of $\Delta_1'' = 0$ is given by
\[ x_1'' = x_1' - \frac{\sqrt{3}}{4} \Delta_1' \tag{28} \]
The point $x_1''$ may reasonably be expected to be superior to $x_1'$ as an approximation
to the MVS point $x_1$.
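The rectangular-case constants used above can be checked numerically. The short Python sketch below evaluates the two sides of Eq. (22) for f(x) = 1 on (0, b), and confirms that A equals $2b/\sqrt{3}$ and that $\Delta_1$ changes by $4/\sqrt{3}$ per unit change of the cut point. The helper names and the value b = 3 are arbitrary choices for the illustration.

```python
import math

def J(p, a, b):
    """Partial moment of f(x) = 1 over [a, b]: the integral of t^p dt."""
    return (b ** (p + 1) - a ** (p + 1)) / (p + 1)

def side(a, b, x):
    """(J2 - 2 x J1 + x^2 J0) / sqrt(J0 J2 - J1^2) for the stratum [a, b], cut point x."""
    J0, J1, J2 = J(0, a, b), J(1, a, b), J(2, a, b)
    return (J2 - 2 * x * J1 + x * x * J0) / math.sqrt(J0 * J2 - J1 * J1)

def delta(x, b):
    """Delta_1 = A_1 - B_1 for the two strata [0, x] and [x, b]."""
    return side(0.0, x, x) - side(x, b, x)

b = 3.0
print(side(0.0, b, b), 2 * b / math.sqrt(3))            # Eq. (26)
print(delta(2.0, b) - delta(1.0, b), 4 / math.sqrt(3))  # slope used in Eq. (27)
print(delta(b / 2, b))                                  # vanishes at the midpoint
```

For the rectangular distribution $\Delta_1$ is exactly linear in the cut point, which is why a single step of Eq. (28) lands on the midpoint in that case.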
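We will now generalize this to any number L of strata. For three consecutive
strata, with indices g, h and i, we have (in the rectangular case)
\[ \frac{\partial A_h'}{\partial x_g'} = \frac{\partial B_h'}{\partial x_h'} = -\frac{2}{\sqrt{3}}, \qquad \frac{\partial A_h'}{\partial x_h'} = \frac{\partial B_h'}{\partial x_i'} = \frac{2}{\sqrt{3}} \tag{29} \]
while all expressions of the type $\partial A_h' / \partial x_i'$, $\partial B_h' / \partial x_g'$ etc. are equal to zero.
From $\Delta_h' = A_h' - B_h'$ we derive analogous expressions for $\partial \Delta_h' / \partial x_h'$ etc.
These values will now be used in the following way. We have a set $[x_h']$
with corresponding $\Delta_h'$-values. We want to find a set $[x_h'']$ with $|\Delta_h''| < |\Delta_h'|$.
By the mean value theorem we have
\[ \Delta_h'' = \Delta_h' + \sum_{j = g, h, i} (x_j'' - x_j') \frac{\partial \Delta_h'}{\partial x_j'} = 0, \qquad h = 1, \dots, g, h, i, \dots, L-1, \tag{30} \]
where we put $x_0'' = x_0'$ and $x_L'' = x_L'$. Solving this system for $x_h''$ gives the set
wanted.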
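The matrix M of $\partial \Delta_h' / \partial x_j'$ is, under the approximating rectangular distribution,
\[ M = \frac{2}{\sqrt{3}} \begin{pmatrix} 2 & -1 & 0 & \cdots & 0 & 0 \\ -1 & 2 & -1 & \cdots & 0 & 0 \\ 0 & -1 & 2 & \cdots & 0 & 0 \\ \vdots & & & \ddots & & \vdots \\ 0 & 0 & 0 & \cdots & -1 & 2 \end{pmatrix} \tag{31} \]
the matrix being a square matrix with L - 1 rows and columns.
In the first and last rows we assumed the rectangular distribution with finite
range which is limited at one end by the point of very large absolute value. The
larger the number L of strata, the better the approximation by the rectangular
distribution. The inverse of M is
\[ M^{-1} = \frac{\sqrt{3}}{2L} \begin{pmatrix} L-1 & L-2 & \cdots & 2 & 1 \\ L-2 & 2(L-2) & \cdots & 4 & 2 \\ \vdots & \vdots & & \vdots & \vdots \\ 2 & 4 & \cdots & 2(L-2) & L-2 \\ 1 & 2 & \cdots & L-2 & L-1 \end{pmatrix} \tag{32} \]
Thus, the new set $[x_h'']$ is found by computing
\[ \begin{aligned} x_1'' &= x_1' - \frac{\sqrt{3}}{2L} \big[ (L-1)\Delta_1' + (L-2)\Delta_2' + \dots + \Delta_{L-1}' \big] \\ x_2'' &= x_2' - \frac{\sqrt{3}}{2L} \big[ (L-2)\Delta_1' + 2(L-2)\Delta_2' + \dots + 2\Delta_{L-1}' \big] \\ &\;\;\vdots \\ x_{L-1}'' &= x_{L-1}' - \frac{\sqrt{3}}{2L} \big[ \Delta_1' + 2\Delta_2' + \dots + (L-1)\Delta_{L-1}' \big] \end{aligned} \tag{33} \]
If necessary, the procedure may be repeated to give a third set $[x_h''']$ etc.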
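The matrices of Eqs. (31) and (32) and the update of Eq. (33) can be written down compactly in code. In the Python sketch below the entries of the inverse are generated as $\frac{\sqrt{3}}{2L}\min(i, j)\,(L - \max(i, j))$, which is our reading of the pattern displayed in Eq. (32); the product with M is checked numerically rather than taken on trust. The function names are hypothetical, and the trial values 1.39 and 0.2761 are those of the L = 2 example in par. 3.2.1.

```python
import math

def M_matrix(L):
    """Eq. (31): (2 / sqrt(3)) * tridiag(-1, 2, -1), with L - 1 rows and columns."""
    n, c = L - 1, 2 / math.sqrt(3)
    return [[c * (2 if i == j else -1 if abs(i - j) == 1 else 0) for j in range(n)]
            for i in range(n)]

def M_inverse(L):
    """Eq. (32): entry (i, j) is sqrt(3)/(2L) * min(i, j) * (L - max(i, j)), 1-based."""
    n, c = L - 1, math.sqrt(3) / (2 * L)
    return [[c * min(i, j) * (L - max(i, j)) for j in range(1, n + 1)]
            for i in range(1, n + 1)]

def second_approximation(x, delta):
    """Eq. (33): x'' = x' - M^{-1} Delta' for the interior points x_1, ..., x_{L-1}."""
    L = len(x) + 1
    Minv = M_inverse(L)
    return [xi - sum(m * d for m, d in zip(row, delta)) for xi, row in zip(x, Minv)]

# Check that the two matrices are inverse to each other for L = 5.
L = 5
M, Minv = M_matrix(L), M_inverse(L)
product = [[sum(M[i][k] * Minv[k][j] for k in range(L - 1)) for j in range(L - 1)]
           for i in range(L - 1)]
print([[round(v, 12) for v in row] for row in product])

# For L = 2 the update reduces to Eq. (28).
print(second_approximation([1.39], [0.2761]))
```

Using the closed form of the inverse avoids solving a linear system at each step, which presumably mattered more for hand computation than it does here.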
As the final step in our adjustment procedure, we now pretend that the pro-
cedure discussed above for the rectangular case, holds reasonably well also for
other cases.
In the following paragraphs, we will apply this procedure to some densities.
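3. APPLICATION I: $f(x) = e^{-x}$

3.1 Derivation of y(u) and $I_p(u)$
From the definition of y(u), we get
\[ y(u) = \int_0^u \sqrt{e^{-t}}\,dt = 2(1 - e^{-u/2}) \tag{34} \]
with $y(0) = 0$, $y(\infty) = 2$.

Moreover,
\[ I_p(u) = \int_0^u t^p e^{-t}\,dt \tag{35} \]
Evaluating the integral gives
\[ \begin{aligned} I_0(u) &= 1 - e^{-u} \\ I_1(u) &= 1 - e^{-u} - u e^{-u} \\ I_2(u) &= 2 - e^{-u}(u^2 + 2u + 2) \end{aligned} \tag{36} \]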
3.2 Numerical calculations
3.2.1 L = 2 strata. The first approximation $x_1'$ is given by the root of the
following equation
\[ \int_0^{x_1'} \sqrt{f(t)}\,dt = \frac{1}{2} \int_0^{\infty} \sqrt{f(t)}\,dt \tag{37} \]
or in this case
\[ 2(1 - e^{-u/2}) = \frac{1}{2} \cdot 2 = 1 \tag{38} \]
giving $x_1' = 1.39$ correct to two decimal places.
Using the $I_p(u)$ functions, we now compute$^3$ the following Table 95.
TABLE 95
DETAILS OF NUMERICAL CALCULATIONS OF FIRST APPROXIMATION
TO THE OPTIMUM POINT OF STRATIFICATION OF $f(x) = e^{-x}$ FOR L = 2
                  $x_0' = 0$    $x_1' = 1.39$    $x_2' = \infty$
$I_0(x_h')$ =     0.0000        0.7509           1.0000
$I_1(x_h')$ =     0.0000        0.4047           1.0000
$I_2(x_h')$ =     0.0000        0.3280           2.0000
$J_{01}'$ = 0.7509    $J_{02}'$ = 0.2491
$J_{11}'$ = 0.4047    $J_{12}'$ = 0.5953
$J_{21}'$ = 0.3280    $J_{22}'$ = 1.6720
We compute
$A_1'$ = 2.2761
$B_1'$ = 2.0000
$\Delta_1'$ = 0.2761
As $\Delta_1'$ is considerably larger than zero, we have to adjust $x_1'$. We thus compute
\[ x_1'' = x_1' - \frac{\sqrt{3}}{4} \Delta_1' \tag{39} \]
giving $x_1'' = 1.27$.
We now repeat the computations contained in Table 95. From the resulting
$I_p(x_1'')$- and $J''$-functions, we compute
$A_1''$ = 2.0162
$B_1''$ = 2.0007
$\Delta_1''$ = 0.0155
Thus, $\Delta_1''$ is much closer to 0 than is $\Delta_1'$. Therefore, $x_1''$ should be closer to
the best point $x_1$ than is $x_1'$. In fact, as shown in par. 3.3, the variance corresponding
to $x_1''$ is less than the variance corresponding to $x_1'$.
3 The computations have been carried out at the Institute of Statistics, University of Stockholm.
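The adjustment of Eq. (39) may be repeated until $\Delta_1$ is negligible. The Python sketch below does this for f(x) = e^{-x} and L = 2, starting from 1.39. It uses one shortcut that is our own observation rather than a statement of the text: for this density the stratum above any cut point is a shifted exponential with unit standard deviation and mean one unit above the cut, so the right side of Eq. (22) equals 2 for every cut point. The number of repetitions (25) is an arbitrary choice.

```python
import math

def A(x):
    """Left side of Eq. (22) for the stratum (0, x) of f(x) = e^{-x}, via Eq. (36)."""
    e = math.exp(-x)
    J0, J1, J2 = 1 - e, 1 - e - x * e, 2 - e * (x * x + 2 * x + 2)
    return (J2 - 2 * x * J1 + x * x * J0) / math.sqrt(J0 * J2 - J1 * J1)

B = 2.0   # right side of Eq. (22) for the upper stratum (x, infinity); see lead-in

x = 1.39                       # first approximation from Eq. (38)
history = [x]
for _ in range(25):
    x -= math.sqrt(3) / 4 * (A(x) - B)   # one application of Eq. (39)
    history.append(x)

print("second approximation:", round(history[1], 2))
print("after repeated steps:", round(history[-1], 4), "Delta =", A(history[-1]) - B)
```

The first step reproduces the second approximation 1.27, and further steps change the point only in the second decimal place, in line with the small gain in variance reported for the second approximation.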
3.2.2 L = 3 strata. The first approximations $x_1'$ and $x_2'$ are given by the
roots of the following equations (cf. par. 3.2.1)
\[ 2(1 - e^{-u/2}) = \frac{1}{3} \cdot 2, \qquad 2(1 - e^{-u/2}) = \frac{2}{3} \cdot 2 \tag{40} \]
giving $x_1' = 0.81$ and $x_2' = 2.20$. Using these values, we carry through the computations
necessary to obtain $x_1''$, $x_2''$, and the variances for these values. The
results are shown in Table 96.
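The L = 3 computation can be retraced in a few lines. The Python sketch below evaluates $\Delta_1'$ and $\Delta_2'$ from Eq. (22) at the first approximations 0.81 and 2.20 and applies the update of Eq. (33); the function names and the way the inverse matrix entries are generated are assumptions of the sketch.

```python
import math

def I(p, u):
    """I_p(u) of Eq. (36) for f(x) = e^{-x}; u may be math.inf."""
    if math.isinf(u):
        return (1.0, 1.0, 2.0)[p]
    e = math.exp(-u)
    return (1 - e, 1 - e - u * e, 2 - e * (u * u + 2 * u + 2))[p]

def side(J0, J1, J2, x):
    """One side of Eq. (22)."""
    return (J2 - 2 * x * J1 + x * x * J0) / math.sqrt(J0 * J2 - J1 * J1)

def deltas(cuts):
    """Delta_h = A_h - B_h at each interior cut point."""
    out = []
    for h in range(1, len(cuts) - 1):
        xg, xh, xi = cuts[h - 1], cuts[h], cuts[h + 1]
        Jh = [I(p, xh) - I(p, xg) for p in range(3)]
        Ji = [I(p, xi) - I(p, xh) for p in range(3)]
        out.append(side(*Jh, xh) - side(*Ji, xh))
    return out

def update(x, delta):
    """Eq. (33), with inverse-matrix entries min(i, j) * (L - max(i, j)) * sqrt(3) / (2L)."""
    L = len(x) + 1
    c = math.sqrt(3) / (2 * L)
    return [x[i] - c * sum(min(i + 1, j + 1) * (L - max(i + 1, j + 1)) * delta[j]
                           for j in range(L - 1))
            for i in range(L - 1)]

x_first = [0.81, 2.20]
d_first = deltas([0.0] + x_first + [math.inf])
x_second = update(x_first, d_first)
print("Delta' :", [round(d, 4) for d in d_first])
print("x''    :", [round(v, 2) for v in x_second])
```

The resulting points round to 0.73 and 2.04, the second approximations listed for L = 3 in Table 96.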
3.2.3 L>3 strata. Similar calculations have been carried out for L=4 and
L = 5 strata. As they are entirely analogous to those reported above, no details
will be given. The results are summarized in Table 96.
3.3 Summary table of numerical results

In the following Table 96 the numerical results are summarized.
TABLE 96
FIRST AND SECOND APPROXIMATION TO OPTIMUM POINTS OF
STRATIFICATION OF $f(x) = e^{-x}$ FOR L = 2, ..., 5 STRATA AND
THE ASSOCIATED VARIANCES
(First approximation: $x_h'$; second approximation: $x_h''$)
Number of   Approx-    Points of stratification               Variances
strata      imation    h = 1     h = 2     h = 3     h = 4
2           First      1.39      -         -         -         0.2877
            Second     1.27      -         -         -         0.2855
3           First      0.81      2.20      -         -         0.1345
            Second     0.73      2.04      -         -         0.1339
4           First      0.58      1.39      2.77      -         0.0777
            Second     0.52      1.27      2.61      -         0.0774
5           First      0.45      1.02      1.83      3.22      0.0503
            Second     0.39      0.92      1.68      3.02      0.0503
4. APPLICATION II: $f(x) = x e^{-x}$
4.1 The $I_p(u)$ functions
For this density, we get
\[ \begin{aligned} I_0(u) &= 1 - e^{-u} - u e^{-u} \\ I_1(u) &= 2 - e^{-u}\,[u^2 + 2u + 2] \\ I_2(u) &= 6 - e^{-u}\,[u^3 + 3u^2 + 6u + 6] \end{aligned} \tag{41} \]
In the following Table 97, the numerical results are summarized.
TABLE 97
STRATIFICATION OF $f(x) = x e^{-x}$ FOR L = 2, ..., 5 STRATA
AND THE ASSOCIATED VARIANCES
(First approximation: $x_h'$; second approximation: $x_h''$)
Number of   Approx-    Points of stratification               Variances
strata      imation    h = 1     h = 2     h = 3     h = 4
2           First      2.37      -         -         -         0.6376
            Second     2.30      -         -         -         0.6389
3           First      1.56      3.41      -         -         0.3111
            Second     1.54      3.26      -         -         0.3069
4           First      1.21      2.37      4.12      -         0.1822
            Second     1.20      2.27      3.94      -         0.1817
5           First      1.00      1.88      2.96      4.62      0.1195
            Second     1.01      1.82      2.86      4.49      0.1192
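For this density the transformation y(u) of par. 2.2 has a closed form, which makes the first approximations easy to recompute. Substituting $t = s^2$ gives $y(u) = \sqrt{2\pi}\,\mathrm{erf}(\sqrt{u/2}) - 2\sqrt{u}\,e^{-u/2}$ with $H = \sqrt{2\pi}$; this expression is our own derivation and does not appear in the text. The Python sketch below solves Eq. (14) by bisection; the function names and the bracket (0, 60) are assumptions.

```python
import math

H = math.sqrt(2 * math.pi)   # limit of y(u) as u -> infinity

def y(u):
    """y(u) = integral from 0 to u of sqrt(t e^{-t}) dt, in closed form."""
    return H * math.erf(math.sqrt(u / 2)) - 2 * math.sqrt(u) * math.exp(-u / 2)

def solve(target, lo=0.0, hi=60.0):
    """Bisection for y(u) = target; y is increasing on (lo, hi)."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if y(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def first_approximation(L):
    """Roots of y(u) = (h / L) * H, h = 1, ..., L - 1, as in Eq. (14)."""
    return [solve(h * H / L) for h in range(1, L)]

for L in (2, 3, 4, 5):
    print(L, [round(x, 2) for x in first_approximation(L)])
```

The points found this way agree with the first-approximation rows of Table 97 for L = 2 and L = 4 to within about 0.01 to 0.02; differences in the last printed digit are to be expected, since the tabulated values were computed by hand.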
5. APPLICATION III: $f(x) = 2\varphi(x)$, $0 \le x$

5.1 The $I_p(u)$ functions
For the right half of the normal density, we get
\[ \begin{aligned} I_0(u) &= \Phi(u) \\ I_1(u) &= -\varphi(u) \\ I_2(u) &= I_0(u) + u I_1(u) \end{aligned} \tag{42} \]
where $\Phi$ is the distribution function corresponding to $\varphi$.
5.2 Summary table of numerical results
TABLE 98
STRATIFICATION OF $f(x) = 2\varphi(x)$, $0 \le x$, FOR L = 2, ..., 5
STRATA AND THE ASSOCIATED VARIANCES
(First approximation: $x_h'$; second approximation: $x_h''$)
Number of   Approx-    Points of stratification               Variances
strata      imation    h = 1     h = 2     h = 3     h = 4
2           First      0.95      -         -         -         0.02752
            Second     0.89      -         -         -         0.02729
3           First      0.61      1.37      -         -         0.01301
            Second     0.57      1.29      -         -         0.01286
4           First      0.45      0.95      1.63      -         0.00749
            Second     0.42      0.89      1.54      -         0.00751
5           First      0.36      0.74      1.19      1.81      0.00489
            Second     0.34      0.70      1.12      1.72      0.00482
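The first approximations for the half-normal density can be recomputed from normal quantiles. Since $\sqrt{2\varphi(t)}$ is proportional to $e^{-t^2/4}$, the ratio $y(u)/H$ equals $2\Phi(u/\sqrt{2}) - 1$, so Eq. (14) gives $x_h' = \sqrt{2}\,\Phi^{-1}\big((1 + h/L)/2\big)$. This closed form is our own derivation, offered as a check rather than as the authors' computing method; the Python sketch uses `statistics.NormalDist` from the standard library.

```python
from math import sqrt
from statistics import NormalDist

def first_approximation(L):
    """Roots of y(u) = (h / L) * H for f(x) = 2 * phi(x), x >= 0."""
    std_normal = NormalDist()          # mean 0, standard deviation 1
    return [sqrt(2) * std_normal.inv_cdf(0.5 * (1 + h / L)) for h in range(1, L)]

for L in (2, 3, 4, 5):
    print(L, [round(x, 2) for x in first_approximation(L)])
```

Put differently, the square root of a standard normal density is proportional to a normal density with variance 2, so the rule cuts that wider half-normal distribution into L parts of equal probability.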
6. APPLICATION IV: $f(x) = 2(1 - x)$

6.1 The $I_p(u)$-functions
The $I_p(u)$-functions are
\[ \begin{aligned} I_0(u) &= 2u - u^2 \\ I_1(u) &= u^2 - \tfrac{2}{3} u^3 \\ I_2(u) &= \tfrac{2}{3} u^3 - \tfrac{1}{2} u^4 \end{aligned} \tag{43} \]
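For this density Eq. (14) can be solved explicitly. With $y(u) = \frac{2\sqrt{2}}{3}\big[1 - (1 - u)^{3/2}\big]$ and $H = \frac{2\sqrt{2}}{3}$, the first approximations are $x_h' = 1 - (1 - h/L)^{2/3}$. This closed form is derived here for illustration and is not stated in the text. The short Python sketch below evaluates it and compares it with the first-approximation rows tabulated for this density (Table 99); the dictionary name is hypothetical.

```python
def first_approximation(L):
    """Roots of y(u) = (h / L) * H for f(x) = 2 (1 - x) on (0, 1)."""
    return [1 - (1 - h / L) ** (2 / 3) for h in range(1, L)]

# First-approximation rows of Table 99, copied from the paper for comparison.
TABLE_99_FIRST = {
    2: [0.37],
    3: [0.24, 0.52],
    4: [0.17, 0.37, 0.60],
    5: [0.14, 0.29, 0.46, 0.66],
}

for L, tabulated in TABLE_99_FIRST.items():
    computed = first_approximation(L)
    print(L, [round(x, 3) for x in computed], tabulated)
```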
TABLE 99
STRATIFICATION OF $f(x) = 2(1 - x)$ FOR L = 2, ..., 5 STRATA
AND THE ASSOCIATED VARIANCES
(First approximation: $x_h'$; second approximation: $x_h''$)
Number of   Approx-    Points of stratification               Variances
strata      imation    h = 1     h = 2     h = 3     h = 4
2           First      0.37      -         -         -         0.0152
            Second     0.35      -         -         -         0.0152
3           First      0.24      0.52      -         -         0.0069
            Second     0.23      0.50      -         -         0.0069
4           First      0.17      0.37      0.60      -         0.0044
            Second     0.18      0.37      0.62      -         0.0039
5           First      0.14      0.29      0.46      0.66      0.0029
            Second     0.12      0.25      0.40      0.64      0.0023
7. COMPARISONS OF DIFFERENT COMPUTING TECHNIQUES
For the research reported by Dalenius [3, Ch. 7], a very sizable amount of
computation was performed in order to find approximations to MVS. These
computations, performed by a professional computer of outstanding capacity,
were of a "trial-and-error" nature.
By comparison, it is found that the new technique demands only a fraction
of the time used with the old technique, to produce sets $[x_h']$ having variances
substantially equal to the variances computed earlier.
8. COMPARISONS OF DIFFERENT RULES FOR STRATIFICATION
8.1 Comparison with the rule of thumb given in par. 1.2.3
The rule of thumb given in par. 1.2.3 amounts to finding a set $[x_h']$ such that
$W_h \mu_h$ = constant = C for h = 1, ..., L. This condition is equivalent with
\[ \begin{aligned} I_1(x_L') - I_1(x_{L-1}') &= C \\ I_1(x_{L-1}') - I_1(x_{L-2}') &= C \\ &\;\;\vdots \\ I_1(x_1') - I_1(x_0') &= C \end{aligned} \tag{44} \]
where $I_1(x_h') = I_1(u)$ for $u = x_h'$. Adding the L equations we easily get
\[ C = \frac{1}{L} I_1(x_L') = \frac{1}{L} \cdot H \tag{45} \]
as the value of C. From this, we derive the following result
\[ I_1(x_{L-1}') = \frac{L-1}{L} \cdot H, \qquad I_1(x_{L-2}') = \frac{L-2}{L} \cdot H \tag{46} \]
etc., from which it is easy to compute the set $[x_h']$ satisfying this rule of thumb.
Using this set $[x_h']$, the variances may be computed and compared with
those arrived at by the approximate rule presented in this paper.
Such comparisons have been made for $f(x) = e^{-x}$ for L = 2, 3, 4 and 5 strata.
Thus, for e.g. L = 5, we get the set $[x_h']$ by solving the equations
\[ \begin{aligned} 1 - e^{-x} - x e^{-x} &= 0.20 \\ 1 - e^{-x} - x e^{-x} &= 0.40 \\ 1 - e^{-x} - x e^{-x} &= 0.60 \\ 1 - e^{-x} - x e^{-x} &= 0.80 \end{aligned} \tag{47} \]
The solution $x_1', \dots, x_4'$ to these equations is given below together with the
corresponding first approximations derived by means of the technique presented
in this paper.
TABLE 100a
COMPARISON OF POINTS OF STRATIFICATION OF $f(x) = e^{-x}$ FOR L = 2, ..., 5
STRATA AS COMPUTED BY THE RULES GIVEN IN PAR. 1.2.3 AND PAR. 2.2
According to rule given in
Par. 1.2.3    Par. 2.2
0.00          0.00
0.82          0.45
1.38          1.02
2.02          1.83
2.99          3.22
∞             ∞
The corresponding variances are given in the following table.
TABLE 100b
COMPARISON OF VARIANCES IN STRATIFIED SAMPLING FROM $f(x) = e^{-x}$
FOR L = 2, ..., 5 STRATA AS COMPUTED BY THE
RULES GIVEN IN PAR. 1.2.3 AND PAR. 2.2
        Variances when using rule given in
L       Par. 1.2.3    Par. 2.2
2       0.3079        0.2877
3       0.1556        0.1345
4       0.0950        0.0777
5       0.0639        0.0503
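The comparison of par. 8.1 can be retraced as follows. The Python sketch solves equations of the form (47) by bisection, takes the first approximations of par. 2.2 from $x_h' = -2\ln(1 - h/L)$ (the explicit root of Eq. (14) for this density), and evaluates $(\sum_h W_h \sigma_h)^2$ for both sets of points, using $W_h \sigma_h = \sqrt{J_{0h} J_{2h} - J_{1h}^2}$ from Eq. (21). Function names and the bisection bracket are assumptions of the sketch.

```python
import math

def I(p, u):
    """I_p(u) of Eq. (36) for f(x) = e^{-x}; u may be math.inf."""
    if math.isinf(u):
        return (1.0, 1.0, 2.0)[p]
    e = math.exp(-u)
    return (1 - e, 1 - e - u * e, 2 - e * (u * u + 2 * u + 2))[p]

def solve_increasing(g, target, lo=0.0, hi=50.0):
    """Bisection for g(u) = target, with g increasing on (lo, hi)."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if g(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def rule_of_thumb_points(L):
    """Rule (11b): I_1(x_h) = h / L, since I_1(infinity) = 1 for this density."""
    return [solve_increasing(lambda u: I(1, u), h / L) for h in range(1, L)]

def cum_sqrt_f_points(L):
    """Par. 2.2: 2 (1 - e^{-x/2}) = 2 h / L, i.e. x_h = -2 ln(1 - h / L)."""
    return [-2 * math.log(1 - h / L) for h in range(1, L)]

def variance(points):
    """(sum_h W_h sigma_h)^2 = (sum_h sqrt(J0 J2 - J1^2))^2 for interior cut points."""
    cuts = [0.0] + list(points) + [math.inf]
    total = 0.0
    for a, b in zip(cuts[:-1], cuts[1:]):
        J0, J1, J2 = (I(p, b) - I(p, a) for p in range(3))
        total += math.sqrt(J0 * J2 - J1 * J1)
    return total ** 2

for L in (2, 3, 4, 5):
    a, b = rule_of_thumb_points(L), cum_sqrt_f_points(L)
    print(L, [round(x, 2) for x in a], round(variance(a), 4),
          [round(x, 2) for x in b], round(variance(b), 4))
```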
8.2 Comparisons with equidistant stratification
Equidistant stratification is applicable only to a density with a finite range.
In order to throw some light on its efficiency, it has been applied to
$f(x) = 2(1 - x)$; the resulting variances are compared in Table 101 with those resulting
from using the first approximations derived by means of the technique given
in par. 2.2.
TABLE 101
COMPARISON OF VARIANCES IN STRATIFIED SAMPLING FROM
$f(x) = 2(1 - x)$ FOR L = 2, ..., 5 STRATA AS COMPUTED BY
THE RULES GIVEN IN PAR. 1.2.4 AND PAR. 2.2
        Variances when using rule given in
L       Par. 1.2.4    Par. 2.2
2       0.0181        0.0152
3       0.0091        0.0069
4       0.0046        0.0044
5       0.0038        0.0029
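A comparison of this kind can be recomputed from Eq. (43). The Python sketch below evaluates $(\sum_h W_h \sigma_h)^2$ for equidistant points h/L and for the points $1 - (1 - h/L)^{2/3}$, which solve Eq. (14) for this density; that closed form and the function names are assumptions of the sketch. Being computed without intermediate rounding, the results may differ from the tabulated ones in the last digits.

```python
import math

def I(p, u):
    """I_p(u) of Eq. (43) for f(x) = 2 (1 - x), 0 <= u <= 1."""
    return (2 * u - u ** 2,
            u ** 2 - 2 * u ** 3 / 3,
            2 * u ** 3 / 3 - u ** 4 / 2)[p]

def variance(cuts):
    """(sum_h W_h sigma_h)^2, using W_h sigma_h = sqrt(J0 J2 - J1^2) from Eq. (21)."""
    total = 0.0
    for a, b in zip(cuts[:-1], cuts[1:]):
        J0, J1, J2 = (I(p, b) - I(p, a) for p in range(3))
        total += math.sqrt(J0 * J2 - J1 * J1)
    return total ** 2

def equidistant(L):
    """Rule (12): all L strata have the same width on (0, 1)."""
    return [h / L for h in range(L + 1)]

def cum_sqrt_f(L):
    """Par. 2.2: equal increments of y(u), i.e. x_h = 1 - (1 - h / L)^(2/3)."""
    return [1 - (1 - h / L) ** (2 / 3) for h in range(L + 1)]

for L in (2, 3, 4, 5):
    print(L, round(variance(equidistant(L)), 4), round(variance(cum_sqrt_f(L)), 4))
```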
9. ACKNOWLEDGMENTS
The authors are indebted to the referees for valuable comments on a previous
version of this paper.
REFERENCES
[1] Aoyama, H., "A study of the stratified random sampling." Annals of Mathematical
Statistics, Vol. VI, No. 1 (1954) pp. 1-36.
[2] Dalenius, T., "The problem of optimum stratification." Skandinavisk
Aktuarietidskrift, 3-4 (1950) pp. 203-13.
[3] Dalenius, T., Sampling in Sweden. Contributions to the Methods and Theories of Sample
Survey Practice. Stockholm: Almqvist och Wiksell, 1957.
[4] Dalenius, T. and Gurney, M., "The problem of optimum stratification II."
Skandinavisk Aktuarietidskrift, 3-4 (1951) pp. 133-48.
[5] Dalenius, T. and Hodges, J. L., Jr., "The choice of stratification points." Skandinavisk
Aktuarietidskrift, 3-4 (1957) pp. 198-203.
[6] Hansen, M. H., Hurwitz, W. G. and Madow, W. G., Sample Survey Methods and
Theory, Vol. 1. New York: John Wiley & Sons, Inc., 1953.
[7] Kitagawa, T., "Some contributions to the design of sample survey." Sankhyā, Vol. 17,
Part 1 (1956) pp. 1-36.
[8] Mahalanobis, P. C., "Some aspects of the design of sample surveys." Sankhyā, Vol.
12, Parts 1 and 2 (1952) pp. 1-7.
[9] Strecker, H., Moderne Methoden in der Agrarstatistik. Würzburg: Physica-Verlag,
1957.
[10] Zindler, H.-J., "Über Faustregeln zur optimalen Schichtung bei Normalverteilung."
Allgemeines Statistisches Archiv, 40. Band (1956) pp. 168-73.
