Professional Documents
Culture Documents
Abstract
In this paper we focus on investigating the precision of several variance
estimators considered by different authors, such as those suggested by
Isaki (Variance estimation using auxiliary information, J. Amer. Stat.
Assoc. 78, 117–123, 1983), and Kadilar and Cingi (Ratio estimators
for population variance in simple and stratified sampling, Appl. Math.
Comp. 173, 1047–1058 and Improvement in variance estimation using
auxiliary information, Hacet. J. Math. Stat. 35 (1), 111–115, 2006).
We propose a new hybrid class of estimators and show that in some
cases their efficiency is better than the aforementioned traditional ratio
and regression estimators of Isaki, and Kadilar and Cingi. Four nu-
merical examples are considered to further evaluate the performance of
these estimators.
1. Introduction
Consider a finite population U = {U1 , U2 , . . . , Ui , . . . , UN } consisting of N units. Let
y and x be the study and the auxiliary variables with population means Y and X re-
spectively. Let there be a sample of size n drawn from thisPpopulation using simple
random sampling without replacement (SRSWOR). Let s2y = n (y − ȳ)2 /(n − 1) and
Pn PN i
i=1
2 2 2
sx = i=1 (xi − x̄) /(n − 1) be the sample variances and Sy = i=1 (yi − Y )2 /(N − 1)
P
and Sx2 = N 2
i=1 (xi − X) /(N − 1) the population variances of y and x respectively. Let
Cy = Sy /Y , and Cx = Sx /X be the coefficients of variation of y and x respectively, and
∗
Department of Mathematics and Statistics, University of North Carolina at Greensboro,
Greensboro, NC 27412, USA. E-mail sngupta@uncg.edu
†
Department of Statistics, Quaid-i-Azam University, Islamabad 45320, Pakistan.
E-mail jsqau@yahoo.com
58 S. Gupta, J. Shabbir
ρyx the coefficient of correlation between y and x. We assume that all parameters of x
are known.
Let δ0 = (s2y − Sy2 )/Sy2 and δ1 = (s2x − Sx2 )/Sx2 be such that E(δi ) = 0, (i = 0, 1). Also,
to the first order of approximation, we have E(δ02 ) = γ(β2(y) − 1), E(δ12 ) = γ(β2(x) − 1)
r/2 s/2 P
and E(δ0 δ1 ) = γ(λ22 − 1), where γ = n1 − N1 , λrs = µrs /(µ20 µ02 ), µrs = N i=1 (yi −
Y )r (xi − X)s /N , and β2(y) = µ40 /µ220 , β2(x) = µ04 /µ202 are the coefficients of kurtosis of
y and x respectively.
Recall that the variance of the usual variance estimator Sby2 (= s2y ) is given by
where b∗ is the sample regression coefficient between s2y and s2x . The corresponding
2
Sy (λ22 −1)
population regression coefficient is given by β = Sx2 (β (see Garcia and Cebrian [3]).
2(x) −1)
Substituting the optimum values of d1 and d2 in (19), we get the optimum MSE of
b 2(α)
SPr in the form
!
(α) 2(α) 2(α) (α) (α) (α)
b 2(α) ∼ 4 A1 A5 + A2 A4 − 2A3 A4 A5
(20) MSE(SPr )opt = Sy 1 − (α) 2(α)
.
A1 A2 − A3
Variance Estimation Using Auxiliary Information 61
where
(0)
A1 = 1 + γ(β2(y) − 1),
A2 = γ(β2(x) − 1),
(0)
A3 = −γ(λ22 − 1),
(0)
A4 = 1,
(0)
A5 = 0.
(ii) For α = 1, the second part of the proposed estimator becomes a transformed
product estimator. When we put α = 1 in (14), (19), (20) and (21), we get
2
2(1) sx
(25) SbPr = d1 s2y + d2 Sx2 − s2x 2− ,
Sx2
2(1)
The optimum MSE of SbPr and the corresponding bias are given by
(26) Bias(SbPr ) ∼
2(1) 2 (1)
2 2 (1) (1)
= (d1 −1)Sy −d1 γSy (λ22 −1)+d2 Sx γ(β2(x) −1)
62 S. Gupta, J. Shabbir
and
!
(1) 2(1) 2(1) (1) (1) (1)
A1 A5 + A2 A4 − 2A3 A4 A5
MSE(SbPr )opt ∼
2(1) 4
= Sy 1− (1) 2(1)
,
A1 A2 − A3
or
(β2(x) − 1)U1 Sy4 γU1∗
(27) 2(1)
MSE(Sbpr )opt ∼ S
= y
4
1 − = ,
U2 U2
where
U1 = 1 − 3γ(β2(x) − 1) + γ 2 (β2(y) − 1)(β2(x) − 1) + γ 2 (β2(x) − 1)2
− γ 2 (λ22 − 1)2 ,
U2 = (β2(x) − 1) + γ(β2(y) − 1)(β2(x) − 1) − 3γ(β2(x) − 1)2
− γ(λ22 − 1)2 ,
U1∗ = (β2(y) − 1)(β2(x) − 1) − (λ22 − 1)2 − γ(β2(y) − 1)(β2(x) − 1)2
− γ(β2(x) − 1)3 + γ(β2(x) − 1)(λ22 − 1)2 .
The optimum values of d1 and d2 are
( )
(1) (1) (1) (1) (1) (1) (1)
(1) A2 A4 − A3 A5 (1) Sy2 A1 A5 − A3 A4
d1 = and d2 = ,
(1)
A1 A2 − A3
2(1) Sx2 (1)
A1 A2 − A3
2(1)
where
(1)
A1 = 1 + γ (β2y − 1) + (β2(x) − 1) − 4(λ22 − 1)
A2 = γ(β2(x) − 1),
(1)
A3 = γ 2(β2(x) − 1) − (λ22 − 1) ,
(1)
A4 = 1 − γ(λ22 − 1),
(1)
A5 = γ(β2(x) − 1).
(iii) For α = −1, the second part of the proposed estimator becomes a transformed
ratio estimator. When we put α = −1 in (14), (19), (20) and (21), we get
2
2(−1) Sx
(28) SbPr = d1 s2y + d2 Sx2 − s2x 2− ,
s2x
2(1)
The optimum MSE of Sb Pr and the corresponding bias are given by
(29)
2(−1) 2
)∼
(1) 2 (1)
Bias(SbPr = (d1 − 1)Sy + d1 γSy (λ22 − 1) − (β2(x) − 1
(−1)
− d2 Sx2 γ(β2(x) − 1)
and
!
(−1) 2(−1) 2(−1) (−1) (−1) (−1)
A1 A5 + A2 A4 − 2A3 A4 A5
MSE(SbPr )opt ∼
2(−1) 4
= Sy 1− (−1) 2(−1)
,
A1 A2 − A3
or
b2(−1) ∼ 4 (β2(x) − 1)U3 Sy4 γU3∗
(30) MSE(Spr )opt = Sy 1 − = ,
U4 U4
where
Variance Estimation Using Auxiliary Information 63
Condition (ii):
2(0) 2
MSE(SbPr )opt < MSE(SbR ) if
Var(Sby2 )(1 − ρ2(s2
2 )
2 y, sx )
MSE(SbR )− > 0, or
1+A
MSE(SbR
2
)A + MSE(SbR
2
) − Var(Sby2 )(1 − ρ2(s2 , s2 )
y x)
> 0, or
(1 + A)
n 2
o
(λ22 −1)
MSE(SbR
2
)A + γSy4 (β2(x) − 1) − 2(λ22 − 1) + (β2(x) −1)
> 0, or
(1 + A)
2
b2 4 (λ22 −1)
MSE(SR )A + γSy (β2(x) − 1) 1 − (β2(x) −1)
> 0.
(1 + A)
64 S. Gupta, J. Shabbir
Condition (iii):
2(0) 2
MSE(SbPr )opt < Var(SbReg ) if
Var(Sby2 )(1 − ρ2(s2 2 )
2 y, sx )
MSE(SbReg )− > 0, or
1+A
V ar(SbReg
2
)A
> 0,
(1 + A)
using (6).
Conditions (iv-vii):
2(0) 2
MSE(SbPr )opt < MSE(SbKCi ) (i = 1, 2, 3, 4) if
Var(Sby2 )(1 − ρ2(s2 2 )
2 y, sx )
MSE(SbKCi )− > 0 or
1+A
MSE(SbKCi
2
)A + MSE(SbKCi
2
) − Var(Sby2 )(1 − ρ2(s2 ,s2 ) )
y x
> 0, or
(1 + A)
n o
(λ22 −1)2
MSE(SbKCi
2
)A + γSy4 (β2(x) − 1)ψi2 − 2(λ22 − 1)ψi + (β2(x) −1)
> 0, or
(1 + A)
2
(λ22 −1)
MSE(SbKCi )A + γSy (β2(x) − 1) ψi −
2 4
(β2(x) −1)
> 0.
(1 + A)
The conditions (i)-(vii) above are always true.
Condition (viii):
2(0) 2
MSE(SbPr )opt < MSE(SbKC )opt if
Var(Sby2 )(1 − ρ2(s2, s2 ) )
2
MSE(SbKC x y
)opt − > 0, or
1+A
MSE(SbKC
2
)opt A + MSE(SbKC
2
)opt − Var(Sby2 )(1 − ρ2(s2 ,s2 ) )
y x
> 0, or
(1 + A)
MSE(SbKC
2
)opt A + γSy4 B
> 0,
(1 + A)
where B = (β2(y) − 1)(z 2 + ρ2(s2 ,s2 ) − 1) + α2∗2 τ 2 (β2(x) − 1) − 2zτ α2∗ (λ22 − 1).
y x
Obviously the proposed estimator will always perform better than Sby2 , SbR 2 b
, SKCi (i =
1, 2, 3, 4) and SbReg since Conditions (i)-(vii) always hold. It will also perform better than
2
SbKC if Condition (viii) holds, which indeed may be the case in some situations as shown
below for certain data sets that have been used in the literature.
Case 2: α = 1, −1.
2(1) 2(−1)
The proposed estimators SbP r and SbP r are only conditionally better then the other
2(1) 2(−1)
estimators considered here. This will be so if the optimum MSEs of SbP r and SbP r are
smaller than the MSEs of the estimators Sbi i = y, R, KCi (i = 1, 2, 3, 4), KC and Reg.
2
The data consist of 104 villages in the East Anatolia Region of Turkey in 1999. The
variates are defined as:
y is the level of apple production (in 100 tones)
and
x is the number of apple trees.
For this data, we have
N = 104, n = 20, γ = 0.04038, Y = 6.254, X = 13931.683,
Sy = 11.67, Sx = 23029.072, ρyx = 0.865, ρ(s2y , s2
x)
= 0.836757,
β2(y) = 16.523, β2(x) = 17.516, λ22 = 14.398, τ = 0.99766,
(0) (0) (1)
α1∗ = 0.18777, d1 = 0.84179, d2 = 0, d1 = 0.41072,
(1) (−1) (−1)
d2 = 0, d1 = 0.46619, d2 = 0.
and
x is the number of apple trees (1 unit = 100 trees).
For this data, we have:
N = 106, n = 20, γ = 0.04056, Y = 15.37, X = 243.76,
Sy = 64.25, Sx = 491.89, ρyx = 0.82, ρ(s2y , s2
x)
= 0.730458,
β2(y) = 80.13, β2(x) = 25.71, λ22 = 33.30, τ = 0.974196,
(0) (0) (1)
α1∗ = −0.17269, d1 = 0.40048, d2 = 0.00893, d1 = 1.97072,
(1) (−1) (−1)
d2 = −0.00623, d1 = 0.79816, d2 = 0.02797.
We obtain the Percent Relative Efficiency (PRE) of the various estimators as compared
to Sby2 . These are given in Table 1.
References
[1] Cochran, W. G. Sampling Techniques, 3rd edn. (Wiley & Sons, 1977).
[2] Das, A. K. Contribution to the theory of sampling strategies based on auxiliarly information
(Ph.D. thesis submitted to Bidhan Chandra Krishi Vishwavidyalaya, Mohanpur, Nadia, west
Bengal, India, 1988).
[3] Garcia, M. R. and Cebrian, A. A. Repeated substitution method: the ratio estimator for the
population variance, Metrika. 43, 101–105, 1996.
[4] Isaki, C. T. Variance estimation using auxiliary information, Journal of American Statistical
Association 78, 117–123, 1983.
[5] Kadilar, C. and Cingi, H. Ratio estimators for population variance in simple and stratified
sampling, Applied Mathematics and Computation 173, 1047–1058, 2006.
[6] Kadilar, C. and Cingi, H. Improvement in variance estimation using auxiliary information,
Hacettepe Journal of Mathematics and Statistics 35 (1), 111–115, 2006.
[7] Kaur, P. An efficient regression type estimator in survey sampling, Biom. Journal 27 (1),
107–110, 1985.
[8] Sahahi, A. and Ray, S. K. An efficient estimator using auxiliary information, Metrika 27,
271–275, 1980.
[9] Shabbir, J. and Yaab, Z. Improvement over transformed auxiliary variables in estimating
the finite population mean, Biometrical Journal 45 (6), 723–729, 2003.
[10] Upadhyaya, L. N. and Singh, H. P. Almost unbiased ratio and product-type estimators of
finite population variance in sample surveys, Statistics in Transition 7 (5), 1087–1096, 2006.