Professional Documents
Culture Documents
Florentin Smarandache
University of New Mexico, Gallup, USA
Published in:
Rajesh Singh, Florentin Smarandache (Editors)
ON IMPROVEMENT IN ESTIMATING POPULATION PARAMETER(S)
USING AUXILIARY INFORMATION
Educational Publishing (Columbus) & Journal of Matter Regularity (Beijing),
USA - China, 2013
ISBN: 978-1-59973-230-5
pp. 17 - 24
Abstract
In this paper, we have studied the problem of estimating the finite population mean
when information on two auxiliary attributes are available. Some improved estimators in
simple random sampling without replacement have been suggested and their properties are
studied. The expressions of mean squared error’s (MSE’s) up to the first order of
approximation are derived. An empirical study is carried out to judge the best estimator out of
the suggested estimators.
Key words: Simple random sampling, auxiliary attribute, point bi-serial correlation, phi
correlation, efficiency.
Introduction
17
In this paper, we assume that both auxiliary attributes have significant point bi-serial
correlation with the study variable and there is significant phi-correlation (see Yule (1912))
between the auxiliary attributes.
Consider a sample of size n drawn by simple random sampling without replacement
(SRSWOR) from a population of size N. let yj, φ ij (i=1,2) denote the observations on variable
y and φ i (i=1,2) respectively for the jth unit (i=1,2,3,……N) . We note that φ ij =1, if jth unit
N n
possesses attribute φ ij =0 otherwise . Let A i = φ ij , a i = φ ij ; i=1,2 denotes the total
j=1 j=1
number of units in the population and sample respectively, possessing attribute φ . Similarly,
Ai a
let Pi = and p i = i ;(i=1,2 ) denotes the proportion of units in the population and
N n
P
t 1 = y 1
p1 (1.1)
p
t 2 = y 2
P2 (1.2)
P −p
t 3 = y exp 1
1
P +p
1 1 (1.3)
p − P2
t 4 = y exp 2
p 2 + P2 (1.4)
The bias and MSE expression’s of the estimator’s t i (i=1, 2, 3, 4) up to the first order of
approximation are, respectively, given by
[
B(t 1 ) = Yf1C 2p1 1 − K pb1 ]
(1.5)
B(t 2 ) = Yf 1 K pb 2
2
C
p2
(1.6)
18
C 2p1 1
B(t 3 ) = Yf 1 − K pb1
2 4 (1.7)
C 2p 2 1
B(t 4 ) = Yf1 + K pb 2
2 4 (1.8)
[ (
MSE (t 1 ) = Y f1 C 2y + C 2p1 1 − 2K pb1
2
)]
(1.9)
[ (
MSE (t 2 ) = Y f1 C 2y + C 2p1 1 + 2K pb2
2
)] (1.10)
2 1
MSE (t 3 ) = Y f1 C 2y + C 2p1 − K pb1
4 (1.11)
2 1
MSE (t 4 ) = Y f1 C 2y + C 2p2 + K pb2
4 (1.12)
where, f1 =
1 1
-
n N
, Sφ2 j =
1 N
(φ ji − Pj ) ,
N − 1 i=1
S yφ j =
1 N
N − 1 i=1
( )
yi − Y (φ ji − Pj ),
S yφ j Sy Sφ j Cy Cy
ρ pb j = , Cy = , Cp j = ; ( j = 1,2), K pb1 = ρ pb1 , K pb2 = ρ pb2 .
S y Sφ j Y Pj C p1 C p2
1 n s
s φ1φ2 =
n − 1 i =1
(φ1i − p1 )(φ 2i − p 2 ) and ρ φ = φ1φ2 be the sample phi-covariance and phi-
s φ1 s φ2
1 N S
and phi-correlation Sφ1φ2 =
N − 1 i =1
(φ1i − P1 )(φ 2i − P2 ) and ρφ = φ1φ2 .
Sφ1 Sφ2
In this paper we have proposed some improved estimators of population mean using
information on two auxiliary attributes in simple random sampling without replacement. A
comparative study is also carried out to compare the optimum estimators with respect to usual
mean estimator with the help of numerical data.
19
2. Proposed Estimators
Following Olkin (1958), we propose an estimator t 1 as
P P
t 5 = y w 1 1 + w 2 2
p1 p2 (2.1)
where w 1 and w 2 are constants, such that w 1 + w 2 = 1.
Consider another estimator t6 as
[ ] P − p2
t 6 = K 61 y + K 62 (P1 − p1 ) exp 2
P2 + p 2 (2.2)
where K 61 and K 62 are constants.
Following Shaoo et al. (1993), we propose another estimator t7 as
t 7 = y + K 71 (P1 − p1 ) + K 72 (P2 − p 2 )
(2.3)
where K 71 and K 72 are constants.
To obtain the bias and MSE expressions of the estimators t i (i = 5,6,7) to the first
degree of approximation, we define
y−Y p1 − P1 p − P2
e0 = , e1 = , e2 = 2
Y P1 P2
Also,
Cy Cy C p1
K pb1 = ρ pb1 , K pb2 = ρ pb2 , K φ = ρφ
C p1 C p2 Cp 2
P1 P2
t 5 = Y(1 + e 0 ) w 1 + w2
P1 (1 + e1 ) P2 (1 + e 2 )
20
[
t 5 = Y(1 + e 0 ) w 1 (1 + e1 ) + w 2 (1 + e 2 )
−1 −1
] (3.1)
Expanding the right hand side of (3.1) and retaining terms up to second degrees of e’s, we have,
[
t 5 = Y 1 + e 0 − w 1e1 − w 2 e 2 + w1e12 + w 2 e 22 − w1e 0 e1 − w 2 e 0 e 2 ] (3.2)
Taking expectations of both sides of (3.1) and then subtracting Y from both sides, we get the bias of
estimator t 5 upto the first order of approximation as
[ (
Bias( t 5 ) = Yf 1 w 1C 2p1 1 − K pb1 + w 2 C 2p 2 1 − K pb 2 ) ( )] (3.3)
(t 5 )
− Y ≅ Y[e 0 − w 1e1 − w 2 e 2 ]
(3.4)
Squaring both sides of (3.4) and then taking expectations, we get the MSE of t5 up to the first order of
approximation as
2
[
MSE ( t 5 ) = Y f 1 C 2y + w 12 C 2p1 + w 22 C 2p 2 − 2 w 1 K pb1 C 2p1 − 2 w 2 K pb 2 C 2pb 2 + 2 w 1 w 2 K φ C 2p 2 ] (3.5)
Minimization of (3.5) with respect to w1 and w2, we get the optimum values of w1 and w2 , as
=
[
C 2p1 1 − K pb1 ]=w *
(say )
2 2 2
C − Kφ C
p1 p2
Similarly, we get the bias and MSE expressions of estimator t6 and t7 respectively, as
3 1 1
Bias( t 6 ) = K 61 Y 1 + f1C 2p 2 − K pb 2 + K 22 P1f1 K φ C 2p 2 (3.6)
8 2 2
Bias( t 7 ) = 0 (3.7)
And
21
P12 A 2 − 2 K 61 K 62 P1 YA 3 + (1 − 2 K 61 )Y (3.8)
2 22 2
MSE( t 6 ) = K 61 Y A1 + K 62
1
where A1 = 1 + f1 C 2y + C 2p2 − K pb2
4
2
A 2 = f1C p1
1
A 3 = f1 k pb1 C 2p1 − K φ C 2p
2
A2
K 61( opt ) = = K *61 (say )
A1A 2 − A 32
YA 3
K 62 ( opt ) = = K *62 (say )
P1 (A1A 2 − A 32 )
2
MSE ( t 7 ) = Y f1C 2y + K 71
2
P12 f 1C 2p1 + K 72
2
P22 f1C 2p 2 − 2K 71 P1 Yf1 K pb1 C 2p1 − 2K 72 P2 Yf 1 K pb 2 C 2p 2
+ 2K 71 K 72 P1 P2 f 1 K φ C 2p 2 (3.9)
Y K pb1 C p1 − K pb2 K φ C p2
2 2
4. Empirical Study
The population consists rice cultivation areas in 73 districts of Pakistan. The variables are defined as:
Y= rice production (in 000’ tonnes, with one tonne = 0.984 ton) during 2003,
P1 = production of farms where rice production is more than 20 tonnes during the year 2002, and
P2 = proportion of farms with rice cultivation area more than 20 hectares during the year 2003.
2 2 2
N=73, Y =61.3, P1 =0.4247, P2 =0.3425, S y =12371.4, S φ1 =0.225490, Sφ2 =0.228311,
22
The percent relative efficiency (PRE’s) of the estimators ti (i=1,2,…7) with respect to unusual
Estimator PRE
y 100.00
t1 162.7652
t2 48.7874
t3 131.5899
t4 60.2812
t5 165.8780
t6 197.7008
t7 183.2372
Conclusion
In this paper we have proposed some improved estimators of population mean using
information on two auxiliary attributes in simple random sampling without replacement. From the
Table 4.1 we observe that the estimator t6 is the best followed by the estimator t7 .
References
Government of Pakistan, 2004, Crops Area Production by Districts (Ministry of Food, Agriculture
and Livestock Division, Economic Wing, Pakistan).
23
Malik, S. and Singh, R. (2013): A family of estimators of population mean using information on
point bi-serial and phi correlation coefficient. IJSE
Naik, V. D and Gupta, P.C.(1996): A note on estimation of mean with known population proportion
of an auxiliary character. Jour. Ind. Soc. Agri. Stat., 48(2), 151-158.
Olkin, I. (1958): Multivariate ratio estimation for finite populations, Biometrika, 45, 154–165.
Singh, R., Chauhan, P., Sawan, N. and Smarandache, F.( 2007): Auxiliary information and a priori
values in construction of improved estimators. Renaissance High press.
Singh, R., Chauhan, P., Sawan, N. and Smarandache, F. (2008): Ratio Estimators in Simple Random
Sampling Using Information on Auxiliary Attribute. Pak. Jour. Stat. Oper. Res. Vol. IV, No.1, pp. 47-
53.
Singh, R., Kumar, M. and Smarandache, F. (2010): Ratio estimators in simple random sampling
when study variable is an attribute. WASJ 11(5): 586-589.
Yule, G. U. (1912): On the methods of measuring association between two attributes. Jour. of the
Royal Soc. 75, 579-642.
24