Professional Documents
Culture Documents
Distributions
Chapter 7
Unbiasedness expected = true
Bias= = the difference between
the expected value of the estimator and the true
value in the population.
Efficiency - Smallest Mean Squared Error
How well the estimator does in predicting.
We want the estimator that has the smallest
squared error around the true value
Properties of Estimators that We
Desire
u = u)
( E
u u)
( E
Efficiency is variance + squared bias
2 2
2 2
2
2
2
E 2
E . E . S . M
u u + u u =
u u u u + u u + u u =
u u + u u =
u u + u u =
u u =
Squared Bias
Variance
This is always
zero
Efficiency (cont) - Among UNBIASED estimators
therefore, we want the one with the smallest
variance
Consistency
As sample size increases, variation of the
estimator from the true population value
decreases
Properties of Estimators that We
Desire
Unbiasedness
Biased Unbiased
P(X)
X
Efficiency
Sampling
Distribution
of Median
Sampling
Distribution of
Mean
X
P(X)
Larger
sample size
Smaller
sample size
Consistency
X
P(X)
A
B
Estimation
Sample Statistic Estimates Population Parameter
e.g. X = 50 estimates Population Mean,
Problems: Many samples provide many estimates of the
Population Parameter.
Determining adequate sample size: large sample give better
estimates. Large samples more costly.
How good is the estimate?
Approach to Solution: Theoretical Basis is Sampling
Distribution.
_
Sampling Distributions
Sampling
Distributions
Sampling
Distributions
of the
Mean
Sampling
Distributions
of the
Proportion
Sampling Distributions
A sampling distribution is a
distribution of all of the
possible values of a statistic
for a given size sample
selected from a population
Developing a
Sampling Distribution
Assume there is a population
Population size N=4
Random variable, X,
is age of individuals
Values of X: 18, 20,
22, 24 (years)
A
B
C
D
.3
.2
.1
0
18 20 22 24
A B C D
Uniform Distribution
P(x)
x
(continued)
Summary Measures for the Population Distribution:
Developing a
Sampling Distribution
21
4
24 22 20 18
N
X
i
=
+ + +
=
=
2.236
N
) (X
2
i
=
=
1
st
2
nd
Observation
Obs 18 20 22 24
18 18,18 18,20 18,22 18,24
20 20,18 20,20 20,22 20,24
22 22,18 22,20 22,22 22,24
24 24,18 24,20 24,22 24,24
16 possible samples
(sampling with
replacement)
Now consider all possible samples of size
n=2
1st 2nd Observation
Obs 18 20 22 24
18 18 19 20 21
20 19 20 21 22
22 20 21 22 23
24 21 22 23 24
(continued)
Developing a
Sampling Distribution
16 Sample
Means
1st 2nd Observation
Obs 18 20 22 24
18 18 19 20 21
20 19 20 21 22
22 20 21 22 23
24 21 22 23 24
Sampling Distribution of All Sample
Means
18 19 20 21 22 23 24
0
.1
.2
.3
P(X)
X
Sample Means
Distribution
16 Sample Means
_
Developing a
Sampling Distribution
(continued)
(no longer uniform)
_
Summary Measures of this Sampling
Distribution:
Developing a
Sampling Distribution
(continued)
21
16
24 21 19 18
N
X
i
X
=
+ + + +
= =
1.58
16
21) - (24 21) - (19 21) - (18
N
) X (
2 2 2
2
X
i
X
=
+ + +
=
X
=
If the Population is Normal
If a population is normal with mean and
standard deviation , the sampling
distribution of is also normally
distributed with
and
(This assumes that sampling is with replacement or
sampling is without replacement from an infinite
population)
X
X
=
n
X
=
Z-value for Sampling
Distribution
of the Mean
Z-value for the sampling distribution of
:
where: = sample mean
= population mean
= population standard deviation
n = sample size
X
) X (
) X (
Z
X
X
=
=
X
Normal Population
Distribution
Normal Sampling
Distribution
(has the same mean)
Sampling Distribution
Properties
(i.e. is unbiased )
x
x
x
x
=
x
=
n
Central Limit Theorem
As the
sample
size gets
large
enough
the sampling
distribution
becomes
almost normal
regardless of
shape of
population
x
Population Distribution
Sampling Distribution
(becomes normal as n increases)
Central Tendency
Variation
(Sampling with
replacement)
x
x
Larger
sample
size
Smaller
sample size
If the Population is not
Normal
(continued)
Sampling distribution
properties:
x
=
n
x
=
x
0.5
36
3
n
x
= = =
Example
Solution
(continued):
(continued)
0.3830 0.5) Z P(-0.5
36
3
8 - 8.2
n
-
36
3
8 - 7.8
P 8.2) P(7.8
X
X
= < < =
|
|
|
.
|
\
|
< < = < <
Z
7.8 8.2
-0.5 0.5
Sampling
Distribution
Standard Normal
Distribution
.1915
+.1915
Population
Distribution
?
?
?
?
?
?
? ?
?
?
?
?
Sample Standardize
8 =
8
X
=
0
z
=
x
X
Sampling Distributions
of the Proportion
Sampling
Distributions
Sampling
Distributions
of the
Mean
Sampling
Distributions
of the
Proportion
Population Proportions, p
p = the proportion of the population
having
some characteristic
Sample proportion ( p
s
) provides an estimate
of p:
0 p
s
1
p
s
has a binomial distribution
(assuming sampling with replacement from a finite
population or without replacement from an infinite
population)
size sample
interest of stic characteri the having sample the in items of number
n
X
p
s
= =
Sampling Distribution of p
Approximated by a
normal distribution if:
where
and
(where p = population proportion)
Sampling Distribution
P( p
s
)
.3
.2
.1
0
0 . 2 .4 .6 8 1 p
s
p
s
p
=
n
p) p(1
s
p
=
5 p) n(1
5 np
and
>
>
Z-Value for Proportions
If sampling is without
replacement and n is
greater than 5% of the
population size, then
must use the finite
population correction
factor:
1 N
n N
n
p) p(1
s
p
=
n
p) p(1
p p
p p
Z
s
p
s
s
=
Standardize p
s
to a Z value with the formula:
p
Example
If the true proportion of voters who
support Proposition A is p = .4, what is
the probability that a sample of size 200
yields a sample proportion between .40
and .45?
i.e.: if p = .4 and n = 200, what is
P(.40 p
s
.45) ?
Example
if p = .4 and n = 200, what is
P(.40 p
s
.45) ?
(continued)
.03464
200
.4) .4(1
n
p) p(1
s
p
=
=
1.44) Z P(0
.03464
.40 .45
Z
.03464
.40 .40
P .45) p P(.40
s
s s =
|
.
|
\
|
s s
= s s
Find :
Convert to
standard
normal:
s
p
Example
Z
.45 1.44
.4251
Standardize
Sampling Distribution
Standardized
Normal Distribution
if p = .4 and n = 200, what is
P(.40 p
s
.45) ?
(continued)
Use standard normal table: P(0 Z 1.44) = .4251
.40 0
p
s