Professional Documents
Culture Documents
Statistics
1 / 64
Sampling Distribution of the Sample Mean
For
Pn example, the sample mean, X̄, denotes the specific formula,
i=1 Xi /n, and it is used to estimate E(X) = µ. So,it is an
estimator of µ.
2 / 64
Sampling Distribution of the Sample Mean
3 / 64
Sampling distribution of the sample mean
X̄ ∼ N (µ, σ 2 /n).
1
X̄ = (X1 + X2 + · · · + Xn )
n
4 / 64
Sampling distribution of the sample mean
E(X) = n1 E (X1 + X2 + . . . + Xn ) = n1 nµ = µ
1 1 σ2
Var(X̄) = n2
Var (X1 + X2 + . . . + Xn ) = n2
nσ 2 = n
X−µ X−µ
p = √ = Z ∼ N(0, 1)
2
σ /n σ/ n
5 / 64
Example
Example:
Assume that annual percentage revenue increases for all Australian
companies are normally distributed with mean 12.2% and standard
deviation 3.6%. A random sample of 9 observations from this
population is taken. What is the probability that the sample mean
will be less than 10%?
X ∼ N(12.2, 3.62 ).
6 / 64
Example cont.
7 / 64
Notes
8 / 64
Central Limit Theorem (CLT)
9 / 64
Central Limit Theorem (CLT)
X−µ
√
σ/ n
= Z ∼ N(0, 1)
10 / 64
How large does n need to be?
11 / 64
Example 1
Example
Consider the marks of all students who did an economics test. If
marks are normally distributed, with mean=72 and standard
deviation=9.
X ∼ N (µ = 72, σ 2 = 92 )
12 / 64
Example 1
a) The probability that any one student will have a mark over 78.
X−µ 78−72
− Want to find P(X>78). P (X > 78) = P σ > 9
6
=P Z> 9
= P (Z > 0.67)
= 1 − P (Z ≤ 0.67)
= 0.2514
13 / 64
Example 1
− X ∼ N (µ = 72, σ 2 = 92 )
− By CLT,
σ2 92
X̄ ∼ N (µ = 72, = )
n 10
→ P(X̄>78)
X−µ 78−72
P(X > 78) = P √
σ/ n
> √
9/ 10
= P(Z > 2.11)
= 1 − P(Z < 2.11)
15 / 64
Example1
1-P(Z<2.11)
=1-0.9826
=0.0174
16 / 64
Example 2
Assuming that the sample is large enough for the sample mean to
follow a normal distribution, find the probability that a random
sample of 16 bulbs will have an average life of less than 775 hours.
17 / 64
Let X be the life of a randomly selected light bulb.
2
402
X ∼ (µ = 800, σ 2 = 402 ) X̄ ∼ N µ = 800, σn = 16
!
X−µ 775 − 800
P(X < 775) = P √ < √
σ/ n 40/ 16
= P(Z < −2.5)
= 0.0062
There is a 0.62% chance that the sample mean will be less than 775
hours.
18 / 64
Sampling Distribution of the Sample Proportion
µ = E(X) = np
σ 2 = var(X) = np(1 − P )
19 / 64
Sampling Distribution of the Sample Proportion
p(1 − p)
p̂ approx. ∼ N p,
n
20 / 64
Properties of the sample proportion
p̂ − p
q ≈ Z ∼ N (0, 1)
p(1−p)
n
s s
p(1 − p) p̂(1 − p̂)
≈
n n
21 / 64
Example
p is give, so
p̂ ∼N(o.44, o.44(1-0.44)/200)=N(0.44, 0.001232)
22 / 64
Statistical Inference
Statistical Inference
Point Estimators
Interval Estimators
23 / 64
What is Statistical Inference?
Statistical Inference refers to the problem of determining the
behavior of a large population by studying a small sample from that
population.
24 / 64
What is Statistical Inference?
Estimation
Hypothesis testing
25 / 64
Estimation
26 / 64
Estimator
Point Estimator
Interval Estimator
27 / 64
Two types of Estimators
28 / 64
Properties of Estimators 1: Linearity
29 / 64
Property 2: Unbiasedness
2. Unbiasedness
I An unbiased estimator of a population parameter is an
estimator whose expected value is equal to that parameter.
That is, estimator θ̂ of parameter θ is said to be unbiased if
E(θ̂) = θ.
30 / 64
Property 3: Efficiency
Example:
Both the sample median and sample mean are unbiased estimators
of the population mean, however, the sample median has a greater
variance than the sample mean, so we choose sample mean since it
is relatively efficient when compared to the sample median.
31 / 64
Property 3: Efficiency
X̄ = (X1 + X2 + X3 + X4 )/4
X̃ = (0.5X1 + X2 + X3 + 0.5X4 )/3
32 / 64
Best Linear Unbiased Estimator
I It can be shown that the sample mean is in fact BLUE for the
population mean.
33 / 64
Property 4: Consistency
I In simple terms, an estimator is consistent if it converges on
the true value when the sample size is large.
I Convergence in probability: If the sampling distribution of
an estimator of θ, θ̂ , collapses onto a constant α as the sample
size increases indefinitely, θ̂ is said to converge in probability to
α. Thus, α is the probability limit of θ̂, and it is written as
p
plim θ̂ = α or θ̂ → α.
I If plim θ̂ = α = θ, then θ̂ is said to be a consistent estimator
of θ.
I It is possible for an estimator to be biased but consistent.
I E.g. the sample variance S 2 is a consistent estimator of σ 2
which is also unbiased. This implies that an alternative
estimator for σ 2 which is defined by (Xi − X̄)2 /n is also
P
35 / 64
Interval Estimation
Assume that A and B are two random variables (A < B) that satisfy
P (a < µ < b) = 1 − α
36 / 64
Estimating µ when σ is known
37 / 64
Estimating µ when σ is known
We know that P(-1.96<Z<1.96)=1-2*P(Z<-1.96)=95% because
P(Z<-1.96)=0.025.
38 / 64
Estimating µ when σ is known
39 / 64
Estimating µ when σ is known
I Thus
P(-1.96<Z<1.96)=0.95
!
X−µ
P −1.96 < √ < 1.96 = 0.95
σ/ n
σ σ
P X̄ − 1.96 √ < µ < X̄ + 1.96 √ = 0.95
n n
40 / 64
Estimating µ when σ is known
σ σ
P X̄ − 1.96 √ < µ < X̄ + 1.96 √ = 0.95
n n
σ σ
I [X̄ − 1.96 √ , X̄ + 1.96 √ ] is called a 95% confidence
n n
interval estimator for µ.
σ
I i.e. the interval is the range ±1.96 √ on either side of X̄.
n
I Interpretation of the Confidence Interval:
In repeated sampling, 95% of the intervals created in this way
would contain true population µ.
41 / 64
Example 1
42 / 64
Example 1
I Interpretation:
If the experiment were carried out multiple times, 95% of the
intervals created in this way would contain true population µ.
43 / 64
Notation for Confidence Interval
σ σ
P X̄ − Zα/2 √ < µ < X̄ + Zα/2 √ =1−α
n n
σ
I Confidence interval estimator (CI): X̄ ± Zα/2 √
n
σ
Lower Confidence Limit (LCL): X̄ − Zα/2 √
n
σ
Upper Confidence Limit (UCL): X̄ + Zα/2 √
n
44 / 64
Notation for Confidence Interval
I 1-α is the confidence level, that is, the proportion of the values
of X̄ for which the interval
σ
X̄ ± Zα/2 √
n
45 / 64
Notation for Confidence Interval
I What does Zα/2 mean?
46 / 64
Factors affecting CI
Confidence Interval:
σ
X̄ ± Zα/2 √
n
47 / 64
IMPORTANT!
48 / 64
Example 2
Example
The average height of a sample of 25 men is found to be 178cm.
Assume that the standard deviation of male heights is known to be
10cm, and that heights follow a normal distribution. Find
a) A 95% confidence interval for the population mean height.
49 / 64
Example 2
σ 10
X̄ ± Z0.025 √ = 178 ± 1.96 √
n 25
= [174.08, 181.92]
50 / 64
Example 2
σ 10
X̄ ± Z0.05 √ = 178 ± 1.645 √
n 25
= [174.71, 181.29]
51 / 64
Selecting the sample size
We can use the CLT to find the minimum sample size required
to meet this condition, if the standard deviation of the
population is known.
52 / 64
Example 3
Example
I time my morning bus trips to work, and get an average of 35
minutes. Assuming that the times are normally distributed and the
standard deviation of times is known to be 5 minutes, I want to
estimate the true population mean length to within 3 minutes, with
99% certainty. What is the sample size required?
X̄ = 35 and σ = 5
I Solve for n
3
√ = 2.575
5/ n
n = 18.24 ≈ 19
55 / 64
Estimating µ when σ is unknown (use Student’s
t-distribution
I Remarks
The t distribution is very similar to the standardized normal
distribution except that its tails are fatter.
It is symmetric about zero.
For large degrees of freedom, the two distributions are virtually
identical.
t∞ = N (0, 1)
I This implies that if the t distribution (table or computer
program) is not available, the standardized normal distribution
can be used as a good approximation for cases of large samples
(say, d.f. > 30).
56 / 64
Estimating µ when σ is unknown (use Student’s
t-distribution
I When the population standard deviation is unknown for a
normally distributed variable, the appropriate distribution to get
a confidence interval for the population mean is the t
distribution with (n-1) degrees of freedom.
I That is, the 100(1-α)% confidence interval for µ is given by
s s
X̄ − tn−1, α2 √ < µ < X̄ + tn−1, α2 √
n n
57 / 64
Example 4
X̄ − µ
t= √ ∼ tn−1 .
s/ n
58 / 64
Example 4
s s
P X̄ − tn−1, α2 √ < µ < X̄ + tn−1, α2 √ = 0.95
n n
5000 5000
P X̄ − t24,0.025 √ < µ < X̄ + t24,0.025 √ = 0.95
25 25
5000 5000
P 21000 − 2.064 × √ < µ < 21000 + 2.064 × √ = 0.95
25 25
59 / 64
Example 5
63 63 68 71 42 49 59 55 59
60 / 64
I To transform X̄ into a standardized normal when σ is known or
a t distribution variable when σ is unknown, we have to
confirm that X̄ follows a normal distribution first.
I Since we do not have any information on the shape of the
distribution of X, the only justification for us to approximate
the distribution of X̄ by a normal distribution would be the
central limit theorem.
I However, the sample size is only of 9 which is not large enough
to validate the theorem.
I So, the best we can do is to assume that the distribution of X is
normal, so that the sample mean which is a linear combination
of X follows a normal distribution. (This implies that the
results will be sensitive to the validity of this assumption.)
I Assume that
X ∼ N (µ, ?).Therefore, it follows thatX̄ ∼ N (µ, ?)
61 / 64
I Since the population standard deviation is unknown, we cannot
standardize X into a standardized normal variable. Instead, we
transform it into a t variable:
X̄ − µ
t= √ ∼ tn−1 .
s/ n
.
I In the current case, n=9. Hence the above statistic follows the
t distribution with 8 degrees of freedom.
I To get the 95% confidence interval, we have to find the cut-off
point that
X̄ − µ
P (−t8,0.025 < √ < t8,0.025 ) = 0.95
s/ 9
s s
P (X̄ − t8,0.025 √ < µ < X̄ + t8,0.025 √ ) = 0.95
9 9
62 / 64
Thus, substituting the observed s and the cut-off point from the
table gives the 95% confidence interval for the population mean as
9.09 9.09
58.78 − 2.306 √ < µ < 58.78 + 2.306 √
9 9
63 / 64
Summary
64 / 64