Professional Documents
Culture Documents
Introduction
In a previous lecture we considered using the mean X̄ of a random sample, X1, X2, X3 , ..., Xn , to
estimate the mean of a population. Recall what we discovered:
If the sample size n is sufficiently large (n>30) the sample means follow an approximate
Normal distribution with mean (or expected value) and standard deviation (or standard
error) / √n
From our Normal tables, we know that in theory 95% of sample means should fall within 1.96
standard errors of .
But what about in practice? If we take one sample, what can we learn from its mean?
For a particular sample mean x̄ , we are 95% confident that the interval
x̄
± 1.96 / √n
We do not say
We can say
120MP L19 1
Confidence intervals in general
Example 1
A random sample of 100 jam jars had mean weight 249g. The standard deviation of all jars
produced is known to be 4g.
(a) Find a 90% confidence interval for the mean weight of all jars.
(b) Find a 99% confidence interval for the mean weight of all jars.
120MP L19 2
unknown
In practice it is very unlikely that the population variance will be known. So we must estimate it
from the sample. This presents two complications. These will make our confidence interval a bit
wider than if the population variance were known.
First complication
2
Σ(x −x)
2
If we calculate the variance of the sample using the formula: σ = n
n
2
2 2 Σ (x−x)
So to overcome this bias we estimate by s =σ n−1=
n−1
The numerator of this fraction is called the sum of squares, while the denominator is called the
degrees of freedom.
Second complication
If we calculate the confidence interval using s instead of , we shall be introducing some additional
uncertainty because we are now not sure about the standard error.
This means we need to use the t distribution instead of the Normal distribution to obtain our (larger)
critical value.
0.4
0.3
0.2
0.1
0
-5 -4 -3 -2 -1 0 1 2 3 4 5
t
120MP L19 3
To use the t-tables you need to know both the percentage confidence and the degrees of freedom.
The bottom row of the t-tables gives the corresponding figure from the Normal tables. Why?
• Note that you should only use the t-distribution if the population measurements are
Normally distributed.
• If they aren’t, you need to use the z tables. (However, this should only be done if n is large)
Example 2
A random sample of 10 jam jars had mean weight 249g. The standard deviation of the sample
was 4g. Calculate a 99% confidence interval for the mean weight of all jars.
120MP L19 4
Upper Percentage Points of the Student’s t-Distribution
= degrees of freedom
120MP L19 5