You are on page 1of 33

Chapter 6

Confidence Intervals
§ 6.1
Confidence Intervals
for the Mean
(Large Samples)
Point Estimate for Population μ
A point estimate is a single value estimate for a population
parameter. The most unbiased point estimate of the
population mean, , is the sample mean, x.
Example:
A random sample of 32 textbook prices (rounded to the nearest
dollar) is taken from a local college bookstore. Find a point
estimate for the population mean, .
34 34 38 45 45 45 45 54
56 65 65 66 67 67 68 74
x  74.22
79 86 87 87 87 88 90 90
94 95 96 98 98 101 110 121

The point estimate for the population mean of textbooks


in the bookstore is $74.22.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 3
Interval Estimate
An interval estimate is an interval, or range of values,
used to estimate a population parameter.

Point estimate
for textbooks

74.22

interval estimate

How confident do we want to be that the interval estimate


contains the population mean, μ?

Larson & Farber, Elementary Statistics: Picturing the World, 3e 4


Level of Confidence
The level of confidence c is the probability that the
interval estimate contains the population parameter.

c is the area beneath the


c normal curve between
the critical values.

1 1
(1 – c) (1 – c)
2 2
z
−zc z=0 zc
Use the Standard
Critical values Normal Table to find the
corresponding z-scores.
The remaining area in the tails is 1 – c .
Larson & Farber, Elementary Statistics: Picturing the World, 3e 5
Common Levels of Confidence
If the level of confidence is 90%, this means that we are
90% confident that the interval contains the population
mean, μ.
0.90

0.05 0.05

z
−zc = −−z1.645
c z = 0 zc = z1.645
c

The corresponding z-scores are ± 1.645.

Larson & Farber, Elementary Statistics: Picturing the World, 3e 6


Common Levels of Confidence
If the level of confidence is 95%, this means that we are
95% confident that the interval contains the population
mean, μ.
0.95

0.025 0.025

z
−zc = −
−z1.96
c z=0 zc =z1.96
c

The corresponding z-scores are ± 1.96.

Larson & Farber, Elementary Statistics: Picturing the World, 3e 7


Common Levels of Confidence
If the level of confidence is 99%, this means that we are
99% confident that the interval contains the population
mean, μ.
0.99

0.005 0.005

z
−zc = −−z2.575
c z = 0 zc = z2.575
c

The corresponding z-scores are ± 2.575.

Larson & Farber, Elementary Statistics: Picturing the World, 3e 8


Margin of Error
The difference between the point estimate and the actual
population parameter value is called the sampling error.
When μ is estimated, the sampling error is the difference
μ – x . Since μ is usually unknown, the maximum value for
the error can be calculated using the level of confidence.

Given a level of confidence, the margin of error (sometimes


called the maximum error of estimate or error tolerance) E
is the greatest possible distance between the point estimate
and the value of the parameter it is estimating.
E = z cσ x = z c σ
n When n  30, the sample standard
deviation, s, can be used for .

Larson & Farber, Elementary Statistics: Picturing the World, 3e 9


Margin of Error
Example:
A random sample of 32 textbook prices is taken from a
local college bookstore. The mean of the sample is x = 74.22,
and the sample standard deviation is s = 23.44.
Use a 95% confidence level and find the margin of error for
the mean price of all textbooks in the bookstore.

Since n  30, s can be


E = z c σ = 1.96  23.44 substituted for σ.
n 32
 8.12

We are 95% confident that the margin of error for the


population mean (all the textbooks in the bookstore) is
about $8.12.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 10
Confidence Intervals for μ
A c-confidence interval for the population mean μ is
x−E   x+E
The probability that the confidence interval contains μ is c.

Example:
A random sample of 32 textbook prices is taken from a
local college bookstore. The mean of the sample is x =
74.22, the sample standard deviation is s = 23.44, and the
margin of error is E = 8.12.
Construct a 95% confidence interval for the mean price of
all textbooks in the bookstore.
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 11
Confidence Intervals for μ
Example continued:
Construct a 95% confidence interval for the mean price of
all textbooks in the bookstore.
x = 74.22 s = 23.44 E = 8.12
Left endpoint = ? Right endpoint = ?

• x =•
74.22 •
x − E = 74.22 − 8.12 x + E = 74.22 + 8.12
= 66.1 = 82.34

With 95% confidence we can say that the cost for all
textbooks in the bookstore is between $66.10 and $82.34.

Larson & Farber, Elementary Statistics: Picturing the World, 3e 12


Finding Confidence Intervals for μ
Finding a Confidence Interval for a Population Mean (n
 30 or σ known with a normally distributed population)
In Words In Symbols
1. Find the sample statistics n and x. x =
x
n
2. Specify , if known. Otherwise, if n  30, ( x − x )2
find the sample standard deviation s and s=
n −1
use it as an estimate for .
3. Find the critical value zc that corresponds Use the Standard
Normal Table.
to the given level of confidence.
E = zc σ
4. Find the margin of error E. n
Left endpoint: x − E
5. Find the left and right endpoints Right endpoint: x + E
and form the confidence interval. Interval: x − E    x + E
Larson & Farber, Elementary Statistics: Picturing the World, 3e 13
Confidence Intervals for μ ( Known)
Example:
A random sample of 25 students had a grade point average
with a mean of 2.86. Past studies have shown that the
standard deviation is 0.15 and the population is normally
distributed.
Construct a 90% confidence interval for the population
mean grade point average.
n = 25 x = 2.86  = 0.15
σ = 1.645  0.15
zc = 1.645 E = zc  0.05
n 25
x + E = 2.86 ± 0.05 2.81 < μ < 2.91
With 90% confidence we can say that the mean grade point
average for all students in the population is between 2.81 and 2.91.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 14
Sample Size
Given a c-confidence level and a maximum error of
estimate, E, the minimum sample size n, needed to
estimate , the population mean, is

 zc 
2

n=  .
 E 
If  is unknown, you can estimate it using s provided you
have a preliminary sample with at least 30 members.

Example:
You want to estimate the mean price of all the textbooks in
the college bookstore. How many books must be included in
your sample if you want to be 99% confident that the sample
mean is within $5 of the population mean? Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 15
Sample Size
Example continued:
You want to estimate the mean price of all the textbooks in
the college bookstore. How many books must be included in
your sample if you want to be 99% confident that the sample
mean is within $5 of the population mean?

x = 74.22   s = 23.44 zc = 2.575

 zc   2.575  23.44 


2 2

n=  = 
 E   5 
 145.7 (Always round up.)

You should include at least 146 books in your sample.

Larson & Farber, Elementary Statistics: Picturing the World, 3e 16


§ 6.2
Confidence Intervals
for the Mean
(Small Samples)
The t-Distribution
When a sample size is less than 30, and the random variable x
is approximately normally distributed, it follow a t-distribution.
t =x −μ
s
n
Properties of the t-distribution
1. The t-distribution is bell shaped and symmetric about the mean.
2. The t-distribution is a family of curves, each determined by a
parameter called the degrees of freedom. The degrees of freedom
are the number of free choices left after a sample statistic such as x
is calculated. When you use a t-distribution to estimate a
population mean, the degrees of freedom are equal to one less than
the sample size.
d.f. = n – 1 Degrees of freedom
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 18
The t-Distribution
3. The total area under a t-curve is 1 or 100%.
4. The mean, median, and mode of the t-distribution are equal to zero.
5. As the degrees of freedom increase, the t-distribution approaches the
normal distribution. After 30 d.f., the t-distribution is very close to
the standard normal z-distribution.

The tails in the t-distribution


are “thicker” than those in the
standard normal distribution.
d.f. = 2
d.f. = 5 t
0
Standard
normal curve

Larson & Farber, Elementary Statistics: Picturing the World, 3e 19


Critical Values of t
Example:
Find the critical value tc for a 95% confidence when the
sample size is 5.
Appendix B: Table 5: t-Distribution
Level of
confidence, c 0.50 0.80 0.90 0.95 0.98
One tail,  0.25 0.10 0.05 0.025 0.01
d.f. Two tails,  0.50 0.20 0.10 0.05 0.02
1 1.000 3.078 6.314 12.706 31.821
2 .816 1.886 2.920 4.303 6.965
3 .765 1.638 2.353 3.182 4.541
4 .741 1.533 2.132 2.776 3.747
5 .727 1.476 2.015 2.571 3.365
d.f. = n – 1 = 5 – 1 = 4
c = 0.95 tc = 2.776 Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 20
Critical Values of t
Example continued:
Find the critical value tc for a 95% confidence when the
sample size is 5.
95% of the area under the t-distribution curve with 4
degrees of freedom lies between t = ±2.776.

c = 0.95

t
−tc = − 2.776 tc = 2.776

Larson & Farber, Elementary Statistics: Picturing the World, 3e 21


Confidence Intervals and t-Distributions
Constructing a Confidence Interval for the Mean: t-
Distribution
In Words In Symbols
1. Identify the sample statistics n, x , x =
x ( x − x )2
n s=
and s. n −1

2. Identify the degrees of freedom, d.f. = n – 1


the level of confidence c, and the
critical value tc.
3. Find the margin of error E. E = tc s
n
4. Find the left and right endpoints
and form the confidence interval. Left endpoint: x − E
Right endpoint: x + E
Interval: x − E    x + E
Larson & Farber, Elementary Statistics: Picturing the World, 3e 22
Constructing a Confidence Interval
Example:
In a random sample of 20 customers at a local fast food
restaurant, the mean waiting time to order is 95 seconds,
and the standard deviation is 21 seconds. Assume the wait
times are normally distributed and construct a 90%
confidence interval for the mean wait time of all customers.
n = 20 x = 95 s = 21
21 = 8.1
d.f. = 19 tc = 1.729 E = tc s = 1.729 
n 20
x  E = 95 ± 8.1 86.9 < μ < 103.1

We are 90% confident that the mean wait time for all
customers is between 86.9 and 103.1 seconds.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 23
Normal or t-Distribution?
Use the normal distribution with
Is n  30? Yes E = zc σ .
n
No If  is unknown, use s instead.

Is the population normally, or


You cannot use the normal
approximately normally, No
distribution or the t-distribution.
distributed?

Yes
Use the normal distribution with
Is  known? Yes
E = zc σ .
n
No

Use the t-distribution with


E = tc s
n
and n – 1 degrees of freedom.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 24
Normal or t-Distribution?
Example:
Determine whether to use the normal distribution, the
t-distribution, or neither.
a.) n = 50, the distribution is skewed, s = 2.5
The normal distribution would be used because the
sample size is 50.
b.) n = 25, the distribution is skewed, s = 52.9
Neither distribution would be used because n < 30 and
the distribution is skewed.
c.) n = 25, the distribution is normal,  = 4.12
The normal distribution would be used because although
n < 30, the population standard deviation is known.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 25
§ 6.3
Confidence Intervals
for Population
Proportions
Point Estimate for Population p
The probability of success in a single trial of a binomial
experiment is p. This probability is a population
proportion.
The point estimate for p, the population proportion of
successes, is given by the proportion of successes in a
sample and is denoted by
pˆ = x
n
where x is the number of successes in the sample and n is
the number in the sample. The point estimate for the
proportion of failures is qˆ = 1 – p̂. The symbols p̂ and qˆ
are read as “p hat” and “q hat.”

Larson & Farber, Elementary Statistics: Picturing the World, 3e 27


Point Estimate for Population p
Example:
In a survey of 1250 US adults, 450 of them said that their favorite
sport to watch is baseball. Find a point estimate for the population
proportion of US adults who say their favorite sport to watch is
baseball.
n = 1250 x = 450

pˆ = x = 450 = 0.36
n 1250
The point estimate for the proportion of US adults who
say baseball is their favorite sport to watch is 0.36, or
36%.

Larson & Farber, Elementary Statistics: Picturing the World, 3e 28


Confidence Intervals for p
A c-confidence interval for the population proportion p is
pˆ − E  p  pˆ + E
where

E = zc pq
ˆ ˆ.
n
The probability that the confidence interval contains p is c.

Example:
Construct a 90% confidence interval for the proportion of
US adults who say baseball is their favorite sport to watch.

n = 1250 x = 450 p̂ = 0.36


Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 29
Confidence Intervals for p
Example continued:
n = 1250 x = 450 p̂ = 0.36 E = z c pq
ˆˆ
n
qˆ = 0.64 (0.36)(0.64)  0.022
= 1.645
1250

Left endpoint = ? Right endpoint = ?

• p̂ = •
0.36 •
p̂ − E = 0.36 − 0.022 p̂ + E = 0.36 + 0.022
= 0.338 = 0.382
With 90% confidence we can say that the proportion of all
US adults who say baseball is their favorite sport to watch
is between 33.8% and 38.2%.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 30
Finding Confidence Intervals for p
Constructing a Confidence Interval for a Population Proportion
In Words In Symbols
1. Identify the sample statistics n and x.
2. Find the point estimate p̂. pˆ = x
n
3. Verify that the sampling distribution npˆ  5, nqˆ  5
can be approximated by the normal
distribution.
4. Find the critical value zc that Use the Standard
corresponds to the given level of Normal Table.
confidence.
5. Find the margin of error E. E = z c pq
ˆˆ
n
6. Find the left and right endpoints and Left endpoint: p̂ − E
form the confidence interval. Right endpoint: p̂ + E
Interval: pˆ − E  p  pˆ + E
Larson & Farber, Elementary Statistics: Picturing the World, 3e 31
Sample Size
Given a c-confidence level and a margin of error, E, the
minimum sample size n, needed to estimate p is
2
 zc 
n = pq
ˆˆ  .
E
This formula assumes you have an estimate for p̂ and qˆ.
If not, use pˆ = 0.5 and qˆ = 0.5.

Example:
You wish to find out, with 95% confidence and within 2% of
the true population, the proportion of US adults who say
that baseball is their favorite sport to watch.
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 32
Sample Size
Example continued:
You wish to find out, with 95% confidence and within 2% of
the true population, the proportion of US adults who say
that baseball is their favorite sport to watch.
n = 1250 x = 450 p̂ = 0.36
2 2
z 
ˆ ˆ  c  = (0.36)(0.64) 
1.96 
n = pq 
E  0.02 
 2212.8 (Always round up.)

You should sample at least 2213 adults to be 95% confident.

Larson & Farber, Elementary Statistics: Picturing the World, 3e 33

You might also like