Professional Documents
Culture Documents
Decisions
Interval Estimation
2
Margin of Error and the Interval Estimate
A point estimator cannot be expected to provide the exact value of the population parameter.
An interval estimate can be computed by adding and subtracting a margin of error to the point
estimate.
The purpose of an interval estimate is to provide information about how close the point estimate is
to the value of the parameter.
The general form of an interval estimate of a population mean is
x Margin of Error
3
Interval Estimate of a Population Mean:
s Known
In order to develop an interval estimate of a population mean, the margin of error must be
computed using either:
▪ the population standard deviation s , or
▪ the sample standard deviation s
▪ s is rarely known exactly, but often a good estimate can be obtained based on historical data.
▪ We refer to such cases as the s known case.
4
Interval Estimate of a Population Mean:
s Known
◼ Interval Estimate of m
s
x z /2
n
5
Interval Estimate of a Population Mean:
s Known
Sampling
distribution
of x
x
m
7
Interval Estimate of a Population Mean:
s Known
Sampling Sampling
distribution distribution
of x of z
x 𝑍
m 0
Sampling
distribution
of x
1 - of all
/2 /2
x values
interval
does not x
include m m interval
z /2 s x z /2 s x includes m
x
x
x 9
Interval Estimate of a Population Mean:
s Known
◼ Interval Estimate of m
s
x z /2
n
10
Interval Estimate of a Population Mean:
s Known
Values of z/2 for the Most Commonly Used Confidence Levels
Confidence Table
Level α α/2 Look-up Area zα/2
11
Interval Estimate of a Population Mean:
s Known
Because 90% of all the intervals constructed using 𝑥ҧ ± 1.645𝜎𝑥ҧ will contain the population mean, we
say we are 90% confident that the interval 𝑥ҧ ± 1.645𝜎𝑥ҧ includes the population mean µ.
We say that this interval has been established at the 90% confidence level.
The value .90 is referred to as the confidence coefficient.
12
Interval Estimate of a Population Mean:
s Known
Example: Lloyd’s Department Store
Each week Lloyd’s Department Store selects a random sample of 100 customers in order to learn about the
amount spent per shopping trip.
With x representing the amount spent per shopping trip, the sample mean provides a point estimate of μ, the
mean amount spent per shopping trip for the population of all Lloyd’s customers.
Lloyd’s has been using the weekly survey for several years. Based on the historical data, Lloyd’s now
assumes a known value of σ = $20 for the population standard deviation. The historical data also indicate
that the population follows a normal distribution.
During the most recent week, Lloyd’s surveyed 100 customers (n=100) and obtained a sample mean of 𝑥ҧ =
$82. The sample mean amount spent provides a point estimate of the population mean amount spent per
shopping trip, μ.
Determine the interval estimate of μ with 95% confidence level.
13
Interval Estimate of a Population Mean:
s Known
Example: Lloyd’s Department Store
95% of the sample means that can be observed are within + 1.96𝜎𝑥ҧ of the population mean m.
s 20
z /2 = 1.96 = 3.92
n 100
Thus, at 95% confidence,
the margin of error is $3.92.
14
Interval Estimate of a Population Mean:
s Known
Example: Lloyd’s Department Store
Interval estimate of m is:
$82 + $3.92
or
$78.08 to $85.92
We are 95% confident that the interval contains the population mean.
15
Interval Estimate of a Population Mean:
s Known
Example: Lloyd’s Department Store
Confidence Margin
Level of Error Interval Estimate
90% 3.29 78.71 to 85.29
95% 3.92 78.08 to 85.92
99% 5.15 76.85 to 87.15
16
Interval Estimate of a Population Mean:
s Known
Adequate Sample Size
In most applications, a sample size of n = 30 is adequate.
If the population distribution is highly skewed or contains outliers, a sample size of 50 or more is
recommended.
If the population is not normally distributed but is roughly symmetric, a sample size as small as 15
will suffice.
If the population is believed to be at least approximately normal, a sample size of less than 15 can
be used.
17
Interval Estimate of a Population Mean:
s unknown
If an estimate of the population standard deviation s cannot be developed prior to sampling, we use
the sample standard deviation s to estimate s .
This is the s unknown case.
In this case, the interval estimate for m is based on the t distribution.
18
t Distribution
William Gosset, writing under the name “Student”, is the founder of the t distribution.
Gosset was an Oxford graduate in mathematics and worked for the Guinness Brewery in Dublin.
He developed the t distribution while working on small-scale materials and temperature
experiments.
20
t Distribution
A specific t distribution depends on a parameter known as the degrees of freedom.
Degrees of freedom refer to the number of independent pieces of information that go into the
computation of s.
A t distribution with more degrees of freedom has less dispersion.
As the degrees of freedom increases, the difference between the t distribution and the standard
normal probability distribution becomes smaller and smaller.
21
t Distribution
t distribution
Standard (20 degrees
normal of freedom)
distribution
t distribution
(10 degrees
of freedom)
z, t
0
22
t Distribution
For more than 100 degrees of freedom, the standard normal z value provides a good approximation
to the t value.
The standard normal z values can be found in the infinite degrees (∞) row of the t distribution table.
Standard normal
z values
23
t Distribution
24
Interval Estimate of a Population Mean:
s unknown
◼ Interval Estimate
s
x t /2
n
25
Interval Estimate of a Population Mean:
s unknown
Example: Credit Card Debt
To illustrate the interval estimation procedure for the σ unknown case, we will consider a study designed to
estimate the mean credit card debt for the population of U.S. households.
A sample of n = 70 households provided the credit card balances shown in Table 8.3. For this situation, no
previous estimate of the population standard deviation σ is available. Thus, the sample data must be used to
estimate both the population mean and the population standard deviation.
Using the data in Table 8.3, we compute the sample mean 𝑥ҧ = $9312 and the sample standard deviation s =
$4007.
26
Interval Estimate of a Population Mean:
s unknown
Example: Credit Card Debt
27
Interval Estimate of a Population Mean:
s unknown
➢ At 95% confidence, = .05, and /2 = .025.
➢ t.025 is based on n - 1 = 70 - 1 = 69 degrees of freedom.
➢ In the t distribution table we see that t.025 = 1.995.
28
Interval Estimate of a Population Mean:
s unknown
29
Interval Estimate of a Population Mean:
s unknown
Interval Estimate
s
x t.025 Margin
n of Error
4007
9312 1.995 = 9312 955
70
We are 95% confident that the mean credit card balance for the population
of all household is between $8,357 and $10,267.
30
Interval Estimate of a Population Mean:
s unknown
Adequate Sample Size
31
Interval Estimate of a Population Mean:
s unknown
Adequate Sample Size
32
Summary of Interval Estimation Procedures
for a Population Mean
33
Sample Size for an Interval Estimate
of a Population Mean
Let E = the desired margin of error.
E is the amount added to and subtracted from the point estimate to obtain an interval estimate.
If a desired margin of error is selected prior to sampling, the sample size necessary to satisfy the
margin of error can be determined.
Margin of Error s
➢ E = z /2
n
35
Sample Size for an Interval Estimate
of a Population Mean
Example: Lloyd’s Department Store
Recall that Lloyd’s Department Store is planning to get an estimate of the mean amount spent per
shopping trip. Suppose that Lloyd’s management team wants an estimate of the population mean
such that there is a 0.95 probability that the sampling error is $2 or less.
36
Sample Size for an Interval Estimate
of a Population Mean
Example: Lloyd’s Department Store
s
z / 2 =2
n
At 95% confidence, z.025 = 1.96. Recall that s = 20.
(1.96)2 (20)2
n= 2
= 384.16 = 385
(2)
37
Interval Estimate of a Population Proportion
Sampling
𝑝(1 − 𝑝)
distribution 𝜎𝑝lj =
𝑛
of 𝑝ҧ
Political Science, Inc. (PSI) specializes in voter polls and surveys designed to keep
political office seekers informed of their position in a race. Using telephone surveys,
PSI interviewers ask registered voters who they would vote for if the election were
held that day.
Interval Estimate of a Population Proportion
In a current election campaign, PSI has just found that 220 registered voters, out of 500
contacted, favor a particular candidate. PSI wants to develop a 95% confidence interval
estimate for the proportion of the population of registered voters that favor the candidate.
Interval Estimate of a Population Proportion
p (1 - p )
p z / 2
n
where: n = 500, 𝑝ҧ = 220/500 = .44, z/2 = 1.96
.44(1 - .44)
.44 1.96 = .44 + .0435
500
p (1 - p )
E = z / 2
n
Solving for the necessary sample size, we get
( z / 2 ) 2 p (1 - p )
n=
E2
However, 𝑝ҧ will not be known until after we have selected the sample. We will use the planning value
p* for 𝑝ҧ .
Sample Size for an Interval Estimate of a Population Proportion
Necessary Sample Size
( z / 2 ) 2 p* (1 - p* )
n=
E2
Suppose that PSI would like a .99 probability that the sample proportion is within + .03
of the population proportion. How large a sample size is needed to meet the required
precision? (A previous sample of similar units yielded .44 for the sample proportion.)
Sample Size for an Interval Estimate of a Population Proportion
p * (1 - p * )
z /2 = .03
n
At 99% confidence, z.005 = 2.576. Recall that p* = .44.
( z /2 )2 p * (1 - p * ) (2.576)2 (.44)(.56)
n= = 1817
E2 (.03) 2