Inference About 2 Population Means

Inference About Means and
Proportions with
Two Populations
 Inferences About the Difference Between
Two Population Means: s 1 and s 2 Known
Two Population Means: s 1 and s 2 Unknown
Two Population Means: Matched Samples
Inferences About the Difference Between
Two Population Means: s 1 and s 2 Known
 Interval Estimation of m 1 – m 2
 Hypothesis Tests About m 1 – m 2
Estimating the Difference Between
Two Population Means
 Let m1 equal the mean of population 1 and m2 equal
the mean of population 2.
 The difference between the two population means is
m1 - m2.
 To estimate m1 - m2, we will select a simple random
sample of size n1 from population 1 and a simple
random sample of size n2 from population 2.
 Let x1 equal the mean of sample 1 and x2 equal the
mean of sample 2.
 The point estimator of the difference between the
means of the populations 1 and 2 is x1  x2.
Sampling Distribution of x1  x2
 Expected Value
E ( x1  x2 )  m1  m 2
 Standard Deviation (Standard Error)
s12 s 22
s x1  x2  
n1 n2
where: s1 = standard deviation of population 1

s2 = standard deviation of population 2
n1 = sample size from population 1
n2 = sample size from population 2
Interval Estimation of m1 - m2:
s 1 and s 2 Known
 Interval Estimate
s 12 s 22
x1  x2  z / 2 
n1 n2
where:
1 -  is the confidence coefficient
s 1 and s 2 Known
 Example: Par, Inc.
Par, Inc. is a manufacturer of golf equipment and
has developed a new golf ball that has been
designed to provide “extra distance.”
In a test of driving distance using a mechanical
driving device, a sample of Par golf balls was
compared with a sample of golf balls made by Rap,
Ltd., a competitor. The sample statistics appear on
the next slide.
s 1 and s 2 Known
Sample #1 Sample #2
Par, Inc. Rap, Ltd.
Sample Size 120 balls 80 balls
Sample Mean 295 yards 278 yards
Based on data from previous driving distance

tests, the two population standard deviations are
known with s 1 = 15 yards and s 2 = 20 yards.
s 1 and s 2 Known
Let us develop a 95% confidence interval estimate
of the difference between the mean driving distances of
the two brands of golf ball.
Estimating the Difference Between
Two Population Means
Population 1 Population 2
Par, Inc. Golf Balls Rap, Ltd. Golf Balls
m1 = mean driving m2 = mean driving
distance of Par distance of Rap
golf balls golf balls
m1 – m2 = difference between
the mean distances
Simple random sample Simple random sample
of n1 Par golf balls of n2 Rap golf balls
x1 = sample mean distance x2 = sample mean distance
for the Par golf balls for the Rap golf balls
x1 - x2 = Point Estimate of m1 – m2
Point Estimate of m1 - m2
Point estimate of m1  m2 = x1  x2
= 295  278
= 17 yards
where:
m1 = mean distance for the population
of Par, Inc. golf balls
of Rap, Ltd. golf balls
s 1 and s 2 Known
s12 s 22 (15) 2 ( 20) 2

x1  x2  z / 2   17  1. 96 
n1 n2 120 80
17 + 5.14 or 11.86 yards to 22.14 yards
We are 95% confident that the difference between
the mean driving distances of Par, Inc. balls and Rap,
Ltd. balls is 11.86 to 22.14 yards.
Hypothesis Tests About m 1  m 2:
s 1 and s 2 Known
 Hypotheses
H 0 : m1  m2  D0 H 0 : m1  m2  D0 H 0 : m1  m2  D0
H a : m1  m 2  D0 H a : m1  m2  D0 H a : m1  m2  D0
Left-tailed Right-tailed Two-tailed
 Test Statistic
( x1  x2 )  D0
z
s 12 s 22

n1 n2
s 1 and s 2 Known
Can we conclude, using  = .01, that the
mean driving distance of Par, Inc. golf balls is
greater than the mean driving distance of Rap, Ltd.
golf balls?
s 1 and s 2 Known
 p –Value and Critical Value Approaches
1. Develop the hypotheses. H0: m1 - m2 < 0 

Ha: m1 - m2 > 0
where:
of Par, Inc. golf balls
of Rap, Ltd. golf balls
2. Specify the level of significance.  = .01

s 1 and s 2 Known
3. Compute the value of the test statistic.
( x1  x2 )  D0
z
s 12 s 22

n1 n2
(295  278)  0 17
z   6.49
(15) 2 (20 ) 2 2.62

120 80
s 1 and s 2 Known
 p –Value Approach
4. Compute the p–value.

For z = 6.49, the p –value < .0001.
5. Determine whether to reject H0.

Because p–value <  = .01, we reject H0.
At the .01 level of significance, the sample evidence
indicates the mean driving distance of Par, Inc. golf
balls is greater than the mean driving distance of Rap,
Ltd. golf balls.
s 1 and s 2 Known
 Critical Value Approach
4. Determine the critical value and rejection rule.
For  = .01, z.01 = 2.33

Reject H0 if z > 2.33

Because z = 6.49 > 2.33, we reject H0.
The sample evidence indicates the mean driving
distance of Par, Inc. golf balls is greater than the mean
driving distance of Rap, Ltd. golf balls.
Two Population Means: s 1 and s 2 Unknown
 Interval Estimation of m 1 – m 2
 Hypothesis Tests About m 1 – m 2
A Basic Question:
If standard deviations for two populations are
not known what will be the possibilities about
two standard deviations?
s1 = s2 or s1 ≠ s2
s 1 and s 2 Unknown and s 1 ≠ s 2
When s 1 and s 2 are unknown, we will:
• use the sample standard deviations s1 and s2
as estimates of s 1 and s 2 , and
• replace z/2 with t/2.
s 1 and s 2 Unknown and s 1 ≠ s 2
 Interval Estimate with t-test statistics
s12 s22
x1  x2  t / 2 
n1 n2
Where the degrees of freedom for t/2 are:
2
2 2
s s 
1 2
  
 n1 n2 
df  2 2
2 2
1  s1  1  s2 
    
n1  1  n1  n2  1  n2 
s 1 and s 2 Unknown and s 1 = s 2
Degrees of
freedom
t- statistics value
t- statistics value
But how to decide which case is applicable?
STEPS
 F-test for equal variances where the hypotheses will

be tested whether two population variances are equal
or not equal.
 On the basis of above we will decide which t-test
 Then apply the required t-test to conclude the
original hypotheses testing.
 Interpret your result.
Direct and Broker-Purchased Mutual Funds
Millions of investors buy mutual funds choosing from

thousands of possibilities. Some funds can be purchased
directly from banks or other financial institutions whereas
others must be purchased through brokers, who charge a fee
for this service. This raises the question, Can investors do
better by buying mutual funds directly than by purchasing
mutual funds through brokers? To help answer this question,
a group of researchers randomly sampled the annual returns
from mutual funds that can be acquired directly and mutual
funds that are bought through brokers and recorded the net
annual returns, which are the returns on investment after
deducting all relevant fees. These are listed next.
Can we conclude at the 5% significance level that directly
purchased mutual funds outperform mutual funds bought
through brokers?
Data
Spreadsheet
Difference Between Two Population Means:
s 1 and s 2 Unknown
 Example: Specific Motors
Specific Motors of Detroit has developed a new
Automobile known as the M car. 24 M cars and 28 J
cars (from Japan) were road tested to compare miles-
per-gallon (mpg) performance. The sample statistics
are shown on the next slide.
s 1 and s 2 Unknown
Sample #1 Sample #2
M Cars J Cars
24 cars 28 cars Sample Size
29.8 mpg 27.3 mpg Sample Mean
2.56 mpg 1.81 mpg Sample Std. Dev.
s 1 and s 2 Unknown
Let us develop a 90% confidence interval estimate
of the difference between the mpg performances of
the two models of automobile.
Point Estimate of m 1  m 2
Point estimate of m1  m2 = x1  x2
= 29.8 - 27.3
= 2.5 mpg
where:
m1 = mean miles-per-gallon for the
population of M cars
m2 = mean miles-per-gallon for the
population of J cars
Interval Estimation of m 1  m 2:
s 1 and s 2 Unknown
The degrees of freedom for t/2 are:
2 2 2
 (2.56) (1.81) 
  
 24 28 
df  2 2
 24.07  24
1  (2.56) 2  1  (1.81) 2 
    
24  1  24  28  1  28 
With /2 = .05 and df = 24, t/2 = 1.711

Interval Estimation of m 1  m 2:
s 1 and s 2 Unknown
s12 s22 (2.56)2 (1.81) 2

x1  x2  t / 2   29.8  27.3  1.711 
n1 n2 24 28
2.5 + 1.069 or 1.431 to 3.569 mpg
We are 90% confident that the difference between

the miles-per-gallon performances of M cars and J cars
is 1.431 to 3.569 mpg.
s 1 and s 2 Unknown
 Hypotheses
H 0 : m1  m2  D0 H 0 : m1  m2  D0 H 0 : m1  m2  D0
H a : m1  m2  D0 H a : m1  m2  D0 H a : m1  m2  D0
Left-tailed Right-tailed Two-tailed
 Test Statistic
( x1  x2 )  D0
t
s12 s22

n1 n2
s 1 and s 2 Unknown
Can we conclude, using a .05 level of significance,
that the miles-per-gallon (mpg) performance of M cars
is greater than the miles-per-gallon performance of J
cars?
s 1 and s 2 Unknown
1. Develop the hypotheses.
H0: m1 - m2 < 0 
Ha: m1 - m2 > 0
where:
m1 = mean mpg for the population of M cars
m2 = mean mpg for the population of J cars
s 1 and s 2 Unknown
( x1  x2 )  D0 (29.8  27.3)  0
t   4.003
s12 s22 (2.56) 2 (1.81) 2
 
n1 n2 24 28
s 1 and s 2 Unknown
4. Compute the p –value.
The degrees of freedom for t are:
2 2 2
 (2.56) (1.81) 
  
 24 28 
df  2 2
 40.566  41
1  (2.56) 2  1  (1.81)2 
    
24  1  24  28  1  28 
Because t = 4.003 > t.05 = 1.683, the p–value < .05.
In fact, the p–value < .005.
s 1 and s 2 Unknown
We are at least 95% confident that the miles-per-
gallon (mpg) performance of M cars is greater than
the miles-per-gallon performance of J cars?.
s 1 and s 2 Unknown
For  = .05 and df = 41, t.05 = 1.683
Reject H0 if t > 1.683

Because 4.003 > 1.683, we reject H0.
We are at least 95% confident that the miles-per-
gallon (mpg) performance of M cars is greater than
the miles-per-gallon performance of J cars?.
 With a matched-sample design each sampled item
provides a pair of data values.
 This design often leads to a smaller sampling error
than the independent-sample design because
variation between sampled items is eliminated as a
source of sampling error.
 Example: Express Deliveries
A Chicago-based firm has documents that must
be quickly distributed to district offices throughout
the U.S. The firm must decide between two delivery
services, UPX (United Parcel Express) and INTEX
(International Express), to transport its documents.
 Example: Express Deliveries
In testing the delivery times of the two services,
the firm sent two reports to a random sample of its
district offices with one report carried by UPX and
the other report carried by INTEX. Do the data on
the next slide indicate a difference in mean delivery
times for the two services? Use a .05 level of
significance.
Delivery Time (Hours)

District Office UPX INTEX Difference
Seattle 32 25 7
Los Angeles 30 24 6
Boston 19 15 4
Cleveland 16 15 1
New York 15 13 2
Houston 18 15 3
Atlanta 14 15 -1
St. Louis 10 8 2
Milwaukee 7 9 -2
Denver 16 11 5
1. Develop the hypotheses.
H0: md = 0 
Ha: md 
Let md = the mean of the difference values for the
two delivery services for the population
of district offices
 di ( 7  6... 5)
d    2. 7
n 10
 ( di  d ) 2 76.1
sd    2. 9
n 1 9
d  md 2.7  0
t   2.94
sd n 2.9 10
4. Compute the p –value.
For t = 2.94 and df = 9, the p–value is between
.02 and .01. (This is a two-tailed test, so we double
the upper-tail areas of .01 and .005.)

We are at least 95% confident that there is a
difference in mean delivery times for the two
services?
For  = .05 and df = 9, t.025 = 2.262.

Reject H0 if t > 2.262

Because t = 2.94 > 2.262, we reject H0.
We are at least 95% confident that there is a
difference in mean delivery times for the two
services?
End of BS -I
THANK YOU
FOR
YOUR TOLERANCE

Inference About 2 Population Means

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Inference About 2 Population Means

Uploaded by

Copyright:

Available Formats

Inference About Means and

 Standard Deviation (Standard Error)

where: s1 = standard deviation of population 1

Based on data from previous driving distance

s12 s 22 (15) 2 ( 20) 2

1. Develop the hypotheses. H0: m1 - m2 < 0 

2. Specify the level of significance.  = .01

3. Compute the value of the test statistic.

4. Compute the p–value.

5. Determine whether to reject H0.

4. Determine the critical value and rejection rule.

For  = .01, z.01 = 2.33

5. Determine whether to reject H0.

 F-test for equal variances where the hypotheses will

Millions of investors buy mutual funds choosing from

With /2 = .05 and df = 24, t/2 = 1.711

s12 s22 (2.56)2 (1.81) 2

We are 90% confident that the difference between

3. Compute the value of the test statistic.

5. Determine whether to reject H0.

Delivery Time (Hours)

3. Compute the value of the test statistic.

5. Determine whether to reject H0.

For  = .05 and df = 9, t.025 = 2.262.

5. Determine whether to reject H0.

You might also like