Inference For Matched Pairs and Two-Sample Data: Will Landau

Inference for
Matched Pairs and

Two-Sample Data
Will Landau
Matched Pairs
Two-Sample
Inference for Matched Pairs and Inference: Large
Samples
Two-Sample Data
Will Landau
Iowa State University
Apr 4, 2013
© Will Landau Iowa State University Apr 4, 2013 1 / 24

Inference for
Outline Matched Pairs and
Two-Sample Data
Will Landau
Matched Pairs
Two-Sample
Inference: Large
Samples
Matched Pairs
Two-Sample Inference: Large Samples

Inference for
Matched pairs Matched Pairs and
Two-Sample Data
Will Landau
Matched Pairs
Two-Sample
I A matched pairs dataset is for which measurements Inference: Large
Samples
naturally group into pairs.
I Examples:
I Practice SAT scores before and after a prep course.
I Severity of a disease before and after a treatment.
I Leading edge measurement and trailing edge
measurement for each workpiece in a sample.
I Your height and the height of your friend, measured
once each year for several years.
I Bug bites on on right arm and bug bites on left arm
(one has repellent and the other doesn’t).

Inference for
Example: fuel economy Matched Pairs and
Two-Sample Data
Will Landau
I Twelve cars were equipped with radial tires and driven over a test
Matched Pairs
course.
Two-Sample
I Then the same 12 cars (with the same drivers) were equipped with Inference: Large
regular belted tires and driven over the same course. Samples
I After each run, the cars gas economy (in km/l) was measured.
1 2 3 4 5 6
Radial 4.2 4.7 6.6 7.0 6.7 4.5
Belted 4.1 4.9 6.2 6.9 6.8 4.4
7 8 9 10 11 12
Radial 5.7 6.0 7.4 4.9 6.1 5.2
Belted 5.7 5.8 6.9 4.7 6.0 4.9
I Using significance level α = 0.05 and the method of critical values, test
for a difference in fuel economy between the radial tires and belted tires.
I Construct a 95% confidence interval for true mean difference due to tire
type.

Inference for
Two-Sample Data
Will Landau
Matched Pairs
I First, calculate the differences (radial - belted): Two-Sample

Inference: Large
Samples
1 2 3 4 5 6
Radial 4.2 4.7 6.6 7.0 6.7 4.5
Belted 4.1 4.9 6.2 6.9 6.8 4.4
Difference 0.1 -0.2 0.4 0.1 -0.1 0.1
7 8 9 10 11 12
Radial 5.7 6.0 7.4 4.9 6.1 5.2
Belted 5.7 5.8 6.9 4.7 6.0 4.9
Difference 0 0.2 0.5 0.2 0.1 0.3
I d = 0.142, sd = 0.198

Inference for
Two-Sample Data
1. H0 : µd = 0, Ha : µd 6= 0 Will Landau
2. α = 0.05
Matched Pairs
3. I use the test statistic:
Two-Sample
Inference: Large
d −0 Samples
K = √
sd / n
which has a tn−1 = t11 distribution, assuming:

I H0 is true.
I d1 , . . . , d12 were independent draws from N(µd , σd2 )
I I will reject H0 if |K | > |t11,1−α/2 | = t11,0.975 = 2.20
4. The moment of truth:
0.142
K = √ = 2.48
0.198/ 12
5. With K = 2.48 > 2.20, I reject H0 .

6. There is enough evidence to conclude that the fuel economy differs
between radial tires and belted tires.

Inference for
Two-Sample Data
Will Landau
I The two-sided 95% confidence interval for the true Matched Pairs
mean fuel economy difference is: Two-Sample
Inference: Large
sd sd Samples
= (d − t11,1−α/2 √ , d − t11,1−α/2 √ )
n n
0.198 0.198
= (0.142 − t11,0.975 √ , 0.142 + t11,0.975 √ )
12 12
= (0.142 − 2.20 · 0.057, 0.142 + 2.20 · 0.057)
= (0.0166, 0.2674)
I We’re 95% confident that for the car type studied,

radial tires get between 0.0166 km/l and 0.2674 km/l
more in fuel economy than belted tires.

Inference for
Your Turn: wood product Matched Pairs and
Two-Sample Data
I Consider the operation of an end-cut router in the manufacture of a
Will Landau
company’s wood product.
I Both a leading-edge and a trailing-edge measurement were made on
Matched Pairs
each wooden piece to come off the router.
Two-Sample
Inference: Large
Samples
I Is the leading edge measurement different from the trailing edge

measurement for a typical wood piece? Do a hypothesis test at
α = 0.05 to find out.
I Make a two-sided 95% confidence interval for the true mean of the
difference between the measurements.
Inference for
Answers: wood product Matched Pairs and
Two-Sample Data
Will Landau
I Take paired differences (leading edge - trailing edge). Matched Pairs
Two-Sample
Inference: Large
Samples
I The sample mean is d = −8 × 10−4 , and the sample

standard deviation is sd = 0.0023.
I Let µd be the true mean of the differences.

Inference for
Two-Sample Data
1. H0 : µd = 0, Ha : µd 6= 0. Will Landau
2. α = 0.05, n = 5.
3. Since σd is unknown, I use the test statistic: Matched Pairs
Two-Sample
d −0 Inference: Large
K = √ Samples
sd / n
I Assume d1 , . . . , d5 ∼ N(µd , σd2 )

I K ∼ tn−1 = t4 .
I Reject H0 if |K | > |t4, 1−α/2 |
4. The moment of truth:
−8 × 10−4 − 0
K = √ = −0.78
0.0023/ 5
t4,1−α/2 = t4,1−0.05/2 = t4,0.975 = 2.78
5. Since |K | = 0.78 6> 2.78 = t4,0.975 , I fail to reject H0 .

6. There is not enough evidence to conclude that the leading edge
measurements differ significantly from the trailing edge measurements.

Inference for
Two-Sample Data
Will Landau
Matched Pairs
I I can make a two-sided 95% confidence interval for µd in the usual way: Two-Sample
Inference: Large
Samples
s s
d − t4, · √ , d + t4, 1−α/2 · √
1−α/2
n n

0.0023 0.0023
= −8 × 10−4 − t4,0.975 · √ , −8 × 10−4 + t4,0.975 · √
5 5
= −8 × 10−4 − 2.78 · 0.0010, −8 × 10−4 + 2.78 · 0.0010

= (−0.00358, 0.00198)
I We are 95% confident that the true mean difference between leading
edge and trailing edge measurements is between -0.00358 in and
0.001298 in.

Inference for
Outline Matched Pairs and
Two-Sample Data
Will Landau
Matched Pairs
Two-Sample
Inference: Large
Samples
Matched Pairs
Two-Sample Inference: Large Samples

Inference for
Two-sample inference Matched Pairs and
Two-Sample Data
I Comparing the means of two distinct populations Will Landau
without pairing up individual measurements. Matched Pairs

I Examples: Two-Sample
Inference: Large
I SAT scores of high school A vs. high school B. Samples
I Severity of a disease in women vs. in men.
I Heights of New Zealanders vs. heights of Ethiopians.
I Coefficients of friction after wear of sandpaper A vs.
sandpaper B.
I Notation:
Sample 1 2
Sample size n1 n2
True mean µ1 µ2
Sample mean x1 x2
True variance σ12 σ22
Sample variance s12 s22
Inference for
n1 ≥ 25 and n2 ≥ 25, variances known Matched Pairs and
Two-Sample Data
I We want to test H0 : µ1 − µ2 = # with some alternative hypothesis
I If σ12 and σ22 are known, use the test statistic: Will Landau
(x 1 − x 2 ) − # Matched Pairs
K = r
σ12 σ22 Two-Sample
n1
+ n2 Inference: Large
Samples
which has a N(0, 1) distribution if:
I H0 is true.
I The sample 1 points are iid with mean µ1 and variance
σ12 , and the sample 2 points are iid with mean µ2 and
variance σ22 .
I The confidence intervals (2-sided, 1-sided upper, and 1-sided lower,
respectively) for µ1 − µ2 are:
 s s 
σ12 σ22 σ12 σ22
(x1 − x 2 ) − z1−α/2 + , (x1 − x 2 ) + z1−α/2 + 
n1 n2 n1 n2
 s 
σ12 σ22
−∞, (x1 − x 2 ) + z1−α + 
n1 n2
 s 
σ12 σ22
(x1 − x 2 ) − z1−α + , ∞
n1 n2
Inference for
n1 ≥ 25 and n2 ≥ 25, variances UNknown Matched Pairs and
Two-Sample Data
Will Landau
I If σ12 and σ22 are UNknown, use the test statistic:
Matched Pairs
(x 1 − x 2 ) − # Two-Sample
K = r Inference: Large
s12 s22 Samples
n1
+ n2
I And confidence intervals for µ1 − µ2 :

 s s 
s12 s2 s12 s22
(x1 − x 2 ) − z1−α/2 + 2 , (x1 − x 2 ) + z1−α/2 + 
n1 n2 n1 n2
 s 
s12 s22
−∞, (x1 − x 2 ) + z1−α + 
n1 n2
 s 
s12 s22
(x1 − x 2 ) − z1−α + , ∞
n1 n2

Inference for
Example: packing weights Matched Pairs and
Two-Sample Data
Will Landau
I A company research effort involved finding a workable
Matched Pairs
geometry for molded pieces of a solid.
Two-Sample
I One comparison made was between the weight (in Inference: Large
Samples
grams) of molded pieces of a particular geometry that
could be poured into a standard container, and the
weight of irregularly shaped pieces (obtained through
crushing), that could be poured into the same container.
I n1 = 24 crushed pieces and n2 = 24 molded pieces were
made and weighed.
I µ1 is the true mean packing weight of the crushed
pieces, and µ2 is the true mean packing weight of the
molded pieces.
I I want to formally test the claim that the crushed
weights are greater than the molded weights.

Inference for
Two-Sample Data
Will Landau
1. H0 : µ1 − µ2 = 0, Ha : µ1 − µ2 > 0. Matched Pairs
Two-Sample
2. α = 0.05 Inference: Large
Samples
3. The test statistic is:
(x 1 − x 2 ) − 0
K= q 2
s1 s22
n1 + n2
I n1 and n2 are each < 25, but each sample is normally

distributed enough to flex that rule and allow
n1 = n2 = 24.
I Assume the crushed weights are iid (µ1 , σ12 ).
I Assume the molded weights are iid (µ2 , σ22 ).
I K ∼ N(0, 1) under the null hypothesis.

Inference for
Two-Sample Data
Will Landau
4. The moment of truth: Matched Pairs
Two-Sample
(x 1 − x 2 ) − 0 179.55 − 132.97 − 0 Inference: Large
K= = q = 18.3 Samples
s12 s22 (8.34)2 (9.31)2
n1 + n2 + 24
24
p-value = P(Z > K ) = 1 − Φ(K ) = 1 − Φ(18.3)
= 4 × 10−75
5. With a p-value of 4 × 10−75 < α, we reject H0 in favor

of Ha .
6. There is overwhelming evidence that more crushed solid
material by weight can be poured into the container
than molded solid material.

Inference for
Two-Sample Data
Will Landau
I The analogous lower 95% confidence interval for
µ1 − µ2 is: Matched Pairs
Two-Sample
 s  Inference: Large
2 2 Samples
(x1 − x 2 ) − z1−α s1 + s2 , ∞
n1 n2
r !
(8.34)2 (9.31)2
= (179.55 − 132.97) − z0.95 + , ∞
24 24
= (46.58 − 1.64 · 2.55, ∞)
= (42.40, ∞)
I We’re 95% confident that the true mean packing weight

of crushed solids is at least 42.40 g greater than that of
the molded solids.
Inference for
Two-Sample Data
Will Landau
Matched Pairs
Two-Sample
Inference: Large
Samples

Inference for
Your turn: anchor bolts Matched Pairs and
Two-Sample Data
I An experiment carried out to study various characteristics of anchor Will Landau
bolts resulted in 78 observations on shear strength (kip) of 3/8-in.
diameter bolts and 88 observations on strength of 1/2-in. diameter Matched Pairs
bolts. Two-Sample
Inference: Large
Samples
I Let Sample 1 be the 1/2 in diameter bolts and Sample 2 be the 3/8 in
diameter bolts.
I Using a significance level of α = 0.01, find out if the 1/2 in bolts are
more than 2 kip stronger (in shear strength) than the 3/8 in bolts.
I Calculate and interpret the appropriate 99% confidence interval to
support the analysis.

Inference for
Answers: anchor bolts Matched Pairs and
Two-Sample Data
I n1 = 88, n2 = 78. Will Landau
I x 1 = 7.14, x 2 = 4.25 Matched Pairs

I s1 = 1.68, s2 = 1.3 Two-Sample
Inference: Large
Samples
1. H0 : µ1 − µ2 = 2, Ha : µ1 − µ2 > 2
2. α = 0.01
3. The test statistic is:
(x 1 − x 2 ) − 2
K= q 2
s1 s22
n1 + n2
I Assume:
I H0 is true.
I Sample 1 points are drawn from iid (µ1 , σ12 )
distributions.
I Sample 2 points are drawn from iid (µ2 , σ22 )
distributions.
I Then, K ∼ N(0, 1)
Inference for
Two-Sample Data
Will Landau
4. The moment of truth: Matched Pairs
Two-Sample
(x 1 − x 2 ) − 2) (7.14 − 4.25) − 2 Inference: Large
K= = q = 3.84 Samples
s12 s22 (1.68)2 (1.3)2
n1 + n2 + 78
88
p-value = P(Z > K ) = 1 − P(Z ≤ K ) = 1 − P(Z ≤ 3.84)
= 1 − Φ(3.84) ≈ 0
5. With a p-value ≈ 0 < α = 0.01, we reject H0 in favor of

Ha .
6. There is overwhelming evidence that the 1/2 in anchor
bolts are more than 2 kip stronger in shear strength than
the 3/8 in bolts.

Inference for
Two-Sample Data
Will Landau
I I use a lower confidence interval for µ1 − µ2 :
Matched Pairs
 s  Two-Sample
Inference: Large
(x1 − x 2 ) − z1−α s12 s22 Samples
+ , ∞
n1 n2
r !
1.682 1.32
= (7.14 − 4.25) − z0.99 · + , ∞
88 78
= (2.89 − 2.33 · 0.232, ∞)
= (2.35, ∞)
I We’re 99% confident that the true mean shear strength

of the 1/2 in anchor bolts is at least 2.35 kip more than
the true mean shear strength of the 3/8 in anchor bolts.

Inference For Matched Pairs and Two-Sample Data: Will Landau

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Inference For Matched Pairs and Two-Sample Data: Will Landau

Uploaded by

Copyright:

Available Formats

Inference for

Matched Pairs and

Iowa State University

© Will Landau Iowa State University Apr 4, 2013 1 / 24

Two-Sample Inference: Large Samples

© Will Landau Iowa State University Apr 4, 2013 2 / 24

© Will Landau Iowa State University Apr 4, 2013 3 / 24

© Will Landau Iowa State University Apr 4, 2013 4 / 24

I First, calculate the differences (radial - belted): Two-Sample

© Will Landau Iowa State University Apr 4, 2013 5 / 24

which has a tn−1 = t11 distribution, assuming:

5. With K = 2.48 > 2.20, I reject H0 .

© Will Landau Iowa State University Apr 4, 2013 6 / 24

I We’re 95% confident that for the car type studied,

© Will Landau Iowa State University Apr 4, 2013 7 / 24

I Is the leading edge measurement different from the trailing edge

I Take paired differences (leading edge - trailing edge). Matched Pairs

I The sample mean is d = −8 × 10−4 , and the sample

© Will Landau Iowa State University Apr 4, 2013 9 / 24

I Assume d1 , . . . , d5 ∼ N(µd , σd2 )

5. Since |K | = 0.78 6> 2.78 = t4,0.975 , I fail to reject H0 .

© Will Landau Iowa State University Apr 4, 2013 10 / 24

© Will Landau Iowa State University Apr 4, 2013 11 / 24

Two-Sample Inference: Large Samples

© Will Landau Iowa State University Apr 4, 2013 12 / 24

I Comparing the means of two distinct populations Will Landau

without pairing up individual measurements. Matched Pairs

I And confidence intervals for µ1 − µ2 :

© Will Landau Iowa State University Apr 4, 2013 15 / 24

© Will Landau Iowa State University Apr 4, 2013 16 / 24

1. H0 : µ1 − µ2 = 0, Ha : µ1 − µ2 > 0. Matched Pairs

I n1 and n2 are each < 25, but each sample is normally

© Will Landau Iowa State University Apr 4, 2013 17 / 24

4. The moment of truth: Matched Pairs

5. With a p-value of 4 × 10−75 < α, we reject H0 in favor

© Will Landau Iowa State University Apr 4, 2013 18 / 24

I We’re 95% confident that the true mean packing weight

© Will Landau Iowa State University Apr 4, 2013 20 / 24

© Will Landau Iowa State University Apr 4, 2013 21 / 24

I x 1 = 7.14, x 2 = 4.25 Matched Pairs

4. The moment of truth: Matched Pairs

5. With a p-value ≈ 0 < α = 0.01, we reject H0 in favor of

© Will Landau Iowa State University Apr 4, 2013 23 / 24

I We’re 99% confident that the true mean shear strength

© Will Landau Iowa State University Apr 4, 2013 24 / 24

You might also like