Professional Documents
Culture Documents
Work Book
STA 408
2017 Sep
TABLE OF CONTENTS
CHAPTER 4 - Z-TEST
CHAPTER 5 - ANOVA
CHAPTER 6 - REGRESSION
1
CHAPTER 1 - INTRODUCTION TO USING MINITAB
(Reference: https://en.wikipedia.org/wiki/Minitab)
When you open Minitab, you will typically see something that looks like this:
The top window — called the "Session" The bottom window — called the "Worksheet"
window — is where Minitab will output the window — is where you will key-in data or
results of your requested statistical analyses. copy-and-paste data.
The third type of window — called a "Graphics" window — only appears when you've asked
Minitab to plot something. Here is one example of a graphics window:
(Reference: https://onlinecourses.science.psu.edu/statprogram/minitab)
2
CHAPTER 2 - CONFIDENCE INTERVAL
The scores for a music examination were obtained from a random sample of seven students.
The scores are listed below.
Score: 32, 65, 84, 65, 60, 65, 60
Using Minitab:
Minitab output:
One-Sample T: Score
(i) Show that the standard error of mean (SE Mean) is 5.8099.
(ii) Show that the 90% confidence interval for the average score is (50.28, 72.86).
(iii) Based on the 90% confidence interval for the average score, can you conclude that a
student will get a score of 70?
Solution:
3
2.2 Confidence Interval for Two Population Means
Question 1
The scores for an English oral test were obtained from a random sample of male and female
students. The scores are listed below. Assume equal variances for both populations.
Male Female
65, 54, 89, 60, 56, 60 80, 62, 95, 78, 82, 80, 85
Using Minitab:
Minitab output:
(i) Show that the pooled standard deviation (Pooled StDev) is 11.3.
(ii) Assume equal variances for both populations. Show that the 95% confidence interval for
the difference between mean score of male and female is (-30.1, -2.5).
(iii) Based on the 95% confidence interval for the difference between mean score of male
and female, can you conclude that the mean score for male and female students are
different?
Solution:
4
Question 2
For testing the effect of a new medication on pulse rate, 60 subjects are randomly divided
into two groups of 30. Group 1 is given the new medicine and Group 2 is given the placebo.
Minitab output:
(i) Write down the 95% confidence interval for the difference between means of pulse rate
for new medicine and placebo.
(ii) Based on the 95% confidence interval, determine whether there is a difference in the
means of pulse rate between new medicine and placebo.
Solution:
5
2.3 Confidence Interval for Paired Samples
Question 1
A random sample of 6 persons was selected to follow a special exercise program for 10
weeks. The following table gives the weights (in kg) of these 6 persons before and after the
program.
Adult 1 2 3 4 5 6
Before 76 72 74 68 60 85
After 70 75 70 65 62 80
Using Minitab:
Minitab output:
(i) Show that the 99% confidence interval for the mean difference of the paired samples is
(-4.029, 8.362).
(ii) Based on the 99% confidence interval for the mean difference of the paired samples,
can you conclude that there is a difference between the mean weight before and after
following the special exercise program?
Solution:
6
Question 2
Ten pairs of measurements of zinc concentration (in ppb) from 10 reservoirs were taken. For
each reservoir, the amount of zinc concentration in the water at the bottom and at the
surface of the reservoir was measured. The data collected was analysed and the output is
shown below.
Minitab output:
(i) Show that the 95% confidence interval for the mean difference of the paired samples is
(0.043, 0.118).
(ii) Based on the 95% confidence interval for the mean difference of the paired samples,
can you conclude that the average zinc concentration in the water at the bottom and at
the surface of the reservoir are equal?
Solution:
7
2.4 Confidence Interval for One Population Variance
Question 1
The number of carbohydrates found in a random sample of fast-food entrees is listed below.
Carbohydrates in fast food: 53, 46, 47, 38, 73, 39, 39, 43, 41, 30
Minitab output:
Statistics
Tests
Test
Variable Method Statistic DF P-Value
Carbohydrates in fast food Chi-square 12.11 9 0.406
Solution:
8
2.5 Confidence Interval for Two Population Variances
Question 1
A researcher was interested in exploring the driving behaviour of male and female university
students based on driving speed (in mph). She conducted a survey of a random sample of
34 male and 29 female university students. Table below is a descriptive summary of the
results of her survey.
Minitab output:
Statistics
Tests
Test
Method DF1 DF2 Statistic P-Value
F 28 33 0.37 0.009
(i) Construct a 90% confidence interval for the ratio of the two variances.
(ii) Based on the 90% confidence interval, can we conclude that the variances are
different?
Solution:
9
CHAPTER 3 - HYPOTHESIS TEST
An educator claims that the average salary of substitute teachers in school districts in
Allegheny County, Pennsylvania, is less than $60 per day. A random sample of 8 school
districts is selected, and the daily salaries (in dollars) are listed below.
At 10% level of significance, is there enough evidence to support the educator’s claim that
average salary is less than $60 per day? Use p-value.
Using Minitab:
Minitab output:
One-Sample T: Salary
Test of mu = 60 vs < 60
90%
Upper
Variable N Mean StDev SE Mean Bound T P
Salary 8 58.8750 5.0832 1.7972 61.4179 -0.63 0.276
Solution:
10
Question 2
pH: 6.2, 7.3, 6.4, 5.8, 6.5, 7.0, 6.2, 6.7, 6.7
At 0.05 , is there enough evidence to support a researcher’s claim that average pH value
is different from 6 ? Use p-value.
Using Minitab:
Minitab output:
One-Sample T: pH
Test of mu = 6 vs not = 6
Solution:
11
3.2 Hypothesis Test for Two Population Means
Question 1
A researcher hypothesizes that people who are allowed to sleep for only 4 hours will score
differently on a cognitive skills test compare to people who are allowed to sleep for 8 hours.
He brings 16 participants into his sleep lab and randomly assigns them to one of two groups.
In one group he has participants sleep for 8 hours and in the other group he has them
sleeping for 4 hours. The next morning he administers the SCAT (Sam’s Cognitive Ability
Test) to all participants. Scores on the SCAT range from 1-9 with higher scores representing
better performance. Assume that the population variances are equal.
Test if there is a difference between the two means scores at 5% level of significance. Use
p-value.
Using Minitab:
Minitab output:
Solution:
12
Question 2
The statistics examination scores obtained from a random sample of students are listed
below. Assume that the population variances are equal.
Using 5% level of significance, test whether male students outperform female students. Use
p-value.
Using Minitab:
Minitab output:
Solution:
13
3.3 Hypothesis Test for Paired Samples
Question 1
A psychologist wishes to verify that a certain drug increases the reaction time to a given
stimulus. The following reaction times (in tenths of a second) were recorded before and after
injection of the drug for each of 4 subjects:
Based on the above computer output, test at 5% level of significance to determine whether
the drug significantly increases reaction time. Use p-value.
Using Minitab:
Minitab output:
Solution:
14
Question 2
As an aid for improving students study habits, 9 students were randomly selected to attend a
seminar on the importance of education in life. The table below shows the number of hours
each student study per week before and after the seminar. At 10% level of significance, did
attending the seminar increase the number of hours the students study per week?
Before After
6 9
9 9
15 20
12 17
7 6
10 15
18 21
13 22
3 2
Using Minitab:
Minitab output:
Solution:
15
3.4 Hypothesis Test for One Population Variance
Question 1
The number of carbohydrates found in a random sample of fast-food entrees is listed below.
Carbohydrates in fast food: 53, 46, 47, 38, 73, 39, 39, 43, 41, 30
Minitab output:
Statistics
Tests
Test
Variable Method Statistic DF P-Value
Carbohydrates in fast food Chi-square 12.11 9 0.406
(i) At 5% level of significance, is there sufficient evidence to conclude that the variance
differs from 100? Use p-value.
(ii) Prove that the test statistic is 12.11.
Solution:
16
3.5 Hypothesis Test for Two Population Variances
Question 1
A researcher was interested in exploring the driving behaviours of male and female
university students based on driving speed (in mph). She conducted a survey of a random
sample of 34 male and 29 female university students. Table below is a descriptive summary
of the results of her survey.
Minitab output:
Statistics
Tests
Test
Method DF1 DF2 Statistic P-Value
F 28 33 0.37 0.009
(i) At the 10% level of significance, is there sufficient evidence to conclude that the
variance of the speed driven by male university students differs from the variance of the
speed driven by female university students? Use p-value.
(ii) Prove that the test statistic is 0.37.
(iii) What is the degree of freedom for male university students for the F test?
Solution:
17
CHAPTER 4 - Z-TEST
Question 1
The amount of coffee (in ounces) filled by a machine in 6 randomly selected jars are listed
below.
Amount: 15.9, 15.7, 16.3, 15.9, 15.7, 16.2
Assume a normal distribution for the amount of coffee in a jar with population standard
deviation of 0.229 ounce.
Using Minitab:
Minitab output:
One-Sample Z: Amount
Test of mu = 16 vs not = 16
The assumed standard deviation = 0.229
Solution:
(i) Traditional method of z-test for a mean
Population standard deviation is known, = 0.229.
Sample mean x = 15.95, sample size n = 6.
Step 1: Ho: µ = 16
H1: µ ≠ 16
18
(ii) p-value method
Step 1: Ho: µ = 16
H1: µ ≠ 16
Solution:
Population standard deviation is known, = 0.229.
Sample mean x = 15.95, sample size n = 6.
19
CHAPTER 5 - ANOVA
A lot of different factors contribute to air pollution. One particular factor, particulate matter,
was measured for prominent cities of 3 continents. Particulate matter includes smoke, soot,
dust and liquid droplets from combustion such that the particle is less than 10 microns in
diameter and thus capable of reaching deep into the respiratory system. The measurements
are listed below.
Using Minitab:
Minitab output:
Source DF SS MS F P
Factor 2 4230 2115 6.65 0.020
Error 8 2544 318
Total 10 6774
Solution:
20
5.2 Two-way ANOVA
Question 1
A company recently purchased a new computer software package. The employees were
given training on using the new software. There are 4 different training methods with 3
training instructors. After completing the training, each employee was given identical
computer task. The time (in minutes) to complete the tasks were recorded, resulting in the
following data. The main objective of this experiment is to investigate whether different
training methods have an effect on the time to complete the tasks.
Method
A B C D
I 20 15 30 24
Instructor II 17 19 21 16
III 22 25 26 21
Using Minitab:
Minitab output:
Source DF SS MS F P
Instructor 2 60.167 30.0833 2.36 0.175
Method 3 76.000 25.3333 1.99 0.217
Error 6 76.500 12.7500
Total 11 212.667
Solution:
21
Question 2
An official of a small transit system with only five buses wants to study the tread wear of 4
types of tyres. Each of the buses runs a different route so that terrain and driving conditions
differ from bus to bus. One tyre of each type is placed on each bus, with the wheel positions
being assigned randomly. The tyres are run for 15,000 kilometers, after which the tread wear
(in millimeters) is measured. The computer output is shown below.
Minitab output:
Source DF SS MS F P
Tyre (A) (C) 133.779 (E) 0.000
Bus 4 81.525 20.381 9.35 0.001
Error (B) 26.167 (D)
Total 19 509.030
(iv) By using the p-value, test at 5% level of significance whether there is sufficient evidence
to indicate that a difference exists in the tread wear in different types of tyres.
(v) Is there sufficient evidence to indicate that tread wear of tyres varies from bus to bus?
Test at 5% level of significance. Use p-value.
Solution:
22
5.3 Two-way ANOVA with Interaction
Question 1
Suppose we want to compare type of popcorn and brand of popcorn with respect to their
yield (in terms of cups per popcorn). Factor A is the type of popcorn: oil-based versus air-
based. Factor B is the brand of popcorn: gourmet versus national versus generic. For each
combination of popper type and brand, three separate measurements are taken as shown in
the table below.
Brand of popcorn
Gourmet National Generic
6 5 3
Oil 5.5 4.5 4
Type
7 4 3.5
of
7 5 5
popcorn
Air 6 5 4.5
6.5 5.5 4
Using Minitab:
Minitab output:
Source DF SS MS F P
Type of popcorn 1 2.0000 2.00000 (e) 0.020
Brand (a) 16.7778 8.38889 30.20 0.000
Interaction 2 (c) 0.16667 0.60 0.564
Error 12 3.3333 (d)
Total (b) 22.4444
(v) Find the missing values (a, b, c, d, e) in the ANOVA table above.
23
(vi) Do the data provide sufficient evidence to indicate an interaction between the two
factors at 5% significance level? Use p-value.
(vii) At 5% level of significance, do the data provide sufficient evidence to indicate that there
is a significant effect on mean yield due to type of popcorn? Use p-value.
(viii) At 5% level of significance, do the data provide sufficient evidence to indicate that
there is a significant effect on mean yield due to brand of popcorn? Use p-value.
24
CHAPTER 6 - REGRESSION
Question 1
The body mass index, BMI (ratio of weight in kilograms to height in meter 2) and HDL
cholesterol (in miligrams per deciliter, mg/dL) of 10 randomly selected samples are given in
the following table.
BMI
20.3 21.6 21.9 24 25 27.2 30 32 28.8 31.8
(kg/m2)
HDL
64 57 60 52 50 47 45 40 55 43
(mg/dL)
Using Minitab:
Minitab output:
Analysis of Variance
Source DF SS MS F P
Regression 1 431.54 431.54 31.80 0.000
Residual Error 8 108.56 13.57
Total 9 540.10
(ii) Write down the estimated least squares regression line. Interpret the meaning of the
regression coefficients, a and b.
25
(iii) Estimate Ahmad’s HDL cholesterol if his BMI is 26.
(vi) At 5% level of significance, test whether the linear regression model is significant. Use
p-value.
26
27