You are on page 1of 28

Minitab

Work Book
STA 408

2017 Sep
TABLE OF CONTENTS

CHAPTER 1 - INTRODUCTION TO USING MINITAB

CHAPTER 2 - CONFIDENCE INTERVAL

2.1 Confidence Interval for One Population Mean


2.2 Confidence Interval for Two Population Means
2.3 Confidence Interval for Paired Samples
2.4 Confidence Interval for One Population Variance
2.5 Confidence Interval for Two Population Variances

CHAPTER 3 - HYPOTHESIS TEST

3.1 Hypothesis Test for One Population Mean


3.2 Hypothesis Test for Two Population Means
3.3 Hypothesis Test for Paired Samples
3.4 Hypothesis Test for One Population Variance
3.5 Hypothesis Test for Two Population Variances

CHAPTER 4 - Z-TEST

CHAPTER 5 - ANOVA

5.1 One-way ANOVA


5.2 Two-way ANOVA
5.3 Two-way ANOVA with Interaction

CHAPTER 6 - REGRESSION

1
CHAPTER 1 - INTRODUCTION TO USING MINITAB

Minitab is a statistics package developed at the Pennsylvania State University by


researchers Barbara F. Ryan, Thomas A. Ryan, Jr., and Brian L. Joiner in 1972.

(Reference: https://en.wikipedia.org/wiki/Minitab)

Three Types of Windows

When you open Minitab, you will typically see something that looks like this:

The top window — called the "Session" The bottom window — called the "Worksheet"
window — is where Minitab will output the window — is where you will key-in data or
results of your requested statistical analyses. copy-and-paste data.

The third type of window — called a "Graphics" window — only appears when you've asked
Minitab to plot something. Here is one example of a graphics window:

(Reference: https://onlinecourses.science.psu.edu/statprogram/minitab)

2
CHAPTER 2 - CONFIDENCE INTERVAL

2.1 Confidence Interval for One Population Mean


Question 1

The scores for a music examination were obtained from a random sample of seven students.
The scores are listed below.
Score: 32, 65, 84, 65, 60, 65, 60

Using Minitab:

Select Stat > Basic Statistics > 1-Sample t


- Choose Score as the variable
- Click [Options] , Confidence: 90, Alternative: not equal

Minitab output:

One-Sample T: Score

Variable N Mean StDev SE Mean 90% CI


Score 7 61.5714 15.3716 5.8099 (50.2817, 72.8611)

(i) Show that the standard error of mean (SE Mean) is 5.8099.
(ii) Show that the 90% confidence interval for the average score is (50.28, 72.86).
(iii) Based on the 90% confidence interval for the average score, can you conclude that a
student will get a score of 70?

Solution:

3
2.2 Confidence Interval for Two Population Means
Question 1

The scores for an English oral test were obtained from a random sample of male and female
students. The scores are listed below. Assume equal variances for both populations.

Male Female
65, 54, 89, 60, 56, 60 80, 62, 95, 78, 82, 80, 85

Using Minitab:

Select Stat > Basic Statistics > 2-Sample t


- Choose Male and Female as the variables
- Click [Assume equal variances]
- Click [Option], Confidence: 95, Test difference: 0, Alternative: not equal

Minitab output:

Two-Sample T-Test and CI: Male, Female

Two-sample T for Male vs Female

N Mean StDev SE Mean


Male 6 64.0 12.8 5.2
Female 7 80.29 9.84 3.7

Difference = mu (Male) - mu (Female)


Estimate for difference: -16.2857
95% CI for difference: (-30.1170, -2.4544)
T-Test of difference = 0 (vs not =): T-Value = -2.59 P-Value = 0.025
DF = 11
Both use Pooled StDev = 11.2953

(i) Show that the pooled standard deviation (Pooled StDev) is 11.3.
(ii) Assume equal variances for both populations. Show that the 95% confidence interval for
the difference between mean score of male and female is (-30.1, -2.5).
(iii) Based on the 95% confidence interval for the difference between mean score of male
and female, can you conclude that the mean score for male and female students are
different?

Solution:

4
Question 2

For testing the effect of a new medication on pulse rate, 60 subjects are randomly divided
into two groups of 30. Group 1 is given the new medicine and Group 2 is given the placebo.

Minitab output:

Two-Sample T-Test and CI:

N Mean StDev SE Mean


Group 1 30 65.20 7.80 1.4
Group 2 30 70.30 8.40 1.5

Difference = mu (Male) - mu (Female)


Estimate for difference: -5.10
95% CI for difference: (-9.29, -0.91)
T-Test of difference = 0 (vs not =): T-Value = -2.44 P-Value = 0.018
DF = 57

(i) Write down the 95% confidence interval for the difference between means of pulse rate
for new medicine and placebo.
(ii) Based on the 95% confidence interval, determine whether there is a difference in the
means of pulse rate between new medicine and placebo.

Solution:

5
2.3 Confidence Interval for Paired Samples
Question 1

A random sample of 6 persons was selected to follow a special exercise program for 10
weeks. The following table gives the weights (in kg) of these 6 persons before and after the
program.

Adult 1 2 3 4 5 6
Before 76 72 74 68 60 85
After 70 75 70 65 62 80

Using Minitab:

Select Stat > Basic Statistics > Paired t


- Choose Before and After as the variables
- Click [Option], Confidence: 99, Test mean: 0, Alternative: not equal

Minitab output:

Paired T-Test and CI: Before, After

Paired T for Before - After

N Mean StDev SE Mean


Before 6 72.5000 8.3367 3.4034
After 6 70.3333 6.5320 2.6667
Difference 6 2.16667 3.76386 1.53659

99% CI for mean difference: (-4.02909, 8.36242)


T-Test of mean difference = 0 (vs not = 0): T-Value = 1.41 P-Value = 0.218

(i) Show that the 99% confidence interval for the mean difference of the paired samples is
(-4.029, 8.362).
(ii) Based on the 99% confidence interval for the mean difference of the paired samples,
can you conclude that there is a difference between the mean weight before and after
following the special exercise program?

Solution:

6
Question 2

Ten pairs of measurements of zinc concentration (in ppb) from 10 reservoirs were taken. For
each reservoir, the amount of zinc concentration in the water at the bottom and at the
surface of the reservoir was measured. The data collected was analysed and the output is
shown below.

Minitab output:

Paired T-Test and CI: bottom, surface

Paired T for bottom - surface

N Mean StDev SE Mean


bottom 10 0.5649 0.146781 0.046416
surface 10 0.4845 0.131229 0.041498
Difference 10 0.0804 0.052273 0.016530

95% CI for mean difference: (0.043006, 0.117794)


T-Test of mean difference = 0 (vs not = 0): T-Value = 4.86 P-Value = 0.000

(i) Show that the 95% confidence interval for the mean difference of the paired samples is
(0.043, 0.118).
(ii) Based on the 95% confidence interval for the mean difference of the paired samples,
can you conclude that the average zinc concentration in the water at the bottom and at
the surface of the reservoir are equal?

Solution:

7
2.4 Confidence Interval for One Population Variance
Question 1

The number of carbohydrates found in a random sample of fast-food entrees is listed below.

Carbohydrates in fast food: 53, 46, 47, 38, 73, 39, 39, 43, 41, 30

Minitab output:

Test and CI for One Variance: Carbohydrates in fast food

Null hypothesis Sigma-squared = 100


Alternative hypothesis Sigma-squared not = 100

The chi-square method is only for the normal distribution.

Statistics

Variable N StDev Variance


Carbohydrates in fast food 10 11.6 135

Tests

Test
Variable Method Statistic DF P-Value
Carbohydrates in fast food Chi-square 12.11 9 0.406

(i) Construct a 95% confidence interval for the variance.


(ii) Based on the 95% confidence interval, can we conclude that the variance is different
from 100?

Solution:

8
2.5 Confidence Interval for Two Population Variances
Question 1

A researcher was interested in exploring the driving behaviour of male and female university
students based on driving speed (in mph). She conducted a survey of a random sample of
34 male and 29 female university students. Table below is a descriptive summary of the
results of her survey.

Female (F) Male (M)


n 29 34
mean 90.9 105.5
sample standard deviation 12.2 20.1

Minitab output:

Test for Two Variances

Null hypothesis Sigma (F) / Sigma (M) = 1


Alternative hypothesis Sigma (F) / Sigma (M) not = 1
Significance level Alpha = 0.10

Statistics

Sample N StDev Variance


F 29 12.200 148.840
M 34 20.100 404.010

Ratio of standard deviations = 0.607


Ratio of variances = 0.368

Tests
Test
Method DF1 DF2 Statistic P-Value
F 28 33 0.37 0.009

(i) Construct a 90% confidence interval for the ratio of the two variances.
(ii) Based on the 90% confidence interval, can we conclude that the variances are
different?

Solution:

9
CHAPTER 3 - HYPOTHESIS TEST

3.1 Hypothesis Test for One Population Mean


Question 1

An educator claims that the average salary of substitute teachers in school districts in
Allegheny County, Pennsylvania, is less than $60 per day. A random sample of 8 school
districts is selected, and the daily salaries (in dollars) are listed below.

Salary: 60, 56, 60, 55, 70, 55, 60, 55

At 10% level of significance, is there enough evidence to support the educator’s claim that
average salary is less than $60 per day? Use p-value.

Using Minitab:

Select Stat > Basic Statistics > 1-Sample t


- Choose Salary as the variable
- Test mean: 60
- Click [Option], Confidence: 90, Alternative: less than

Minitab output:

One-Sample T: Salary

Test of mu = 60 vs < 60
90%
Upper
Variable N Mean StDev SE Mean Bound T P
Salary 8 58.8750 5.0832 1.7972 61.4179 -0.63 0.276

Solution:

10
Question 2

The pH values of a random sample of 9 rivers are listed below.

pH: 6.2, 7.3, 6.4, 5.8, 6.5, 7.0, 6.2, 6.7, 6.7

At   0.05 , is there enough evidence to support a researcher’s claim that average pH value
is different from 6 ? Use p-value.

Using Minitab:

Select Stat > Basic Statistics > 1-Sample t


- Choose pH as the variable
- Test mean: 6
- Click [Option], Confidence: 95, Alternative: not equal

Minitab output:

One-Sample T: pH

Test of mu = 6 vs not = 6

Variable N Mean StDev SE Mean 95% CI T P


pH 9 6.53333 0.45277 0.15092 (6.18530, 6.88136) 3.53 0.008

Solution:

11
3.2 Hypothesis Test for Two Population Means
Question 1

A researcher hypothesizes that people who are allowed to sleep for only 4 hours will score
differently on a cognitive skills test compare to people who are allowed to sleep for 8 hours.
He brings 16 participants into his sleep lab and randomly assigns them to one of two groups.
In one group he has participants sleep for 8 hours and in the other group he has them
sleeping for 4 hours. The next morning he administers the SCAT (Sam’s Cognitive Ability
Test) to all participants. Scores on the SCAT range from 1-9 with higher scores representing
better performance. Assume that the population variances are equal.

Group SCAT scores


Sleep 8 hours 5, 7, 5, 3, 5, 3, 3, 9
Sleep 4 hours 8, 1, 4, 6, 6, 4, 1, 2

Test if there is a difference between the two means scores at 5% level of significance. Use
p-value.

Using Minitab:

Select Stat > Basic Statistics > 2-Sample t


- Choose Sleep 8 hours and Sleep 4 hours as the variables
- Click [Assume equal variances]
- Click [Option], Confidence: 95, Test difference: 0, Alternative: not equal

Minitab output:

Two-Sample T-Test and CI: Sleep 8 hours, Sleep 4 hours

Two-sample T for Sleep 8 hours vs Sleep 4 hours

N Mean StDev SE Mean


Sleep 8 hours 8 5.00 2.14 0.76
Sleep 4 hours 8 4.00 2.56 0.91

Difference = mu (Sleep 8 hours) - mu (Sleep 4 hours)


Estimate for difference: 1.00000
95% CI for difference: (-1.53126, 3.53126)
T-Test of difference = 0 (vs not =): T-Value = 0.85 P-Value = 0.411
DF = 14
Both use Pooled StDev = 2.3604

Solution:

12
Question 2

The statistics examination scores obtained from a random sample of students are listed
below. Assume that the population variances are equal.

Male 34, 34, 30, 18, 24, 16, 15,


26, 13, 21, 24, 11, 11, 15
Female 18, 33, 27, 11, 23, 23, 18,
20, 26, 10, 20, 22, 14, 21

Using 5% level of significance, test whether male students outperform female students. Use
p-value.

Using Minitab:

Select Stat > Basic Statistics > 2-Sample t


- Choose Male and Female as the variables
- Click [Assume equal variances]
- Click [Option], Confidence: 95, Test difference: 0, Alternative: greater than

Minitab output:

Two-Sample T-Test and CI: Male, Female

Two-sample T for Male vs Female

N Mean StDev SE Mean


Male 14 20.86 8.00 2.1
Female 14 20.43 6.20 1.7

Difference = mu (Male) - mu (Female)


Estimate for difference: 0.428571
95% lower bound for difference: -4.184162
T-Test of difference = 0 (vs >): T-Value = 0.16 P-Value = 0.438
DF = 26
Both use Pooled StDev = 7.1553

Solution:

13
3.3 Hypothesis Test for Paired Samples
Question 1

A psychologist wishes to verify that a certain drug increases the reaction time to a given
stimulus. The following reaction times (in tenths of a second) were recorded before and after
injection of the drug for each of 4 subjects:

Subject Before After


1 7 13
2 2 3
3 12 18
4 12 13

Based on the above computer output, test at 5% level of significance to determine whether
the drug significantly increases reaction time. Use p-value.

Using Minitab:

Select Stat > Basic Statistics > Paired t


- Choose Before and After as the variables
- Click [Option], Confidence: 95, Test mean: 0, Alternative: less than

Minitab output:

Paired T-Test and CI: Before, After

Paired T for Before - After

N Mean StDev SE Mean


Before 4 8.2500 4.7871 2.3936
After 4 11.7500 6.2915 3.1458
Difference 4 -3.50000 2.88675 1.44338

95% upper bound for mean difference: -0.10321


T-Test of mean difference = 0 (vs < 0): T-Value = -2.42 P-Value = 0.047

Solution:

14
Question 2

As an aid for improving students study habits, 9 students were randomly selected to attend a
seminar on the importance of education in life. The table below shows the number of hours
each student study per week before and after the seminar. At 10% level of significance, did
attending the seminar increase the number of hours the students study per week?

Before After
6 9
9 9
15 20
12 17
7 6
10 15
18 21
13 22
3 2

Using Minitab:

Select Stat > Basic Statistics > Paired t


- Choose Before and After as the variables
- Click [Option], Confidence: 90, Test mean: 0, Alternative: less than

Minitab output:

Paired T-Test and CI: Before, After

Paired T for Before - After

N Mean StDev SE Mean


Before 9 10.3333 4.6904 1.5635
After 9 13.4444 7.1957 2.3986
Difference 9 -3.11111 3.33333 1.11111

90% upper bound for mean difference: -1.55909


T-Test of mean difference = 0 (vs < 0): T-Value = -2.80 P-Value = 0.012

(i) State the null and alternative hypotheses.

(ii) Show that the mean difference is - 3.11.

(iii) Show that the test statistic is t = -2.80.

(iv) Conclude your testing based on the p-value.

Solution:

15
3.4 Hypothesis Test for One Population Variance
Question 1

The number of carbohydrates found in a random sample of fast-food entrees is listed below.

Carbohydrates in fast food: 53, 46, 47, 38, 73, 39, 39, 43, 41, 30

Minitab output:

Test and CI for One Variance: Carbohydrates in fast food

Null hypothesis Sigma-squared = 100


Alternative hypothesis Sigma-squared not = 100

The chi-square method is only for the normal distribution.

Statistics

Variable N StDev Variance


Carbohydrates in fast food 10 11.6 135

Tests

Test
Variable Method Statistic DF P-Value
Carbohydrates in fast food Chi-square 12.11 9 0.406

(i) At 5% level of significance, is there sufficient evidence to conclude that the variance
differs from 100? Use p-value.
(ii) Prove that the test statistic is 12.11.

Solution:

16
3.5 Hypothesis Test for Two Population Variances
Question 1

A researcher was interested in exploring the driving behaviours of male and female
university students based on driving speed (in mph). She conducted a survey of a random
sample of 34 male and 29 female university students. Table below is a descriptive summary
of the results of her survey.

Female (F) Male (M)


n 29 34
Sample mean 90.9 105.5
Sample standard deviation 12.2 20.1

Minitab output:

Test for Two Variances

Null hypothesis Sigma (F) / Sigma (M) = 1


Alternative hypothesis Sigma (F) / Sigma (M) not = 1
Significance level Alpha = 0.10

Statistics

Sample N StDev Variance


F 29 12.200 148.840
M 34 20.100 404.010

Ratio of standard deviations = 0.607


Ratio of variances = 0.368

Tests
Test
Method DF1 DF2 Statistic P-Value
F 28 33 0.37 0.009

(i) At the 10% level of significance, is there sufficient evidence to conclude that the
variance of the speed driven by male university students differs from the variance of the
speed driven by female university students? Use p-value.
(ii) Prove that the test statistic is 0.37.
(iii) What is the degree of freedom for male university students for the F test?

Solution:

17
CHAPTER 4 - Z-TEST
Question 1

The amount of coffee (in ounces) filled by a machine in 6 randomly selected jars are listed
below.
Amount: 15.9, 15.7, 16.3, 15.9, 15.7, 16.2
Assume a normal distribution for the amount of coffee in a jar with population standard
deviation of 0.229 ounce.

Using Minitab:

Select Stat > Basic Statistics > 1-Sample Z


- Choose Amount as the variable
- Standard: 0.229
- Test mean: 16
- Click [Option], Confidence: 95, Alternative: not equal

Minitab output:

One-Sample Z: Amount

Test of mu = 16 vs not = 16
The assumed standard deviation = 0.229

Variable N Mean StDev SE Mean 95% CI Z P


Amount 6 15.9500 0.2510 0.0935 (15.7668, 16.1332) -0.53 0.593

(A) HYPOTHESIS TEST


At 5% level of significance, is the mean amount of coffee filled in a jar differs from 16
ounces?
(i) Use Traditional method of z-test for a mean
(ii) Use p-value method

Solution:
(i) Traditional method of z-test for a mean
Population standard deviation is known,  = 0.229.
Sample mean x = 15.95, sample size n = 6.

Step 1: Ho: µ = 16
H1: µ ≠ 16

18
(ii) p-value method

Step 1: Ho: µ = 16
H1: µ ≠ 16

(B) CONFIDENCE INTERVAL


Prove that the 95% confidence interval of the mean is the same as in the computer output
(15.7668, 16.1332).

Solution:
Population standard deviation is known,  = 0.229.
Sample mean x = 15.95, sample size n = 6.

95% confidence interval for mean:


   
x   Z   
 2  n 

Note (Pg 413, 427, 429) (Book-Bluman)


z-test is used when population standard deviation is known.
(t-test is used when population standard deviation is unknown)
Assumptions:
1. The sample is a random sample
2. Either n ≥ 30 or the population is normally distributed if n < 30.

19
CHAPTER 5 - ANOVA

5.1 One-way ANOVA


Question 1

A lot of different factors contribute to air pollution. One particular factor, particulate matter,
was measured for prominent cities of 3 continents. Particulate matter includes smoke, soot,
dust and liquid droplets from combustion such that the particle is less than 10 microns in
diameter and thus capable of reaching deep into the respiratory system. The measurements
are listed below.

Asia Europe Africa


79 35 43
104 34 16
73 30 33
40 43

Using Minitab:

Select Stat > ANOVA > One-Way (Unstacked)


- Choose Asia, Europe, Africa as the responses (in separate columns)
- Confidence: 95

Minitab output:

One-way ANOVA: Asia, Europe, Africa

Source DF SS MS F P
Factor 2 4230 2115 6.65 0.020
Error 8 2544 318
Total 10 6774

(i) Identify the type of experimental design used in this study.


(ii) At 0.05 level of significance, is there sufficient evidence to conclude that there is a
difference in the means? Use p-value.
(iii) Prove that the total sum of squares is 6774.

Solution:

20
5.2 Two-way ANOVA
Question 1

A company recently purchased a new computer software package. The employees were
given training on using the new software. There are 4 different training methods with 3
training instructors. After completing the training, each employee was given identical
computer task. The time (in minutes) to complete the tasks were recorded, resulting in the
following data. The main objective of this experiment is to investigate whether different
training methods have an effect on the time to complete the tasks.

Method
A B C D
I 20 15 30 24
Instructor II 17 19 21 16
III 22 25 26 21

Using Minitab:

Select Stat > ANOVA > Two-Way


- Choose Response: Time, Row factor: Instructor, Column factor: Method
- Confidence: 99

Minitab output:

Two-way ANOVA: Time versus Instructor, Method

Source DF SS MS F P
Instructor 2 60.167 30.0833 2.36 0.175
Method 3 76.000 25.3333 1.99 0.217
Error 6 76.500 12.7500
Total 11 212.667

(i) What is the type of experimental design used in this study?


(ii) Identify the treatment factor and blocking factor in this experiment.
(iii) At 1% level of significance, do the data provide enough evidence to indicate that training
method has an effect on time to complete the tasks? Use p-value.
(iv) At 1% significance level, do the data provide enough evidence to indicate that instructor
has an effect on time to complete the tasks? Use p-value.

Solution:

21
Question 2

An official of a small transit system with only five buses wants to study the tread wear of 4
types of tyres. Each of the buses runs a different route so that terrain and driving conditions
differ from bus to bus. One tyre of each type is placed on each bus, with the wheel positions
being assigned randomly. The tyres are run for 15,000 kilometers, after which the tread wear
(in millimeters) is measured. The computer output is shown below.

Minitab output:

Two-way ANOVA: Tread wear versus Tyre, Bus

Source DF SS MS F P
Tyre (A) (C) 133.779 (E) 0.000
Bus 4 81.525 20.381 9.35 0.001
Error (B) 26.167 (D)
Total 19 509.030

(i) Identify the type of experimental design used in this study.

(ii) Identify the response, factor and block variables.

(iii) Find the values of A, B, C, D and E.

(iv) By using the p-value, test at 5% level of significance whether there is sufficient evidence
to indicate that a difference exists in the tread wear in different types of tyres.

(v) Is there sufficient evidence to indicate that tread wear of tyres varies from bus to bus?
Test at 5% level of significance. Use p-value.
Solution:

22
5.3 Two-way ANOVA with Interaction
Question 1

Suppose we want to compare type of popcorn and brand of popcorn with respect to their
yield (in terms of cups per popcorn). Factor A is the type of popcorn: oil-based versus air-
based. Factor B is the brand of popcorn: gourmet versus national versus generic. For each
combination of popper type and brand, three separate measurements are taken as shown in
the table below.

Brand of popcorn
Gourmet National Generic
6 5 3
Oil 5.5 4.5 4
Type
7 4 3.5
of
7 5 5
popcorn
Air 6 5 4.5
6.5 5.5 4

Using Minitab:

Select Stat > ANOVA > Two-Way


- Choose Response: Yield, Row factor: Type of popcorn, Column factor: Brand
- Confidence: 95

Minitab output:

Two-way ANOVA: Yield versus Type, Brand

Source DF SS MS F P
Type of popcorn 1 2.0000 2.00000 (e) 0.020
Brand (a) 16.7778 8.38889 30.20 0.000
Interaction 2 (c) 0.16667 0.60 0.564
Error 12 3.3333 (d)
Total (b) 22.4444

(i) Identify the type of experimental design used in this study.

(ii) How many treatment combinations are there?

(iii) What are the names of the two factors?

(iv) How many levels does each factor contain?

(v) Find the missing values (a, b, c, d, e) in the ANOVA table above.

23
(vi) Do the data provide sufficient evidence to indicate an interaction between the two
factors at 5% significance level? Use p-value.

(vii) At 5% level of significance, do the data provide sufficient evidence to indicate that there
is a significant effect on mean yield due to type of popcorn? Use p-value.

(viii) At 5% level of significance, do the data provide sufficient evidence to indicate that
there is a significant effect on mean yield due to brand of popcorn? Use p-value.

24
CHAPTER 6 - REGRESSION
Question 1

The body mass index, BMI (ratio of weight in kilograms to height in meter 2) and HDL
cholesterol (in miligrams per deciliter, mg/dL) of 10 randomly selected samples are given in
the following table.

BMI
20.3 21.6 21.9 24 25 27.2 30 32 28.8 31.8
(kg/m2)
HDL
64 57 60 52 50 47 45 40 55 43
(mg/dL)

Using Minitab:

Select Stat > Regression > Regression


- Choose Response: HDL, Predictors: BMI

Minitab output:

Regression Analysis: HDL versus BMI

Predictor Coef SE Coef T P


Constant 93.399 7.556 12.36 0.000
BMI -1.6032 0.2843 -5.64 0.000

S = 3.68369 R-Sq = 79.9% R-Sq(adj) = 77.4%

Analysis of Variance

Source DF SS MS F P
Regression 1 431.54 431.54 31.80 0.000
Residual Error 8 108.56 13.57
Total 9 540.10

(i) State the independent variable and dependent variable.

(ii) Write down the estimated least squares regression line. Interpret the meaning of the
regression coefficients, a and b.

25
(iii) Estimate Ahmad’s HDL cholesterol if his BMI is 26.

(iv) Determine the coefficient of determination and interpret its meaning.

(v) Determine the correlation coefficient and interpret it.

(vi) At 5% level of significance, test whether the linear regression model is significant. Use
p-value.

26
27

You might also like