Professional Documents
Culture Documents
Regression Analysis.
Linear Regression
PLAN
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 2
Regression Analysis
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 3
Regression Analysis
The regression equation: Y’= a + bX, where:
Y’ is the average predicted value of Y for any X.
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 4
Regression Analysis
The least squares principle is used to obtain a and b. The
equations to determine a and b are:
n( XY ) ( X )( Y )
b
n( X 2 ) ( X ) 2
Y X
a b
n n
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 5
EXAMPLE 1
Dan Ireland, the student body president at
Toledo State University, is concerned about the
cost to students of textbooks. He believes there
is a relationship between the number of pages
in the text and the selling price of the book. To
provide insight into the problem he selects a
sample of eight textbooks currently on sale in
the bookstore. Draw a scatter diagram.
Compute the correlation coefficient.
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 6
EXAMPLE 1 continued
EXAMPLE 1 continued
636 4,900
a 0.05143 48 .0
8 8
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 8
Example 1 continued
The regression equation is:
Y’ = 48.0 + .05143X
Example 1 continued
Y 48 .0 0.05143 X
48 .0 0.05143 (800 ) 89 .14
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 10
Example 2
Find the standard error of estimate for xathe problem involving the
number of pages in a book and the selling price from Example 1.
Y 2 aY bXY
s y. x
n2
51,606 48(636 ) 0.05143 (397 ,200 )
82
10 .408
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 12
3. Assumptions Underlying Linear
Regression
For each value of X, there is a group of Y values,
and these Y values are normally distributed.
The means of these normal distributions of Y
values all lie on the straight line of regression.
The standard deviations of these normal
distributions are equal.
The Y values are statistically independent. This
means that in the selection of a sample, the Y values
chosen for a particular X value do not depend on the
Y values for any other X values.
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 13
Confidence Interval
The confidence interval for the mean value of Y for a
given value of X is given by:
1 ( X X )2
Y ts y. x
n (X ) 2
X
2
n
1 (800 612 .5) 2
89 .14 2.447 (10 .408 )
8 (4900 ) 2
3,150 ,000
8
89 .14 15 .31
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 14
Prediction Interval
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 15
EXAMPLE 3
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.