Professional Documents
Culture Documents
Correlation Analysis
PLAN
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 3
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 4
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 5
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 6
Zero Correlation
Y
10
9
8
7
6
5
4
3
2
1
0
0 1 2 3 4 5 6 7 8 9 10 X
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 7
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 8
Formula for r
( X X )(Y Y )
r
(n 1) s x s y
n(XY ) (X )(Y )
n(X ) (X ) nY Y
2 2 2 2
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 9
Coefficient of Determination
The coefficient of determination (r2) is the
proportion of the total variation in the dependent
variable (Y) that is explained or accounted for by
the variation in the independent variable (X).
It is the square of the coefficient of correlation.
It ranges from 0 to 1.
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 10
EXAMPLE 1
Dan Ireland, the student body president at
Toledo State University, is concerned about the
cost to students of textbooks. He believes there
is a relationship between the number of pages
in the text and the selling price of the book. To
provide insight into the problem he selects a
sample of eight textbooks currently on sale in
the bookstore. Draw a scatter diagram.
Compute the correlation coefficient.
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 11
EXAMPLE 1 continued
Example 1 continued
Scatter Diagram of Number of Pages and Selling Price of Text
100
90
Price ($)
80
70
60
400 500 600 700 800
Page
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 13
Example 1 continued
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 14
Example 1 continued
n(XY ) (X )( Y )
r
n(X 2
) (X ) 2
nY Y
2 2
0.614
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 15
3. Testing the Significance of
the Correlation Coefficient
Tests for Significance
• r is an estimate of the population correlation coefficient r
(rho).
• To test the hypothesis H0: r = 0, the test statistic is:
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 16
Testing the Significance of
the Correlation Coefficient
Steps in Testing if r = 0
• Step 1: State the Hypotheses
Determine whether you are using a one or two-
tailed test and the level of significance (a).
H0: r = 0
H1: r ≠ 0
• Step 2: Calculate the Critical Value for degrees of
freedom f = n -2, look up the critical value ta at
t-values Distribution Table, then calculate
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 17
Testing the Significance of
the Correlation Coefficient
Steps in Testing if r = 0
• Step 3: Make the Decision
If using the t statistic method, reject H0 if
t > ta or t <- ta or
if the p-value < a.
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 18
Testing the Significance of
the Correlation Coefficient
Role of Sample Size
• As sample size increases, the critical value of r
becomes smaller.
• This makes it easier for smaller values of the
sample correlation coefficient to be considered
significant.
• A larger sample does not mean that the correlation
is stronger nor does its significance imply
importance.
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 19
Testing the Significance,
EXAMPLE 1, continued
The correlation between the number of pages and the
selling price of the book is 0.614. This indicates a
moderate association between the variable.
Test the hypothesis that there is no correlation in the
population. Use a .02 significance level.
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.
13- 20
Testing the Significance,
EXAMPLE 1, continued
EXAMPLE 1 continued
To find the value of the test statistic we use:
r n2 .614 8 2
t 1.905
1 r 2 1 (.614 ) 2
McGraw-Hill/Irwin Copyright © 2002 by The McGraw-Hill Companies, Inc. All rights reserved.