Professional Documents
Culture Documents
Defination
The extent to which two or more things are
related to one another -- “co-related”
OR
If the change in one variable affects a change in
other variable, the variables are s.t.b. correlated
Correlation is a bivariate measure of association
(strength) of linear relationship between two
variables.
Thus, correlation analysis is a statistical tool
which is used to describe the degree to which one
variable is linearly related to another.
Univariate distribution - distribution
involving only one variable.
Some situations occur in which our focus
is simultaneously on two or more
variables - bivariate distribution
The movement in one variable is
accompanied by movements in other
variable.
EXAMPLE : husband’s and wife’s age
move together, price and demand of
commodities.
Types of Correlation :
Graphicalmethod
Mathematical method
Graphical method
( scatter diagram)
Scatter diagram is mainly used to represent
bivariate data.
These diagrams indicate the existence of a
relationship, as well as the strength of that
relationship.
It is an easy way to highlight any relationship
that may exist and its type, whether direct or
inverse.
Steps of drawing the Scatter diagram
Collect data on two variables, one
independent and the other dependent.
Draw a diagram with the “cause” or
independent variable labeled on the
horizontal (X) axis and the “effect” or
dependent variable labeled on vertical (Y)
axis.
11-7
11-2 Scatter Plots - Example
Construct a scatter plot for the data obtained in
a study of age and blood pressure of six
randomly selected subjects.
The data is given on the next slide.
11-8
11-2 Scatter Plots - Example
Positive Relationship
150
150
Pressure
Pressure
140
140
130
130
120
120
40
40 50
50 60
60 70
70
Age
Age
11-2 Scatter Plots - Other Examples
11-10
Negative Relationship
90
90
80
80
grade
Finalgrade
70
70
Final
60
60
50
50
40
40
55 10
10 15
15
Number
Numberofofabsences
absences
11-2 Scatter Plots - Other Examples
11-11
No Relationship
10
10
55
Y
y
00
00 10
10 20
20 30
30 4040 50
50 60
60 70
70
xX
Merits : Scatter diagram
Simple to calculate & understand.
Attractive method of finding correlation
Rough idea at glance for positive or
negative correlation
Not influenced by extreme items
First step in finding the correlation
Demerits :Scatter diagram
Y
is standard deviations of Y
Interpretation
Interpretation Value of r
rxy ryx r
Relation between x and y is same as y and x.
Example 1
S. Ad.Expen. Sale
The following table shows 10
No. (in thousand) (Units)
years data of advertisement
expenditure and sales of a 1 50 700
company. Calculate the 2 50 650
correlation coefficient
between these two variables 3 50 600
for this company? 4 40 500
Determine coefficient of 5 30 450
determination.
6 20 400
Determine Probable error
and point out whether 7 20 300
coefficient of correlation is 8 15 250
significant. 9 10 210
Find the limits within which r
10 5 200
lies for another sample from
the same universe.
Example 2
Calculate the Export of Export of
correlation coefficient raw cotton manufactured
from the following (crores) goods (crores)
data : 42 56
44 49
58 53
55 58
89 65
98 76
66 58
EXAMPLE
x -4 -3 -2 -1 0 1 2 3 4
y 3 -3 -4 0 4 1 2 -2 -1
r=
Example 3
Find the coefficient of correlation between the age and the sum
assured from the following table:
Age group Sum assured ( in Rs.)
20-30 4 6 3 7 1
30-40 2 8 15 7 1
40-50 3 9 12 6 2
50-60 8 4 2 - -
Example 4
Find the coefficient of correlation from the following bivariate
frequency distribution :
Sales Advertising expenditure (Rs. ‘000)
revenue
(Rs. lakhs) 5-10 10-15 15-20 20-25
75-125 4 1 - -
125-175 7 6 2 1
175-225 1 3 4 2
225-275 1 1 3 4
Coefficient of determination
Coefficient of non-determination is
ratio of unexplained variation to
total variation.
Coefficient of non-determination is
denoted by K²
K² = 1- r²
K² =unexplained variance ∕ total variance
Standard error
ρ = r± PE(r)
Where ρ : population correlation
Functions : Probable error
First 1 6 5 10 3 2 4 9 7 8
judge
Second 6 4 9 8 1 2 3 10 5 7
judge
First 1 6 5 10 3 2 4 9 7 8
judge
Second 3 5 8 4 7 10 2 1 6 9
judge
Third 6 4 9 8 1 2 3 10 5 7
judge
Calculate 1 49 15.8
3 127 11.3
correlation 4 91 13.2
coefficient as a 5 72 13
relationship
8 11 20.2
9 191 10.8
between work-hrs 10 6 18.8
and annual wages. 11 63 13.8
13 43 15.1
14 57 24.2
15 82 13.9
Tied / Repeated Ranks
When two or more individuals get the same rank with
respect to either of the two characteristics being
studied, common rank is assigned to the
observations, that are repeated.
This common rank is the average of the ranks which
these observations would have assumed if they had
been different from one another.
EXAMPLE : If two obs. Were ranked equal at the
fourth place, then both these obs. Would be ranked
as 4+5/2 = 4.5
the next obs. Would be ranked 6 and so on.
Then rank correlation coefficient is
given by :
Rs = 1- 6 { d 2 + 1 m1(m1 2– 1)
2 12
+ 1 m2(m2 – 1)+…………..}
12
2
n(n – 1 )
Where, mi = no. of times the ith
repeated item is repeated. i = 1,2……..
Student Midterm Final
score exam
Example score
Neha 82 94
The scores of 10
Chani 81 92
students on the
mid term Aditi 80 85
examination and Sumit 68 75
the final Aditya 70 73
examination are
Mohit 92 95
given. Compute
the rank Reha 76 69
correlation Rahul 80 86
coefficient. Sakshi 86 90
charu 62 69
Example
Applican Marks in Marks in
ts accounts statistic
An examination of 8 s
applicants for a clerical
post was taken by a A 15 40
firm. From the marks B 20 30
obtained by the
applicants in the C 28 50
accounts and statistics D 12 30
paper, compute the
rank correlation E 40 20
coefficient.
F 60 10
G 20 30
H 80 60
EXAMPLE