Professional Documents
Culture Documents
According Simpson and Kafka, “Correlation analysis deals with the association between two or more
variables.
The problem of analysing the relationship between different data should be broken into following
steps:
It should be noted that the detection and analysis of correlation between two statistical variable
requires relationship of some kind which associates the observation in pairs, one of each pair being a
value of each of the two variables.
According to Simpson and Kafka “ Correlation Analysis deals with the association between two or
more variables”
Types of correlation
1. Positive or negative
2. Graphic method
The value of correlation coefficient lies between -1 to +1. When r = +1 infers that there is a perfect
positive correlation between the variables. When r = -1, it means that there is perfect negative
correlation between variables. When r=o, It means that there is no relationship between two
variables. In practical r = +1, -1, and 0 are rare.
Probable error is possible to determine the reliability of the value of the coefficient and it depends
on the conditions of the random sampling.
P.E = 0.6745 1-r1/√N, where r is the coefficient of correlation and N is the number of observations.
If the value of r is less than the P.E, there is no evidence of correlation. It further reveals that the
value of correlation is not significant.
If the value of r is more than six times the probable error, the coefficient of correlation is practically
certain. It further reveals that the correlation coefficient is significant.
Adding and subtracting the value of probable error from the coefficient of correlation, we get
respectively the upper and lower limit of correlation coefficient in the population. It can be
represented as ρ = r + or – P.E, where ρ is the correlation coefficient of population.
The Probable error can be used for the following conditions is satisfied:
2. The sample selected should be in unbiased manner and the individual items must be independent.
3. The statistical measure for which P.E is measured must have been estimated for a sample.
2. Direct method
4. Grouped data
1. The following table gives indices of industrial production of registered unemployed in hundred
thousand. Calculate the value of the coefficient of correlation.
No.Unemployed : 14 11 12 12 13 13 18 27
X = 832/8 = 104, Y = 120/8 = 15
X Y x= X-X Y= Y-Y x2 y2 xy
101 14 -3 -1 9 1 3
103 11 -1 -4 1 16 4
105 12 1 -3 1 9 -3
108 12 4 -3 16 9 -12
104 13 0 -2 0 4 0
110 13 6 -2 36 4 -12
103 18 -1 3 1 9 -3
98 27 -6 12 36 144 -72
r=∑xy/√(∑x2∑y2)
= -95/√100X196
= -0.679
There is a high negative correlation between index of production and number unemployed.
This method can be useful if the variable has minimum numerical value.
X: 9 8 7 6 5 4 3 2 1
Y: 14 17 13 14 10 13 9 7 11
X Y X2 Y2 XY
9 14 81 196 126
8 17 64 289 136
7 13 49 169 91
6 14 36 196 84
5 10 25 100 50
4 13 16 169 52
3 9 9 81 27
2 7 4 49 14
1 11 1 121 22
…. …. ….. ……. ……
= -0.381
The following table gives the distribution of items of production and also the relatively defective
items among them, according to size groups. Find the correlation coefficient between size and
defect in quality and its possible error.
Solution:
=-738.3/797.78 =-0.925
Grouped Data
dx -2 -1 0 1 2
m Y dy
22.5 20-25 0 - - 3 0 10 0 3 0 4 0
f 10 19 41 18 12
fdx2 40 19 0 18 48
fdy2 10 0 30 160
= -7900-300/111.763X100 = -8200/11176.3
=-0.733
= 0.031
Rank Correlation
Two boys were asked to rank 7 different brands of mobiles. The ranks given by them are as follows:
Brand of
Mobile : A B C D E F G
Bhavesh:1 2 3 4 7 5 6
Suresh :3 1 4 2 6 5 7
1 3 4
2 1 1
3 4 1
4 2 4
7 6 1
5 5 0
6 7 1
………..
12
……..
R = 1- 6∑D2/N(N2-1)
= 1-6X12/7(72-1)
= 1-0.214
= 0.786
1 97 73
2 99 85
3 98 78
4 96 75
5 94 77
6 95 67
7 93 83
Solution:
1 97 3 73 6 9
2 99 1 85 1 1
3 98 2 78 3 1
4 96 4 75 5 1
5 94 6 77 4 4
6 95 5 67 7 4
7 93 7 83 2 25
………
45
……….
R = 1 – 6X45/7(72-1) = 0.919
There is a high positive correlation between Debenture Price and Share Price
Equal Ranks
Candidate: 1 2 3 4 5 6 7 8
Judge X : 21 23 29 24 31 31 24 25
Judge Y : 29 25 25 26 27 28 33 31
Solution:
1 21 29 8 3 25
2 23 25 7 7.5 0.25
3 29 25 3 7.5 20.25
4 24 26 5.5 6 0.25
5 31 27 1.5 5 12.25
6 31 28 1.5 4 6.25
7 24 33 5.5 1 20.25
8 25 31 4 2 4
……………
88.5
= -0.036
There is a negative correlation between the ranks given by judge X and Judge Y.