Professional Documents
Culture Documents
Correlation
• Meaning and Uses of Correlation The Phi or Fourfold Coefficient
• Graphs of Degree of Correlation • The Tetrachoric r, 𝑟!
and the Correlation Coefficient
• Pearson r Correlation or Product- No Special Correlation Coefficient
Moment Coefficient of Correlation. • Partial r
• Spearman Rho Coefficient of • Multiple Correlation
Correlation
• Test of Significance Coefficient Gamma
• Gamma
The Chi-Square Test
• Chi-Square Test Correlation Between Nominal Data
• Lambda
Kendall's Tau Correlation Between Ranks
• Kendall's Tau Correlation Between Correlation Between An Interval And
Ranks - No Ties Nominal Data
• Kendall's Tau Correlation Between • The Correlation Ratio
Ranks - With Ties
• Kendall's Coefficient of
Concordance W
x x
x x
data. 1 21 27 8 6 2 4
2 22 28 6.5 3.5 3 9
The formula is:
3 28 27 2.5 6 3.5 12.25
𝟔𝜮𝑫² 4 27 10 4.5 12 0.5 56.25
𝒓𝒉𝒐 = 𝟏 −
𝑵(𝑵² − 𝟏) 5 48 30 1 1.5 0.5 0.25
6 22 21 6.5 9.5 3 9
Where:
7 27 27 4.5 6 1.5 2.5
8 6 21 12 9.5 2.5 6.25
rho – is the coefficient of 𝚺𝐃² – is the sum of
9 11 21 11 9.5 1.5 2.25
correlation by the rank column 𝐃²
10 12 28 10 3.5 6.5 42.25
difference method N – is the sum of the pairs
11 16 30 9 1.5 7.5 56.25
1 – is constant of scores or
12 28 21 2.5 9.5 7 49
6 – is constant measures
ƩD²= 249
𝟖(𝟏𝟗𝟗𝟐𝟖)8(𝟐𝟑𝟐)(𝟔𝟕𝟐)
𝑷ᵣ = Conclusion:
𝟖 𝟕𝟎𝟐𝟐 8 𝟐𝟑𝟐 𝟐 [𝟖 𝟓𝟕𝟑𝟑𝟒 8 𝟔𝟕𝟐 𝟐 ]
Computation
O E O–E (O – E)2
𝒅𝒇 = 𝒄 − 𝟏 𝒓 − 𝟏
(𝑶%𝑬)²
𝑬
𝝀² = 𝟏𝟖. 𝟖𝟏𝟑𝟕
For the example, Example 3: The sales manager is asked by the corporate
board to provide a more specific answer with regard to the
𝒅𝒇 = (𝟑 − 𝟏)(𝟑 − 𝟏) = 𝟐(𝟐) = 𝟒 importance of the location in selling a particular type of product
by doing empirical research.
The sales manager gives you the following data to find
Conclusion: out if dependency relationship exist between sales volume and
location of 80 establishments.
The computed value of 18.8317 is greater than
Sales Volume
the tabular value of 13.277 at 0.01 level of Quiapo Cubao Pasig Makati Total
TOTAL 23 20 20 17 80
Conclusion:
3 3.45 -0.45 0.2025 0.0587
3 3 0 0 0
4 3 1 1 0.3333
1 1.7 -0.7 0.49 0.2882 between sales volume and location of 80 establishments.
λ² = 2.6977
𝐏"𝐐
𝐓 = 𝐍(𝐍"𝟏)⁄𝟐
𝟏𝟕"𝟒
𝐓=
𝟕(𝟔)⁄𝟐
𝟏𝟑
𝐓=
𝟐𝟏
𝐓 = 𝟎. 𝟔𝟐
When ties appears, certain adjustments have to be Table 8.4 Calculation of Kendall’s T
made. The number of individual’s ranking higher and lower
than each individual on the Y variable is again determined,
Coefficient – With Ties
resulting in P = 33 and Q = 11.
In handling ties we first take the X distribution and
for each set of ties determine (x)(x-1). Where x is the
number of tied for a particular rank. These are summed
and divided by 2. We have:
𝟐 𝟐%𝟏 A𝟑(𝟑%𝟏)
=
𝟐
𝟐A𝟔
=
𝟐
𝟖
=
𝟐
=𝟒
We repeat the process for Y distribution: And take the square root of the product:
𝟐(𝟐%𝟏) 𝟐 𝟏𝟖𝟎𝟒 = 𝟒𝟐. 𝟓
= =𝟏
𝟐 𝟐 Then:
Next we calculate: 𝐏−𝐐
𝐓=
𝑵(𝑵%𝟏) 𝟏𝟎%(𝟗) 𝟒𝟐. 𝟓
= = 𝟒𝟓 𝟐𝟐
𝟐 𝟐 𝐓=
𝟒𝟐. 𝟓
The correction obtained above for each 𝐓 = 𝟎. 𝟓𝟐
distribution is subtracted from this:
𝟒𝟓 − 𝟒 = 𝟒𝟏 Kendall’s tau, like rs, has many applications. For reasons
𝟒𝟓 − 𝟏 = 𝟒𝟒 beyond the scope of this text, many statisticians prefer tau
We next multiply this two terms: over rs. As illustrated, when both rs and tau are computed for
the same data, tau is then smaller. The range of tau is the
𝟒𝟒 𝟒𝟏 = 𝟏𝟖𝟎𝟒 same as that of rs and both statistics are interpreted in the
same way.
Kendall’s Coefficient of Concordance W
Total sum of
If we wish to determine the relationship among three or more
sets of ranks, one rank could be selected and a spearman rs
rank:
coefficient computed between it and all of the others, and this process
could then be continued until an rs coefficient has been obtained
𝒎(𝑵)(𝑵H𝟏)
between each set of two ranks. Then these rs ‘s could be averaged for
an overall measure of relationship. =
𝟐
Kendall though has developed a technique and a statistic that =
make all of this unnecessary. Supposed the five judges (m) rank the
projects of ten individual (N) in judging contest, and we wish to
𝟓(𝟏𝟎)(𝟏𝟏)
determine the overall relationship among the ratings of the five judges. 𝟐
The rankings of these judges have been set up in table 8.5. First the
rankings by the five judges of first of the projects are summed. The
sums appear in column 3. Then column 3 is summed to give the total
=
sum of the ranks. This can be checked for the total sum of the rank as
follows:
𝟐𝟕𝟓
INTERPRETATION OF W.
Since 𝑵𝒕 − 𝑵𝒑 = 𝑵𝒘, the number missing the item or responding to it Table 8.6: SCORES ON A CONTINOUS AND ON A DICHOTOMOUS
correctly: VARIABLE TO ILLUSTRATE THE COMPUTATION OF
𝑵𝒑 𝑵𝒘 THE POINT –BISERIAL
𝜮𝒙𝟐 =
𝑵𝒕 (1) (2) (3) (4) (5) (6) (7)
Y fp fw f Fy fY2 fpY
Also, in obtaining Ʃ𝑋𝑌 only those values where 𝑋 = 1 will enter 10 2 0 2 20 200 20
into the calculations, so actually Ʃ𝑋𝑌 may be written as Σ𝑓6 𝑌, each 𝑌
value multiplied by the frequency passing. 9 4 0 4 36 324 36
Let us now take equation (7.2) and write it wit the three basic 8 6 1 7 56 448 48
parts as fractions: 7 7 1 8 56 392 49
𝜮𝑿𝒀 − (𝜮𝑿)(𝜮𝒀)/𝑵 6 8 2 10 60 360 48
𝒓=
5 6 4 10 50 250 30
𝚺𝑿 𝟐 𝚺𝒀 𝟐 4 5 6 11 44 176 20
𝜮𝑿𝟐 − 𝜮𝒀𝟐 −
𝑵 𝑵
3 3 8 11 33 99 9
And substitute some of the information presented above into it. We 2 2 7 9 18 36 4
now have:
1 1 8 9 9 9 1
𝜮𝒇𝒑 𝒀 − 𝑵𝒑 (𝜮𝒇𝒑 𝒀)/𝑵 0 0 9 9 0 0 0
𝒓= ∑fp =44 ∑fw =46 ∑f=90 ∑fY=382 ∑fY2=2294 ∑fpY=265
𝑵𝒑 𝑵𝒘 /𝑵𝒕 − 𝜮𝒇𝒀𝟐 − (𝜮𝒇𝒀)𝟐 /𝑵𝒕
40 60 100
𝟎.𝟎𝟐 𝐫𝟏𝟑 = 𝟎. 𝟔𝟎
𝐫𝟐𝟑@𝟏 =
𝟎.𝟒𝟖
𝐫𝟐𝟑 = 𝟎. 𝟒𝟎
𝐫𝟐𝟑@𝟏 = 𝟎. 𝟎𝟒
GAMMA
𝟎.𝟓𝟎²7𝟎.𝟔𝟎²8𝟐(𝟎.𝟓𝟎)(𝟎.𝟔𝟎)(𝟎.𝟒𝟎)
𝐑 𝟏5𝟐𝟑 =
𝟏8𝟎.𝟒𝟎²
𝟎.𝟐𝟓7𝟎.𝟑𝟔8𝟎.𝟐𝟒
𝐑 𝟏5𝟐𝟑 =
𝟎.𝟖𝟒
𝟎.𝟑𝟕
𝐑 𝟏5𝟐𝟑 =
𝟎.𝟖𝟒
𝐑 𝟏5𝟐𝟑 = 𝟎. 𝟒𝟒𝟎𝟓
𝐑 𝟏5𝟐𝟑 = 𝟎. 𝟔𝟔
GAMMA Example 1: Compute the gamma for the data shown in
An alternative the rank – order correlation coefficient is the the table 6.1
Goodman’s and Kruskal’s gamma (G). The value of variable can
be estimated or predicated from the other variable when you Table 6.1 Employees Ranked on Socio – Economic
have the knowledge of their values. They can also be used when Status and Educational Status
ties are found in the ranking of data.
Educational Status
The formula is:
𝐍𝐬 %𝐍𝟏
𝐆= Upper Middle Lower Total
𝐍𝐬 A𝐍𝟏 Socio
Economic
Where: Status
Upper 24 19 5 48
Solution: 𝑵𝒔 =[(24)(54)+24(29)+24(26)+24(25)]+[19(29)+1
Step 1. Arrange the ordering for one of the two 9(25)+12[12(26)+12(25)]+[(54)(25)]
characteristics from the highest to the lowest or vice 𝑵𝒔 =[1296+696+624+600+551+475+312+300+13
versa from top to bottom through the rows and for the 50]
other characteristics from the highest to the lowest o
𝑵𝒔 =6204
vice versa from left to right though the column.
The procedure can also be written as:
𝐍𝐬 =[24(54+29+26+25)]+[19(29+25)]+[12(26+25)
Step 2. Compute Ns by multiplying the frequency in
every cell by the series of the frequencies in all of the ]+[(54)(25)]
other cells which are both to the right of the original 𝐍𝐬 =[1296+696+624+600+551+475+312+300+13
cell below it and then sum up the products obtained. 50]
𝐍𝐬 =6204
Solution: Formula :
𝑭𝒃𝒊 − 𝑴𝒃𝒄
𝚺𝐅𝐛𝐢%𝐌𝐛𝐜 𝚺𝐅𝐛𝐣%𝐌𝐛𝐫 𝝀=
𝛌𝐜 = 𝛌𝐫 = 𝑵 − 𝑴𝒃𝒄
𝐍%𝐌𝐛𝐜 𝐍%𝐌𝐛𝐫
(𝟒𝟗A𝟕𝟐A𝟐𝟔)%𝟏𝟐𝟐 (𝟒𝟗A𝟕𝟐A𝟐𝟏)%𝟏𝟐𝟕
𝛌𝐜 = 𝛌𝐜 = Where:
𝟐𝟗𝟎%𝟏𝟐𝟐 𝟐𝟗𝟎%𝟏𝟐𝟕
𝟏𝟒𝟕%𝟏𝟐𝟐 𝟏𝟒𝟐%𝟏𝟐𝟕
𝝀 = 𝑡ℎ𝑒 𝑙𝑎𝑚𝑏𝑑𝑎 𝑐𝑜𝑒𝑓𝑓𝑖𝑐𝑖𝑒𝑛𝑡
𝛌𝐜 = 𝛌𝐜 = 𝑭𝒃𝒊 = 𝑡ℎ𝑒 𝑏𝑖𝑔𝑔𝑒𝑠𝑡 𝑐𝑒𝑙𝑙 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑖𝑒𝑠 𝑖𝑛 𝑡ℎ𝑒 𝑖𝑡ℎ 𝑟𝑜𝑤
𝟏𝟖𝟏 𝟏𝟔𝟑
𝟐𝟓 𝟏𝟓 (𝑤𝑖𝑡ℎ 𝑡ℎ𝑒 𝑠𝑢𝑚 𝑡𝑎𝑘𝑒𝑛 𝑜𝑣𝑒𝑟 𝑎𝑙𝑙 𝑜𝑓 𝑡ℎ𝑒 𝑟𝑜𝑤𝑠)
𝛌𝐜 = 𝛌𝐜 =
𝟏𝟖𝟏 𝟏𝟔𝟑 𝑴𝒃𝒄 = 𝑡ℎ𝑒 𝑏𝑖𝑔𝑔𝑒𝑠𝑡 𝑜𝑓 𝑡ℎ𝑒 𝑐𝑜𝑙𝑢𝑚𝑛 𝑡𝑜𝑡𝑎𝑙𝑠
𝛌𝐜 = 𝟎. 𝟏𝟑𝟖𝟏𝟐𝟏𝟓 𝛌𝐜 = 𝟎. 𝟎𝟗𝟐𝟎𝟐𝟒𝟓 𝑵 = 𝑡ℎ𝑒 𝑡𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠.
𝛌𝐜 = 𝟎. 𝟏𝟒 (Answer) 𝛌𝐜 = 𝟎. 𝟎𝟗 (Answer)
Example 1 : Compute ʎ𝑐 and ʎ𝑟 for the data The obtained lambda coefficient of .14 indicates
that when religion as treated as an independent
on table 6.2
variable, the error reduced in the prediction (increases
Table 6.2. A Segment of the Filipino Electorate According its accuracy) is 14 percent. While the obtained lambda
to Religion and Political Party. coefficient of .09 indicates that when political party
treated as independent variable, the error minimized in
KBL LABAN UNIDO TOTAL the prediction (increases accuracy) is 9 percent. These
CATHOLIC 49 25 18 92
results prove that religion accurately predicts political
party more than political party predicting religion.
IGLESIA NI 34 72 21 127
CRISTO
PROTESTANT 26 25 20 71
Example:
Solve the correlation ratio between sex and scores obtained
ΣrΣcYi2=(13)2+(19)2+(11)2+(26)2+(22)2+(30)2+(25)2
by eight male and five female psychology majors in Abnormal
Psychology class at the Philippine Normal College. +(29)2+(14)2+(9)2+(12)2+(8)2+(15)2
ΣrΣcYi2=169+361+121+676+484+900+625+841+1
Male 13 19 11 26 22 30 25 29
96+81+144+64+225
Female 14 9 12 8 15
ΣrΣcYi2=4887
Solution:
𝚺𝐍𝐢𝐘-𝐢²#𝐍𝐘-²
𝑵𝟏 = 𝟖 0 ₁= 𝟏𝟕𝟓 = 𝟐𝟏. 𝟖𝟖
𝒀 𝐄² =
𝟖 -²
𝚺𝐫 𝚺𝐜 𝐘𝐢#𝐍𝐘
𝟓𝟖
𝑵𝟐 = 𝟓 0 ₂= = 𝟏𝟏. 𝟔𝟎
𝒀
𝟓 𝟖 𝟐𝟏.𝟖𝟖 𝟐 3(𝟓)(𝟏𝟏.𝟔𝟎)²#𝟏𝟑(𝟏𝟕.𝟗𝟐)²
0 = 𝟐𝟑𝟑 = 𝟏𝟕. 𝟗𝟐
𝐄² =
𝑵 = 𝟏𝟑 𝒀 𝟒𝟖𝟖𝟕#(𝟏𝟑)(𝟏𝟕.𝟗𝟐)²
𝟏𝟑
𝟖 𝟒𝟕𝟖.𝟕𝟑 A𝟓(𝟏𝟑𝟒.𝟓𝟔)%𝟏𝟑(𝟑𝟐𝟏.𝟏𝟑)
𝐄² =
𝟒𝟖𝟖𝟕%𝟏𝟑(𝟑𝟐𝟏.𝟏𝟑)
𝟑𝟖𝟐𝟗.𝟖𝟒A𝟔𝟕𝟐.𝟖%𝟒𝟏𝟕𝟒.𝟔𝟗
𝐄² =
𝟒𝟖𝟖𝟕%𝟒𝟏𝟕𝟒.𝟔𝟗
𝟒𝟓𝟎𝟐.𝟔𝟒%𝟒𝟏𝟕𝟒.𝟔𝟗
𝐄² =
𝟕𝟏𝟐.𝟑𝟏
𝟑𝟐𝟕.𝟗𝟓
𝐄² =
𝟕𝟏𝟐.𝟑𝟏
𝐄² = 𝟎. 𝟒𝟔 Answer