You are on page 1of 2

Goodman and Kruskal's lambda

In probability theory and statistics, Goodman & Kruskal's lambda ( ) is a measure of proportional
reduction in error in cross tabulation analysis. For any sample with a nominal independent variable and
dependent variable (or ones that can be treated nominally), it indicates the extent to which the modal
categories and frequencies for each value of the independent variable differ from the overall modal category
and frequency, i.e., for all values of the independent variable together. is defined by the equation

where

is the overall non-modal frequency, and


is the sum of the non-modal frequencies for each value of the independent variable.

Values for lambda range from zero (no association between independent and dependent variables) to one
(perfect association).

Weaknesses
Although Goodman and Kruskal's lambda is a simple way to assess the association between variables, it
yields a value of 0 (no association) whenever two variables are in accord—that is, when the modal category
is the same for all values of the independent variable, even if the modal frequencies or percentages vary. As
an example, consider the table below, which describes a fictitious sample of 350 individuals, categorized by
relationship status and blood pressure. Assume that the relationship status is the independent variable, the
blood pressure is the dependent variable, i.e., the question asked is "can the blood pressure be predicted
better if the relationship status is known?"

Relationship Status and Blood Pressure (fictitious)


Relationship Status
Total
Unmarried Married

80% 51% 63.4%


Normal
Blood (120) (102) (222)
Pressure 20% 49% 36.6%
High
(30) (98) (128)

42.9% 57.1% 100%


Total
(150) (200) (350)

For this sample,

The reason is that the predicted nominal blood pressure is actually "Normal" in both columns (both upper
numbers are higher than the corresponding lower number). Thus, considering the relationship status will
not change the prediction that people have a normal blood pressure, even though the data indicate that
being married increases the probability of high blood pressure.

If the question is changed, e.g. by asking "What is the predicted relationship status based on blood
pressure?," will have a non-zero value.

That is:

See also
Proportional reduction in loss

References
Goodman, L.A., Kruskal, W.H. (1954) "Measures of association for cross classifications" (http
s://web.archive.org/web/20161222232929/http://nssl.noaa.gov/users/brooks/public_html/fed
a/papers/goodmankruskal1.pdf). Part I. Journal of the American Statistical Association, 49,
732–764. JSTOR 281536 (https://www.jstor.org/stable/281536)
Goodman, L.A., Kruskal, W.H. (1959) "Measures of Association for Cross Classifications. II:
Further Discussion and References" (https://bitbucket.org/mhunter/readinglists/src/d8e8010f
0b0d/ReadingList_NotreDame/GoodmanKruskal1959CrossClassificationMeasures2.pdf).
Journal of the American Statistical Association, 52, 123–163. JSTOR 2282143 (https://www.j
stor.org/stable/2282143)
Goodman, L.A., Kruskal, W.H. (1963) "Measures of Association for Cross Classifications III:
Approximate Sampling Theory", Journal of the American Statistical Association, 58, 310–
364. JSTOR 2283271 (https://www.jstor.org/stable/2283271)
doi:10.1080/01621459.1963.10500850 (https://doi.org/10.1080%2F01621459.1963.105008
50)

Retrieved from "https://en.wikipedia.org/w/index.php?title=Goodman_and_Kruskal%27s_lambda&oldid=1137886567"

You might also like