Professional Documents
Culture Documents
net/publication/237008786
Bartlett's Test
CITATIONS READS
0 3,244
2 authors, including:
J.P. Keating
University of Texas at San Antonio
82 PUBLICATIONS 1,003 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by J.P. Keating on 22 February 2015.
See also Box-and-Whisker Plot; Distribution; Graphical populations is known as Bartlett’s test, and Bar-
Display of Data; Histogram; Pie Chart tlett’s test statistic is given by
Qr
w
Further Readings ðS2i Þ i
‘1 ¼ i¼1
P r ,
Cleveland, W. S., & McGill, R. (1985). Graphical wi Si2
perception and graphical methods for analyzing i¼1
scientific data. Science, 229, 828–833.
Harris, R. L. (1999). Information graphics: A where wi ¼ ðni 1Þ=ðN rÞ is known as the
comprehensive illustrated reference. New York: P
r
weight for the ith group and N ¼ ni is the sum
Oxford University Press. i¼1
Playfair, W. (1786). The commercial and political atlas. of the individual sample sizes. In the equireplicate
London: Corry. case (i.e., n1 ¼ ¼ nr ¼ n), the weights are
Shah, P., & Hoeffner, J. (2002). Review of graph
equal, and wi ¼ 1=r for each i ¼ 1, . . . , r: The test
comprehension research: Implications for instruction.
statistic is the ratio of the weighted geometric
Educational Psychology Review, 14, 47–69.
Spence, I. (2000). The invention and use of statistical mean of the group sample variances to their
charts. Journal de la Société Francaise de Statistique, weighted arithmetic mean. The values of the test
141, 77–81. statistic are bounded as 0 ≤ ‘1 ≤ 1 by Jensen’s
Tufte, E. R. (1983). The visual display of quantitative inequality. Large values of 0 ≤ ‘1 ≤ 1 (i.e., values
information. Cheshire, CT: Graphics. near 1) indicate agreement with the null hypothe-
Wainer, H. (1996). Depicting error. American Statistician, sis, whereas small values indicate disagreement
50, 101–111. with the null. The terminology ‘1 is used to indi-
cate that Bartlett’s test is based on M. S. Bartlett’s
modification of the likelihood ratio test, wherein he
BARTLETT’S TEST replaced the sample sizes ni with their correspond-
ing degrees of freedom, ni 1. Bartlett did so to
make the test unbiased. In the equireplicate case,
The assumption of equal variances across treatment
Bartlett’s test and the likelihood ratio test result in
groups may cause serious problems if violated in
the same test statistic and same critical region.
one-way analysis of variance models. A common
The distribution of ‘1 is complex even when the
test for homogeneity of variances is Bartlett’s test.
null hypothesis is true. R. E. Glaser showed that
This statistical test checks whether the variances
the distribution of ‘1 could be expressed as a prod-
from different groups (or samples) are equal.
uct of independently distributed beta random vari-
Suppose that there are r treatment groups and
ables. In doing so he renewed much interest in the
we want to test
exact distribution of Bartlett’s test. We reject H0
Ho : s21 ¼ s22 ¼ ¼ s2r provided ‘1 ≤ ba ðn1 , . . . , nr Þ where Pr ‘1 < ba
ðn1 , . . . , nr ÞÞ ¼ a when H0 is true. The Bartlett
versus critical value ba ðn1 , . . . , nr Þ is indexed by level of
H1 : s2m 6¼ s2k for some m 6¼ k: significance and the individual sample sizes. The
critical values were first tabled in the equireplicate
In this context, we assume that we have inde- case, and the critical value was simplified to
pendently chosen random samples of size ba ðn, . . . , nÞ ¼ ba ðnÞ. Tabulating critical values
ni , i ¼ 1, . . . , r from each
of the
r independent with unequal sample sizes becomes counterproduc-
populations. Let Xij e N mi , s2i be independently tive because of possible combinations of groups,
distributed with a normal distribution having sample sizes, and levels of significance.
mean mi and variance s2i for each j ¼ 1, . . . , ni and
each i ¼ 1, . . . , r. Let Xi be the sample mean and
Example
S2i the sample variance of the sample taken from
the ith group or population. The uniformly most Consider an experiment in which lead levels are
powerful unbiased parametric test of size a for measured at five different sites. The data in Table 1
testing for equality of variances among r come from Paul Berthouex and Linfield Brown:
62 Bartlett’s Test
Table 1 Ten Measurements of Lead Concentration (mG=L) Measured on Waste Water Specimens
Measurement No.
Lab 1 2 3 4 5 6 7 8 9 10
1 3.4 3.0 3.4 5.0 5.1 5.5 5.4 4.2 3.8 4.2
2 4.5 3.7 3.8 3.9 4.3 3.9 4.1 4.0 3.0 4.5
3 5.3 4.7 3.6 5.0 3.6 4.5 4.6 5.3 3.9 4.1
4 3.2 3.4 3.1 3.0 3.9 2.0 1.9 2.7 3.8 4.2
5 3.3 2.4 2.7 3.2 3.3 2.9 4.4 3.4 4.8 3.0
Source: Berthouex, P. M., & Brown, L. C. (2002). Statistics for environmental engineers (2nd ed., p. 170). Boca Raton,
FL: Lewis.
From these data one can compute the sample So for the lead levels, we have the following
variances and weights, which are given below: values: b0:05 ð5; 10Þ¼0:8025.
_ At the 5% level of
significance, there is not enough evidence to reject
Labs Weight Variance the null hypothesis of equal variances.
1 0.2 0.81778 Because of the complexity of the distribution
2 0.2 0.19344 of ‘1 , Bartlett’s test originally employed
an
3 0.2 0.41156 approximation. Bartlett proved that ln ‘1 =
4 0.2 0.58400 c e w2 ðr 1Þ, where
5 0.2 0.54267
h iP
r
1 1 1
1þ Nr
By substituting these values into the formula for 3ðr1Þ
i¼1
ni 1
‘1 , we obtain c¼ :
Nr
0:46016
‘1 ¼ ¼ 0:90248: The approximation works poorly for small
0:509889 sample sizes. This approximation is more accurate
Critical values, ba ða, nÞ, of Bartlett’s test are as sample sizes increase, and it is recommended
tabled for cases in which the sample sizes are equal that minðni Þ ≥ 3 and that most ni > 5.
and a ¼ :05. These values are given in D. Dyer and
Jerome Keating for various values of r, the number
of groups, and n, the common sample size (see Assumptions
Table 2). Works by Glaser, M. T. Chao and Glaser, Bartlett’s test statistic is quite sensitive to non-
and S. B. Nandi provide tables of exact critical normality. In fact, W. J. Conover, M. E. Johnson,
values of Bartlett’s test. The most extensive set is and M. M. Johnson echo the results of G. E. P.
contained in Dyer and Keating. Extensions (for Box that Bartlett’s test is very sensitive to samples
larger numbers of groups) to the table of critical that exhibit nonnormal kurtosis. They recommend
values can be found in Keating, Glaser, and N. S. that Bartlett’s test be used only when the data con-
Ketchum. form to normality. Prior to using Bartlett’s test, it
is recommended that one test for normality using
an appropriate test such as the Shapiro–Wilk W
Approximation
test. In the event that the normality assumption is
In the event that the sample sizes are not equal, violated, it is recommended that one test equality
one can use the Dyer–Keating approximation to of variances using Howard Levene’s test.
the critical values:
Mark T. Leung and Jerome P. Keating
X
a
ni
ba ða; n1 , ,na Þ¼_ × ba ða,ni Þ: See also Critical Value; Likelihood Ratio Statistic;
i¼1
N Normality Assumption; Parametric Statistics; Variance
Bartlett’s Test 63