Professional Documents
Culture Documents
David C. Hoaglin
To cite this article: David C. Hoaglin (2019) Reconsidering the Human Face as Boxplot, CHANCE,
32:3, 62-63, DOI: 10.1080/09332480.2019.1662703
Article views: 9
I
welcome Sarjinder Singh’s creativity (“Box Plot
Versus Human Face,” CHANCE 32(2):28–29),
but the boxplot does not align with a human
face as closely as he suggests, and it does not classify
observations as outliers.
In the standard boxplot (in his notation), the inner
fences are:
LIF = Q 1 – 1.5(Q 3 – Q 1)
UIF = Q 3 + 1.5(Q 3 –Q1)
and the outer fences are:
LOF = Q 1 – 3(Q 3 – Q 1)
UOF = Q 3 + 3(Q 3 – Q 1)
(For Q 1 and Q 3 the boxplot uses a particular defini-
tion, known as the “fourths.”)
Importantly, all the data values are real, but some
may merit investigation. As in Professor Singh’s
construction, the left and right whiskers end at the
most-extreme data values that are inside the LIF and
UIF, respectively. Data values outside the inner fences
are plotted individually, and those beyond the outer
fences receive larger plotting symbols.
Exploratory data analysis (EDA) does not auto-
matically classify data values as “outliers.” It leaves
that judgment to the analyst, after investigation. Data
values between an inner fence and the corresponding
outer fence are “outside,” and those beyond the outer
fences are “far out.” Thus, Singh’s “mild outliers”
are merely “outside” and his “extreme outliers” are
“far out.”
It is helpful to have an idea of how often samples
of well-behaved (i.e., normal) data contain observa-
tions that are outside or far out. In this null situation,
the percentage of random samples that contain one
CHANCE
63