Professional Documents
Culture Documents
Mean Median Mode
Mean Median Mode
Lesson
A data set such as that shown in the following histogram
displays a fairly obvious center: the center bar. If you are
familiar with averages (means), you can probably already point
to the average of the data, which is the central (and tallest) bar
in the graph (assuming that the data values to which the bars
correspond are evenly distributed, as would be the case in a
histogram)
In this data set, the tallest peak is not at the center. If you
calculated the average of this data set, you'd also find that the
average doesn't correspond with the tallest peak. As a result,
we must not only do our math carefully, we must also carefully
select what kind of math we do so that we accurately represent
the data. Having shown why an average is not always the best
statistic to use when characterizing a data set, we can now turn
to the definition and use of (this and) other measures of central
tendency.
Mean (Average)
Data
Data Frequency
Frequency Value
Value
1 1
1 1
2 7
2 2
3 20
3 4
4 8 4 15
5 16 5 12
6 8 6 9
7 4 7 6
8 2 8 3
9 1 9 1
Table 1 Table 2
Using the mean formula for data with associated frequencies,
we calculate the mean of the data in Table 1 as 5. The mean
for the data in Table 2 is 4.38. Obviously, the mean in the case
of Table 1 does a good job of describing the data: the data
value 5 is the most frequent value, and the other values show
progressively lower frequencies. Thus, the mean shows the
central tendency of the data set in this case. In the case of
Table 2, the mean doesn't do such a good job: the most
frequent value is 3, but the mean is between two less frequent
values (4 and 5). As such, we must consider other measures of
central tendency for non-symmetric data sets.
Mode
{8, 1, 2, 0, 3, 6, 2, 8, 4, 5, 6, 1, 8, 6, 3, 9, 0, 9}
Solution: The mode is the data value (or values) that occurs
most frequently. One way to find the mode is to draw a graph of
the data (such as a histogram) and find the highest point on the
graph. Alternatively, we can order the data set and look to see
which value is the mode.
{0, 0, 1, 1, 2, 2, 3, 3, 4, 5, 6, 6, 6, 8, 8, 8, 9, 9}
Median
{24; 42; 64; 38; 49; 30; 34; 29; 2,350; 1,932, 61, 52, 51, 19, 28}
{19; 24; 28; 29; 30; 34; 38; 42; 49; 51; 52; 61; 64; 1,932; 2,350}
Let's now compare this result, 42, with the mean. Using the
formula given above, we calculate the mean of this data set as
approximately 320. Note carefully that the mean in this case is
well above the incomes of the majority of the people from
whom these data were taken--only 2 people in the group make
at least the mean income, whereas 13 people (the vast
majority) make far less than the mean income. The median
income, however, does a much better job of expressing the
central tendency of the data. If we were to ignore the two
individuals with extremely high incomes, we would find the
mean income of the remaining individuals to be about 40, which
is close to the median income.
{102, 403, 729, 843, 920, 360, 842, 941, 357, 483, 207, 670,
471, 109}
Solution: First, order the data. Note that because the set has 14
members, the median is the mean of two central values. These
values are underlined in the ordered set below.
{102, 109, 207, 357, 360, 403, 471, 483, 670, 729, 842, 843,
920, 941}