Professional Documents
Culture Documents
Learning Outcomes: At the end of this module, you are expected to:
LEARNING CONTENT
Introduction:
The previous lessons discussed the different methods of data collection as well as the different
methods of organizing, summarizing, and presenting data which includes tables and graphs. Although
tables and graphs are extremely useful in data presentation, they do not allow us to make concise,
quantitative statements that characterize a distribution as a whole. In order to do this, we have to use
descriptive statistics. In describing numerical data, the type of descripting is determined by the nature
of the data themselves and the objective(s) or purpose(s) of the description. In this topic, we are going
to discuss the measure of central tendency and position. Enjoy learning!
Lesson Proper:
Descriptions of statistical data can be quite brief or elaborate depending on the nature of the
data or what we intend to do. Sometimes, presenting data as they are and letting them speak for
themselves may be quite satisfactory by data further summarized by means of appropriate statistical
description give more useful information. One of these appropriate statistical description give more
useful information. One of these appropriate statistical descriptions is the measure of central tendency.
The measure of central tendency of a given set of data is the value around which the whole set
of data tends to cluster. It is represented by a single number which summarizes and describes the
whole set.
The most commonly used measures of central tendency are the mean, median, and mode.
Where: 𝑋̅ = sample mean; ∑= symbol for “summation”; 𝑋𝑖 = ith individual observation; n = total
no. of observations
Example :
A pediatrician had 9 patients on a particular clinic day. The weights (in kilograms) of her patients
on that day were as follows: 7, 17, 12.6, 15.7, 16, 16, 11.7, 17.5, and 12.6. Compute and interpret the
mean.
Solution:
𝑛
𝑖=1∑𝑋𝑖 7 + 17 + ⋯ + 12.6 143.8
𝑋̅ = = = = 14.01 𝑘𝑔
𝑛 9 9
Interpretation:
The Median
The median is the midpoint of the distribution. Half of the value in the distribution fall below the
median and the other half above it. For distributions having an even number or arrayed observed
values, the median is the average of the two middle most value; but, for odd number of arrayed
observations, it is the middlemost value.
The median is the most appropriate locator of the center since it has resistance to extreme
values. It is a positional average; hence, its value depends on its position relative to the number of
observations in the array and on the number of items in the distribution. The median is sometimes
denoted by 𝑋̃ or Mdn. The steps are:
Example 1:
Solution:
1. Arrange the weights of the patients as follows: 7, 11.7, 12.6, 12.6, 15.7, 16, 16, 17, 17.5, 17.7
𝑛 𝑛 10
2. Solve for 2 . 2 = 2 = 5.
(𝑛⁄2)+(𝑛⁄2+1)𝑡ℎ (5𝑡ℎ+6𝑡ℎ 𝑜𝑟𝑑𝑒𝑟𝑒𝑑 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠) 15.7+16.0
3. Since it is an integer, 𝑋̃𝑘 = = 𝑋̃𝑘 = = =
2 2 2
15.85 𝑘𝑔
Interpretation:
Half of the patients weigh less than or equal to 15.85 kg while the other half weigh more than
15.85 kg.
Example 2:
Consider the 11 patients admitted to a psychiatric ward of a general hospital who experienced
the following lengths of stay (in days) were as follows: 29, 14, 11, 24, 14, 14, 28, 14, 18, 22, 14. Find
and interpret the median length of stay of the patients.
Solution:
1. Arrange the weights of the patients as follows: 11, 14, 14, 14, 14, 14, 18, 22, 24, 28, 29.
𝑛
2. Solve for 2 .
𝑛 11
= = 5.5 = 𝑋̃ = 6𝑡ℎ 𝑜𝑟𝑑𝑒𝑟𝑒𝑑 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛 = 14 𝑑𝑎𝑦𝑠.
2 2
Interpretation:
Half of the patients length of stay is less than or equal to 14 days while the other half of the
patients length of stay is more than 14 days.
The Mode
The mode, denoted by 𝑥̂, of a give set of ungrouped data is the value that occurs most frequently.
The mode is not a unique measure since two or more values may occur more frequently in a given
distribution.
Example:
A pediatrician had 10 patients on a particular clinic day. The weights (in kilograms) of her patients
on that day were as follows: 7, 17, 12.6, 15.7, 16, 16, 11.7, 17.5, and 12.6. Find the mode weight of the
patients. Interpret it.
Solution:
MELS 1073 – Biostatistics & Epidemiology (Lecture) | 3
By inspection, the most frequent weights of the patients are 12.6 kg and 16 kg; hence the mode
is
𝑥̂ = 12.6 𝑘𝑔 𝑎𝑛𝑑 16 𝑘𝑔
Interpretation:
1. The mean is most frequently used measure of location since it reflects every value and
has the characteristics of simplicity, uniqueness and stability from sample to sample in a
distribution. However, when the distribution contains very large or very small values, it
can be misleading, while the median, on the other hand, is the most appropriate locater
of central tendency since it is the midpoint of the distribution and is not influenced by
extreme values, large or small, but by the number of observations in a given set.
2. In a symmetrical distribution (normal curve), where there is only one mode, the mean, the
median, and the mode have equal values and coincide at the highest point on the graph
and they all lie on the axis of symmetry.
3. In every given set of distribution, a unique value of the mean and of the median exist while
the mode, unlike the mean and the median, does not always exists nor is it unique—two
or more value may occur in a given distribution.
4. In a symmetrical distributions, the position of these measures varies. In a negatively
skewed (skewed to the left) distribution, the median lies to the left of the mode and the
mean to the left of the median, while in the positively skewed (skewed to the right), the
median lies to the right of the mode and the mean to the right of the median.
5. The mean is the most significant and widely used measures of averages. The median, on
the other hand, can be determined even for qualitative data as long as they can be
ordered, while the mode is most preferrable in getting the most typical average, since it
is the value that occurs frequently in a series.
1. Nominal data;
2. Solving for the most typical average since it is the value that occur most frequently in a series,
and;
3. A quick or rough estimate of a central value
The mean, median and mode can describe the characteristics of a given distribution.
Whether the curve is symmetrical, positively skewed, or negatively skewed, the area under the
curve to the left of the median is equal to the right; and no matter what the shape of the curve, the mode
is always located at the highest point.
Measures of Position
The quantiles or fractiles are point measures. There are quantile, decile, and percentile which
divide the distribution into a given number of equal parts. The most commonly and widely used quantile
is the percentile. Quartile and decile are seldomly used.
Quartiles
Quartiles are measures that divide the observations into four equal parts. Twenty-five percent
(25%) falls below the first quartiles, fifty percent (50%) is below the second quartile, and seventy-five
percent (75%) is below the third quartile. Quartiles are computed in the same as the median is
computed, since the second quartile is the same as the median. The interpretation of the obtained value
of the quartiles follows the interpretation of the median value.
The steps in finding the quartiles from raw data are as follows:
The individual ages (in years) of 10 patients entering the general hospital are as follows: 15, 31,
75, 84, 19, 79, 74, 78, 79, and 29. Determine the quartiles.
Solution:
1. Arrange their individual ages as 15, 19, 29, 31, 74, 75, 78, 79, 79, and 84.
2. Solve for quartiles:
𝑛𝑘 (10)(1)
= = 2.5 = 𝑄1 = 3𝑟𝑑 𝑜𝑟𝑑𝑒𝑟𝑒𝑑 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠 = 29
4 4
𝑛𝑘 (10)(3)
= = 7.5 = 𝑄3 = 8𝑡ℎ 𝑜𝑟𝑑𝑒𝑟𝑒𝑑 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠 = 79
4 4
Interpretation:
𝑄1= 29 can be interpreted as one-fourth of the patients are of ages lower or equal to 29 years
while three-fourth of the patients are of ages lower or equal to 79 years. 𝑄2 = 74.5 means that half of
the patients are of ages lower or equal to 74.5 years while another half fall above it.
Deciles
The deciles are measures of position that divide the total number of observations into ten (10)
equal parts. There are nine (9) deciles. Ten percent (10%) falls below the first decile; 20% falls below
the second decile; 30% falls below the third decile; and so on. The fifth decile is the same with median.
Thus, deciles are computed exactly in the same manner as the median is computed. The interpretation
is the same as the median or the quartiles.
The steps in finding the deciles from raw data are as follows:
Solution:
1. Arrange the survival times of 10 patients as 32, 42, 47, 59, 75, 86, 90, 96, 105, 135.
2. Solve for deciles:
Interpretation:
𝐷1 = 37 means that 10% of the survival times of patients fall below or equal to 37 days while the
remaining 90% of patients fall above it. 𝐷5 = 80.5 means that half of the survival times of patients fall
below or equal to 80.5 days while the other half fall above it. 𝐷7 = 93 means that 70% of the survival
times of patients fall below or equal to 93 days while the remaining 30% of patients fall above it. 𝐷9 =
120 means that 90% of the survival times of patients fall below or equal to 120 days while the remaining
10% of patients fall above it.
Percentile
The percentiles are measures of position which divide the total number of observations into
exactly one hundred equal parts. There are 99 percentiles that determine the points below which
percentages of observations would fall. For example, the seventh percentile would indicate that 7% of
the observations in the distribution lies within or below it while 93% lies above it.
The steps in finding the quartiles from raw data are as follows:
Solution:
1. Arrange the length of services (in years) of nine faculty members as 10, 14, 15, 17, 22,
22, 25, 30, and 34
2. Solve for percentiles:
𝑛𝑘 (9)(5)
= = 0.45 = 𝑃5 = 1𝑠𝑡 𝑜𝑟𝑑𝑒𝑟𝑒𝑑 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠 = 10
100 100
𝑛𝑘 (9)(50)
= = 4.5 = 𝑃50 = 5𝑡ℎ 𝑜𝑟𝑑𝑒𝑟𝑒𝑑 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠 = 22
100 100
𝑛𝑘 (9)(95)
= = 8.55 = 𝑃95 = 9𝑡ℎ 𝑜𝑟𝑑𝑒𝑟𝑒𝑑 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠 = 34
100 100
Interpretation:
𝑃5 = 10 means that 5% of the length of service in years of faculty members of the College of
Medicine fall below or equal to 10 years while the remaining 95% of faculty members fall above it. 𝑃50
= 22 means that half of the length of service in years of the faculty members of the College of Medicine
fall below or equal to 22 years while the other half fall above it. 𝑃95 = 34 means that 95% of the length
of service in years of the faculty members of the College of Medicine fall below or equal to 34 years
while the remaining 5% of faculty members fall above it.