Professional Documents
Culture Documents
2 Measures of
Variation/DispersionLEARNING OBJECTIVES
LET’S EXAMINE
In previous lessons, we discussed different ways of describing a set of data
by locating the typical value/score point that summarizes set of values shown
upon calculating the measures of central tendency, mean, median and mode.
1. What are the mean median and modal scores of each set of data? 2.
Which set of scores has the lowest variability? medium variability? highest
variability? Explain.
You have noticed that scores in set A, B and C are arranged in
asceding order that makes it easier for us to identify the three measures of
central tendency. Mean, median and modal scores are shown in the table
below:
Data Set Mean Median Mode
Obviously, the above sets of data are normally distributed because they
have equal values in terms of its mean median and mode.
But which set of scores do you think has the lowest variability? medium
variability? Highest variability? Without doing any mathematical computation, it
is evident that Set A has lowest variability because scores are so closed to each
other which made them more homogenous. Set B has medium variability
because scores are a little bit scattered to each other and finally Set C shows the
highest variability because it has the most scattered scores.
The data sets above can be represented graphically with the use of
different normal curves below:
Legend:
Set A
Set B
__
__
Set C
_
Source: https://cyntegrity.com/clinical-data-quality-article/variability-graph/
LET’S EXPLORE
Set A: R = 11- 8
=3
Set B: R = 14 - 6
=8
Set C: R = 19 – 2
= 17
Among the three sets of data above, Set C has the highest
variability in terms of Range.
2. Quartile Deviation (QD)
When you take half of the difference between the 3rd and the 1st
quartiles of a distribution, the result is known as the quartile deviation. Quartile
deviation is far better than range because it considers 50% of the values in a
distribution. This is more desirable to use than the range when the distribution to
be described is skewed or when the median is the only measure of central
tendency that is available.
Quartile deviation
formula:
QD = ��3−��1
2
https://www.slideshare.net/kathy_mac/quartile-deviation-72029607
Example 2. Find the quartile deviation of the set of scores below: Data Set:
Let us recall the method of solving the lower quartile (��1)and the
upper quartile (��3) using the Mendenhall and Sincich Method of Finding
Quartiles (Lesson 4 - Other Measures of Position)
ii. Subtract the 2nd score from the 3rd score: 4-3 =1
iii. Multiply this difference with the decimal part of the answer in i. 1(0.75) =
0.75
nd
iv. Add this product to the 2 score.
3 + 0.75 = 3.75
v. ��1 = 3.75, this implies that 25% of the scores are below or equal to
3.75
b. In solving for ��3, you may follow the steps for determining ��1 above.
i. ��3 = 34(�� + 1)
��3 = 34(10 + 1)
��3 = 34(11)
��3 = 8.25, (or 8.25th score which implies that ��3 is located
between the 8th and the 9th score of the set of data above.)
QD = ��3−��1
2
= 16.25−3.75
2
=12.5 2
= 6.25 (This implies that the difference between the two values ��1 ������
��3 known as the interquartile range or (IQR) is12.5 and half this
difference/variation is 6.25 which is the quartile deviation (QD) of the distribution.
3. The Mean (Average) Deviation (Md)
It is a measure of dispersion derived by computing the average of the
absolute deviations of the individual scores of a set of data from the measure of
central tendency such as the mean. The value of the mean deviation about the
mean is a measure of how closely grouped data values are. It answers the
question, “How close to the mean, on average, are the data values?”
��̅=6+7+8+9+10+10+11+12+13+14
10
��̅= 10
ii. Set-up a table and find the distance of each value from that of the
mean expressing the difference in terms of its absolute value
Score Mean Difference of Absolute value
x ��̅ each value from of each
the deviation
mean |�� − ��̅|
x-��̅
6 10 -4 4
7 -3 3
8 -2 -10 2
9 -1 1
10 0 0
10 0 0
11 1 1
12 2 10 2
13 3 3
14 4 4
iii. Calculate the mean of the absolute deviations which is equal to the
mean deviation.
Md = ∑|��−��̅|
��= 10 = 2
20
With this data set, we can say that the mean is 10 and the
average distance from that mean is 2.0. Note that some numbers are closer
than 2.0 and some are farther. But 2.0 is known as the average distance.
4. Sample Variance (����)
��−1
��̅=8+9+9+10+10+10+11+11+11+11
10
��̅= 10
ii. Set-up a table and find the sum of the squared
distances/differences from each point to the mean.
Score Mean Difference of Squared
x ��̅ each value from difference from
the each point to
mean the mean
x-��̅ (�� − ��̅)2
8 10 -2 4
9 -1 1
9 -1 1
10 0 0
10 0 0
10 0 0
11 1 1
11 1 1
11 1 1
11 1 1
��2 = ∑(��−��̅)2
��−1
��2 = 10
10−1
��2 = 1.11
Formula 2: �� =
2 ��∑��2 2
−(∑��)
��(��−1)
i. Set-up a table and square each value/score
8 64
9 81
9 81
10 100
10 100
10 100
11 121
11 121
11 121
11 121
iii. Find the sample variance by substituting the values obtained for
∑�� and ∑��2found from the table above.
��2 =��∑��2−(∑��)2
��(��−1)
��2 = 10(1010)−(100)2
10 (10−1)
�� = 1.11
2
A small variance such as 1.11 indicates that the data points tend
to be very close to the mean, and to each other.
5. Sample Standard Deviation (s)
Standard deviation (s) looks at how spread out a group of values is from
2
the mean by means of looking at the square root of the variance (�� ).
̅2̅
Formula 1: s=√∑(��−��)
��−1
Formula 2: s = √��∑��2−(∑��)2
�� (��−1)
Since the standard deviation is calculated as the square root of variance,
we can use now the data for variance found in #4 above.
∑(��−��
Variance: �� =
2
̅)2
̅̅2
��(��−1)
10−1 s = √10(1010)−(100)2
s = √10
s = 1.05 s = 1.05
10(10−1)
s = √109 s = √100 90
s = √1.11 s = √1.11
Table 1
Frequency Distribution of the Test Scores in Statistics of Fifty Students
21-25 5
26-30 8
21-25 is the lowest class interval with a
lower boundary of 20.5 31-35 14
36-40 12
41-45 6
51-55 is the highest 46-50 4
class interval with an upper boundary of
54.5
51-55 1
R = ����ℎ - ����ℎ
R = 55.5 – 20.5
R = 35
2. The Quartile Deviation (QD)
The computed values for ��3 and ��1 for the above data (Table 1) were
39.88 and 30.19 respectively. (Refer to previous Lesson 4 - Other Measures of
Position).
Therefore, QD = ��3−��1
2
QD =39.88−30.19
2
QD =9.69
2
QD = 4.85
For grouped data, mean deviation can be solved using the formula below:
Md = ∑��|����−��̅|
��where ∑ denotes taking the sum
f frequency
̅
���� midpoint/class mark �� mean
n total number of observations
̅
Let us add the necessary columns in Table1 such as |���� − ��| and
̅
f|���� − ��| that will help us determine the mean deviation of the given
grouped frequency distribution.
Table 1
Frequency Distribution of the Test Scores in Statistics of Fifty Students
Class Class Midpoint/ Mean Absolute value Product of the
of the frequency and
Interval Frequen Class
��̅ difference of difference of
X cy f mark
���������� ����������
����
��̅ ��̅
̅ ̅
|���� − ��| f|���� − ��|
21-25 5 23 35.2 |23 − 35.2|= 12.2 5(12.2) = 61.0
Note: The mean value of 35.2 (��̅=35.2) is taken from the previous computation of mean
found in Lesson 2.2- Measures of Central Tendency
Md = 298.8
50
Md = 5.98
��2 = ∑��(����−��̅)2
��−1
Using again the data in Table 1 and adding the necessary columns will
make the computation of variance for group data understandable it is because
mathematical computations are completely provided.
Table 1
Frequency Distribution of the Test Scores in Statistics of Fifty Students
Class Class Midpoint/ Mean Square of the Product of the
difference of frequency and
Interval Frequen Class
��̅ the square of
X cy f mark ���������� ��̅
the
���� ̅ difference of
(���� − ��)2
����������
��̅
̅
f(���� − ��)2
i=5 n=50 ̅
∑ f(���� − ��)2 =
2658
��2 = 2658
49
��2 = 54.24
We all know that the standard deviation is just the square root of the
variance, thus,
s = √∑��(����−��̅)2
��−1=
s = √2658
49
s = √54.24
s = 7.36