Professional Documents
Culture Documents
Data Summarization - Measures of Variation
Data Summarization - Measures of Variation
Case 1: Data on scores of 5 students in two sections for the same set of questions
Section A Section B
30 35 40 45 50 10 35 40 55 60
̅̅̅
𝑋𝐴 = 40 ̅̅̅̅
𝑋𝐵 = 40
_____________________ _____________________
Case 2: Data on Monthly Sales of Paint in cans for two brands A and B of paint (From Bluman)
NOTE: The value of the measure of variation is directly proportional to the scattering of the rest of the data
points away from the center. That is, the lesser the value of the measure of variation = the lesser is the scattering,
and, the greater the value of the measure of variation = the wider or greater is the scattering away from the center.
Section A Section B
30 35 40 45 50 10 35 40 55 60
̅̅̅̅
𝑿𝑨 = 𝟒𝟎 ̅̅̅̅
𝑿𝑩 = 𝟒𝟎
RA = H – L = 50 – 30 = 20 RB = H – L = 60 – 10 = 50
Example:
Case 1: MAD for Raw Data
Data on scores of 5 students in two sections for the same set of questions
Reqd: MAD
Section A Section B
30 35 40 45 50 10 35 40 55 60
̅̅̅̅
𝑿𝑨 = 𝟒𝟎 ̅̅̅̅
𝑿𝑩 = 𝟒𝟎
For Section A
x d = x - x̅ |𝑑| = |𝑥 − 𝑥̅ |
Hence: For Section A
30 -10 10 ∑|𝑑| ∑|𝑥 − 𝑥̅ |
35 -5 5 𝑀𝐴𝐷 = =
𝑛 𝑛
40 0 0 30
= =6
45 5 5 5
50 10 10
Σd = 0 𝚺|𝒅| = 𝟑𝟎
3
For Section B
x d = x - x̅ |𝑑| = |𝑥 − 𝑥̅ | Hence: For Section B
10 -30 30 ∑|𝑑| ∑|𝑥 − 𝑥̅ |
𝑀𝐴𝐷 = =
35 -5 5 𝑛 𝑛
40 0 0 70
= = 14
55 15 15 5
60 20 20
Σd = 0 𝚺|𝒅| = 𝟕𝟎
NOTE: The sum of deviations for all the values of x in the raw data is equal to zero. ∑ 𝑑 = 0
∑|𝑓(𝑑)| ∑ 𝑓|(𝑑)|
𝑀𝐴𝐷 = ∑𝑓
= ∑𝑓
Given:
Class f M fM ̅
d= M - 𝑿 |𝒇 𝒅|
LB LU
62.5 73.5 5 68 340 -27.9 139.5
73.5 84.5 14 79 1106 -16.9 236.6
84.5 95.5 18 90 1620 -5.9 106.2
95.5 106.5 25 101 2525 5.1 127.5
106.5 117.5 12 112 1344 16.1 193.2
117.5 128.5 6 123 738 27.1 162.6
Σf = 80 Σf(M) = 7673 Σlf(d)l = 965.6
Variability can also be defined in terms of how close the scores in the distribution are to the middle
of the distribution. Using the mean as the measure of the middle or center of the distribution, the variance is
defined as the average squared difference of the scores from the mean(or squared deviation = d2). The variance
is the measure of dispersion that eliminates negative signs by squaring all deviations.
∑ 𝑑2 ∑(𝑥− 𝜇)2 ∑ 𝑑2
𝜎2 = = = ∑𝑓
𝑁 𝑁
∑ 𝑑2 ∑(𝑥− 𝑥̅ )2 ∑ 𝑑2
𝑠2 = = = ∑ 𝑓−1
𝑛−1 𝑛−1
Where:
N or n = Σf
For Section A
x d = x - 𝐱̅ d2
30 -10 100
35 -5 25
40 0 0
45 5 25
50 10 100
For Section B
x d = x - x̅ d2
10 -30 900
35 -5 25
40 0 0
55 15 225
60 20 400
s2 = Σfd2 / n- 1 = Σfd2 / Σf - 1
Given:
Class F M fM ̅
d= M - 𝑿 d2 fd2
LB LU
62.5 73.5 5 778.41 3892.05
73.5 84.5 14 285.61 3998.54
84.5 95.5 18 34.81 626.58
95.5 106.5 25 26.01 650.25
106.5 117.5 12 259.21 3110.52
117.5 128.5 6 734.41 4406.46
NOTE: In describing the center of the data distribution and the variation or scattering of the rest of the
data points away from the center, the Mean and Standard Deviation are used.
The normal monthly precipitation (in inches) for these same 10 cities is listed here.
Monthly Rainfall or Precipitation in inches = 4.8, 2.6, 1.5, 1.8, 1.8, 3.3, 5.1, 1.1, 1.8, 2.5
NOTE: Standard deviation used as the distance or scattering away from the center (mean) applies to both the distance on the left side
and the right side of the mean.
Example:
a.) scores and number of students within 1 standard deviation from the mean
Soln:
Scores within 1 std. dev =
Number of students = 68% (total no. of students) = 0.68(100) = 68 students
OR: There will 68 students with scores from 32 to 38 (No. of students within + 1s away from 𝑥̅ )
7
b.) scores and number of students within 2 standard deviations from the mean
Soln:
Scores within 2 std. dev =
Number of students = 95% (total no. of students) = 0.95(100) = 95 students
OR: There will 95 students with scores from 29 to 41 (No. of students within + 2s away from 𝑥̅ )
c.) scores and number of students within 3 standard deviations from the mean
Soln:
Scores within 3 std. dev =
Number of students = 99.7% (total no. of students) = 0.997(100) = 99.7 or 100 students
OR: There will 100 students with scores from 26 to 44 (No. of students within + 3s away from 𝑥̅ )
Hence: number of students with scores within 32 to 41 = 34% + 47.5 % of the data points
= 81.5% (100 students) = 81.5 or 82 students
Hence: number of students with scores within 38 to 41 = 47.5 % - 34% of the data points
= 13.5% (100 students) = 13.5 or 14 students