Professional Documents
Culture Documents
O.M. “ Another method for structuring numerical data is to organize the values into
groups. This enables efficient and compact data representation. However, it
does complicate the analytical process somewhat. Seeing is believing so stay
tuned to witness the complex methods that are applied when analyzing grouped
data. Also look out for tips involving how to discuss the data that has been
presented. ”
Points Number of
students
40 – 49 8
Modal class
50 – 59 16
60 – 69 20
70 – 79 18
80 – 89 14
90 – 99 4
Next , we construct a histogram in the following manner
The modal class is 60 – 69 which has an upper class boundary of 69.5 and
a lower class boundary of 59.5. The frequency of that class is 20. The intersection
of the two constructed lines in the modal class bar determine the mode.
Hence, the mode is 66.1 points (See histogram above)
(c) To calculate the mean, we also need additional columns and calculations:
Points Number of
students
40 – 49 8
50 – 59 16
60 – 69 20
70 – 79 18
80 – 89 14
90 – 99 4
Solution:
(a)
Points Number of
students
Lower class boundary
of first class – 49
39.5 40 8
50 – 59 16
60 – 69 20
70 – 79 18
80 – 89 14
90 – 99 99.5 4
the range = upper boundary of last class – lower boundary of first class
= 99.5 – 39.5 = 60 points.
(b) (i) The calculation of the interquartile range requires the use of a
cumulative frequency curve.
Upper quartile = Q3 = ¾ (80)th frequency = 60th frequency
Lower quartile = Q1 = ¼ (80)th frequency = 20th frequency
See diagram below for the position of these quartiles:
Q1 = 57
Q3 = 78.4
∑ 𝑓𝑥 2
(i) The variance = s2 = – (𝑥̅ )2 // 𝑥̅ is the mean of the distribution. `
𝑛
See Example 1(a) part (c) //
382 160
= – (67.75)2
80
= 4 777 – 4 590.0625
= 186.9375
≈ 186.9 points (1 d.p.)
∑ 𝑓𝑥 2
(ii) the standard deviation = s = √ – (𝑥
̅ )2
𝑛
= √186.9375
≈ 13.7 points (3 s.f.)
(iii) The first step in constructing a boxplot is to calculate the 5-point data
summary
Minimum value = 0|2 = 2
#2 When commenting on boxplots, always speak to the median line, the length
of the box(interquartile range) and the width of the two whiskers.
TAKE-AWAYS
• Grouped data refers to data organized into fixed class intervals of an
attribute along with their respective frequencies.
• Calculating measures of central tendency for grouped data involves the
use of the histogram, cumulative frequency curve and additional
columns in the grouped frequency tables for calculation purposes. (See
Example 1(a))
• Calculating measures of dispersion for grouped data involves the
calculation of boundaries for each class interval, the cumulative
frequency curve and additional columns in the grouped frequency tables
for calculation purposes . (See Example 1(b))
• Commit Pro Tips #1 and #2 to memory so that you can apply them when
asked to do commentary.