Professional Documents
Culture Documents
Cumulative Frequency and Box Plots
Cumulative Frequency and Box Plots
a Mean Median
𝟐𝟒 +𝟐𝟒+𝟐𝟓+ …+𝟏𝟗𝟐 (the middle value!)
?𝟏𝟏Calculation
=£ 𝟒𝟗 , 𝟏𝟖𝟐 ? Calculation
Advantage: More reflective ?
of a typical worker.
Advantage:
?
This average takes all the values Disadvantage: Ignores all values other than the
into account.
Disadvantage: This average is not very representative
centre one. ?
of the ‘typical’ worker – only two of
?
the workers have a salary higher
than this average!
Starter
Here are the salaries of 11 people working at a firm.
(a) How could we work out the ‘average’ salary. What is the
disadvantage/advantage of each approach?
(b) How could we numerically describe how ‘spread out’ the
salaries are? How useful is the value you have?
b One good measure of spread is the ‘width’ of the data, i.e. the difference
between the smallest and largest value – this is known as the range.
The more spread out the data, the greater the range.
£168,000 does not effectively represent how spread out the data is: most the
employee’s salaries don’t vary by very much…
Starter
Here are the salaries of 11 people working at a firm.
(a) How could we work out the ‘average’ salary. What is the
disadvantage/advantage of each approach?
(b) How could we numerically describe how ‘spread out’ the
salaries are? How useful is the value you have?
50%
The height of the person We already know that the The upper quartile is the
25% along the line is median would be the height of the person 75%
known as the: middle person’s height. along the data.
?
lower quartile
50% of the people in the
school would have a
height less than them.
?
50% of the data has a value more than the median.
?
75% of the data has a value more than the lower quartile.
LQ Median UQ
Determining quartiles from listed data
If there are two middle items, the median is the value
halfway between the two.
Here are the ages of 10 people at Pablo’s party. Choose the correct value.
To get LQ: If
there’s an even
number of
items, simply
12, 13, 14, 14, 15, 16, 16, 17, 19, 24
find median of
the first half of
the data.
(Click to vote)
Median: 15 15.5 16
LQ: 13 13.5 14
UQ: 17 18 19
Interquartile As mentioned earlier, the interquartile range 0% 25% 50% 75% 100%
Range: 17 −14=3
?
is how spread out the middle 50% of the data
is. The middle 50% goes from the 25% point
to the 75% point, so we find the difference
middle 50%
LQ Median UQ
1, 2, 3 1? 2? 3?
1, 2, 3, 4 ?
1.5 ?
2.5 ?
3.5
1, 2, 3, 4, 5 1.5
? 3? 4.5
?
1, 2, 3, 4, 5, 6 2? 3.5
? 5?
What if there’s lots of items?
There are 31 items, in order of value. What items should we use for the median and
lower/upper quartiles?
0 1 1 2 4 5 5 6 7 8 10 11 14 15 16 17 18 19 19
29 31 31 37 38 39 39 40 40 41 43 44
Fro Tip: +1 to the number of items. Then find and of this number to get the
position of the LQ, median and UQ respectively.
This trick only works if the number of items if one less than a multiple of 4.
What item numbers should you use?
? 6
(a) Determine the interquartile range of these marks. Answer:
Sunil did the same eleven maths tests.
The median mark Sunil scored in his tests is 17.
The interquartile range is 8.
(b) Which one of Sunil or James has the more consistent marks?
James because he had a lower interquartile range.
?
Median/quartiles from a frequency table
Age (years) Frequency Time taken (s) Freq
8 4
7 9
1 2
We can use the
4 same “quarters 11
of 1 more than
7 the total 13
frequency” trick.
1 Determine the Lower Quartile, Median, Upper Quartile and Interquartile Range of each
list of numbers.
8 [Nov 2017 3H Q1a] The N1 [JMC 2017 Q20] The range of a list of
table shows information integers is 20, and the median is 17.
about the heights of 80 What is the smallest possible number of
children. Find the class integers in the list?
interval that contains Answer: 2 numbers? (7 and 27)
the median.
? N2 [Hamilton 2016 Q5] James chooses five
different positive integers, each at most
eight, so that their mean is equal to
their median.
9 [Edexcel IGCSE Jan2015(R)-3H Q12b] In how many different ways can he do
Here are the marks scored in a test by the girls in class 8C. this?
2 8 10 12 15 16 16 10 ways
?
17 18 19 20
(a) Find the interquartile range of this data.
8 marks ?
(b) The boys in class 8C did the same test.
The boys’ marks had a range of 19 and an interquartile
range of 11 marks. Gareth says that the girls’ marks are
more spread out than the boys’ marks. Is Gareth right?
No, IQR/range for boys is greater than girls.
?
Box Plots
Box Plots allow us to visually represent the distribution of the data.
IQR
0 5 10 15 20 25 30
0 5 10 15 20 25 30
Age (years)
0 4 8 12 16 20 24
Sketch
Further Test Your Understanding
[Edexcel GCSE June2014-1H Q16b] Colin took a sample of 80 football players.
He recorded the total distance, in kilometres, each player ran in the first half of their
matches on Saturday.
Colin drew this box plot for his results.
There were 80 players in Colin’s sample.
Work out the number of players who ran a distance of more than 5.6 km.
Croydon
Kingston
?
0 2 4 6 8 10 12
Weight (kg)
Exercise 2 (on provided sheet)
5 [Edexcel GCSE June 2017 1H Q19 Edited] 6 [Edexcel GCSE Jun2016-1H Q15b]
Some students were asked how many times The box plots give information about the wages
they each used their mobile phones last week. of a group of 16 year old workers and a group of
The box plots give information about the male 18 year old workers.
students’ answers and about the female There are 200 workers who are 16 years old.
students’ answers. Work out an estimate for the number of these
The median time spent on mobile phones is workers whose wages are £130 or more.
the same for male and female students (both
40 times).
? ?
The female students had a higher
interquartile range (42 vs 31).
Exercise 2 (on provided sheet)
The median height of the Year 11 girls is greater than the Year 7 girls. The
range of the heights of the Year 7 girls is greater than the Year 11 girls.
?
20
21
27
30
Cumulative Frequency Tables for Grouped Data
Cumulative Freq
Convert to
cumulative ?
frequency table.
?
130 < h≤ 150 ?
15
130 <?
h≤ 160 ?
39
130 <?
h≤ 170 ?
61
130<?h≤ 180 ?
80
?
There’s 32 runners, so the middle runner is the 16 th (for cumulative
28 frequency graphs we can simply half the total frequency, without the usual Median = 10.07s
adjustment for listed values). So use line graph to estimate 16 th value.
Cumulative Frequency
24
Lower Quartile
20 We don’t know what time ?
= 9.95s
the fastest runner had, but
we can be certain that 1
16 We know 0
runner had a time of “up
runners had
a time of “up to” 9.7 secs. We use the Upper Quartile
12 to” 9.6 secs.
Always plot
endpoint of each interval.
?
= 10.13s
the lowest Mark schemes allow either
8 value using a
CF of 0.
a curved line or joining each
pair of points with a straight Interquartile Range
4
line. (I do the latter)
?
= 0.18s
A quarter of 32 is
8, so look up 8th
0 runner.
24
Estimate how many
20 runners had a time more
than 9.95
16
Observe that this time 32 – ?
8 = 24 runners
we’re looking up values
12 on the time () axis.
Estimate how many
8 runners had a time
between 9.8s and 10s
4
11 –?3 = 8 runners
0
9.5 9.6 9.7 9.8 9.9 10.0 10.1 10.2 10.3
Time (s)
Time (s) Frequency Cum Freq
Students often get confused
between cumulative frequency 9.6 < t ≤ 9.7 1 1 Plot
graph and frequency polygons, so be
aware of their differences!
9.7 < t ≤ 9.9 4 5 Plot
9.9 < t ≤ 10.05 17 22 Plot
Sketch Line
10.05 < t ≤ 10.2 10 32 Plot
32 18
28 16
Frequency
24 14
20 12
16 10
12 8
8 4
4 2
0 0
9.5 9.6 9.7 9.8 9.9 10.0 10.1 10.2 10.3 9.5 9.6 9.7 9.8 9.9 10.0 10.1 10.2
5?
?
23
?
35
?
39
?
40
?
179
2
?
34
?
Lower Quartile = 16
?
Upper Quartile = 44.5
We previously found:
Minimum = 9, Maximum = 57, LQ = 16, Median = 34, UQ = 44.5
44
100
25< 𝐴?≤ 35
?
134
153
160
?30
?
40.9 − 24.1=16.8
(e) Another IT company has 80 employees.
0 10 20 30 40 50 60 70
A g e ( A ) in y e a rs
4 [Edexcel GCSE June2006-3I Q25c, June2006-5H Q14c]
A company tested 100 batteries. The table shows
(b) On the grid, draw a
information about the number of hours that the
cumulative frequency graph for
batteries lasted. Time (t hours) Frequency
50 ≤ t < 55 12
your completed table.
55 ≤ t < 60 21
60 ≤ t < 65 36
65 ≤ t < 70 23
70 ≤ t < 75 8
140
120
Use the cumulative frequency curve to
find an estimate for the
100 (i) median weight,
Accept 72-74 kg
?
80
(ii) interquartile range of the weights.
C u m u lativ e
Accept 14-17kg (lines for quartiles
? the diagram)
freq u en cy
60
must be shown on
40
20
0
40 50 60 70 80 90 100
Weig h t (k g )
6
C?
D?
B?
A?
CARD SORT SOLUTIONS
?
? ?
Cumulative
Description Box Plot
Frequency Graph
?
? ?
Cumulative
Description Box Plot
Frequency Graph
?
? ?
Cumulative
Description Box Plot
Frequency Graph
?
? ?
Cumulative
Description Box Plot
Frequency Graph
?
? ?
Cumulative
Description Box Plot
Frequency Graph
?
? ?
Cumulative
Description Box Plot
Frequency Graph
?
? ?
Cumulative
Description Box Plot
Frequency Graph
?
? ?
Cumulative
Description Box Plot
Frequency Graph
The Whole Picture...
Frequency
Histogram Polygon
Grouped Cumulative
Widths (cm): Frequency Table Frequency Table
4, 4, 7, 9, 11, 12, 14, 15, Width (cm) Frequency Width (cm) Cum Freq
Determine
Median/LQ/UQ Median/LQ/UQ Cumulative
class interval Frequency
Graph
Estimate of
Median/LQ/UQ/num
Box Plots values in range