Professional Documents
Culture Documents
Statistics Review Worksheet-1a
Statistics Review Worksheet-1a
Important Terms
1. Range is the difference between the lowest and highest values.
2. Classes are the groups the data is organised into. They are also referred to as class intervals.
3. Class limits are the lower and upper limits of the class.
4. Each class has an upper class boundary and a lower class boundary. For the lower class boundaries,
we subtract 0.5 from the lower class limit and for the upper class boundaries; we add 0.5 to the upper
class limit.
5. The class width is the difference between the upper class boundary and lower class boundary.
6. The midpoint of the class is the middle of the upper and lower class boundaries (i.e, the median).
The mean
The mean is the average of the numbers: a calculated "central" value of a set of numbers.
To calculate: add up all the numbers, then divide by how many numbers there are.
Mean =sum of elements/number of elements OR
Mean=
∑ fx
∑ f , where f – frequency, x is the variable and ∑ ¿ ¿ is summation.
The median
The median is the middle number. To find the median number:
Put all the numbers in numerical order.
If there is an odd number of results, the median is the middle number.
If there is an even number of results, the median will be the mean of the two central numbers.
The mode
The mode is the number which occurs most often in a set of data. There can be more than one mode.
Question 1
Question 2
Question 3
Question 4
A boy height was measured over several years and the data were recorded below.
Age 9 11 13 15 17 19 21
(years)
Question 5
The table below shows the number of points gained by the team in a series of cricket matches.
No. of points 0 1 2 3 4 5 6 7 8 9 10
Frequency 3 2 0 5 6 7 2 1 0 4 1
fx
(d) Calculate the range, mean and interquartile range of the data
Question 6
The heights of 50 students correct to the nearest centimeter are shown in the frequency table below.
Height (cm) Frequency
150 1
151 5
152 10
153 16
154 10
155 6
156 2
(a) Calculate the mean, mode, median and range.
(b) Calculate the probability that if a student is selected at random he/she is
(i) shorter than 152cm
(ii) taller than 154cm.
Question 7
The masses of 100 students to the nearest kilogram are given in the frequency table below.
Question 8
The table below shows the distribution of masses of 100 adults measured to the nearest kilogram.
Mass (kg) Frequency (f) Midpoint (x) fx
8170/100 =81.7 kg
Question 9
The frequency distribution of the length of 100 steel rods measured in mm is given in the table below.
100-104 4
105-109 9
110-114 10
115-119 17
120-124 25
125-129 21
130-134 9
135-139 5
1. Pictograph
A Pictograph is a way of showing data using images. Each image stands for a certain number of
things.
2. Pie Chart
Pie charts are useful to compare different parts of a whole amount. A pie chart is a circular chart
in which the circle is divided into sectors.
3. Line Graph
A line graph is often used to represent a set of data values in which a quantity varies with time.
These graphs are useful for finding trends.
x axis – Time for eg years y axis – Amount of an item for eg Income
4. Bar Chart
Bar charts are often used to present data in a pictorial form to illustrate the information collected
and highlight important points.
Bar charts are drawn with parallel bars placed vertically (or horizontally).
The width of each bar and the spacing between the bars are kept the same to avoid giving a
misleading representation.
The height of the bar is drawn to scale to represent the amount of the item.
x axis – Variable eg Favourite Sport y axis – Frequency
6. Frequency Polygon
In a Frequency Polygon, a line graph is drawn by joining all the midpoints of each class.
A frequency polygon gives the idea about the shape of the data distribution.
The two end points of a frequency polygon always lie on the x-axis and they represent the midpoint
of the class that would come before the first class and the midpoint of the class that would come
after the last class.
x axis – Variable (Class Midpoint) y axis – Frequency
7. Cumulative Frequency Curve
A Cumulative Frequency Graph is a graph plotted from a cumulative frequency table. A cumulative
frequency graph is also called an ogive.
x axis – Variable (Upper Class Boundaries) y axis – Cumulative Frequency
We know that the median divides the data into two halves. We also know that for a set of n
ordered numbers the median is the (n + 1) ÷ 2 th value.
The lower quartile divides the bottom half of the data into two halves, Lower quartile is the
(n + 1) ÷ 4 th value.
The upper quartile also divides the upper half of the data into two halves. Therefore Upper
quartile is the
3 (n + 1) ÷ 4 th value.
The interquartile range is the difference between the upper quartile and lower quartile.