Professional Documents
Culture Documents
Frequency Distributions
and Their Graphs
1 of 149
Section 2-2 Objectives
2 of 149
Frequency Distribution
5 of 149
Constructing a Frequency Distribution
Midpoint of a class
(Lower class limit) + (Upper class limit)
2
59–114 5 5
115–170 + 8 13
171–226 + 6 19
Relative Cumulative
Class Frequency, f Midpoint frequency frequency
59–114 5 86.5 0.17 5
115–170 8 142.5 0.27 13
171–226 6 198.5 0.2 19
227–282 5 254.5 0.17 24
283–338 2 310.5 0.07 26
339–394 1 366.5 0.03 27
395–450 3 422.5 0.1 30
Σf = 30 f
∑ ≈1
n
Frequency Histogram
• A bar graph that represents the frequency distribution.
• The horizontal scale is quantitative and measures the
data values.
• The vertical scale measures the frequencies of the
classes.
• Consecutive bars must touch.
Class Frequency,
Class boundaries f
59–114 58.5–114.5 5
115–170 114.5–170.5 8
171–226 170.5–226.5 6
227–282 226.5–282.5 5
283–338 282.5–338.5 2
339–394 338.5–394.5 1
395–450 394.5–450.5 3
Class Frequency,
Class boundaries Midpoint f
59–114 58.5–114.5 86.5 5
115–170 114.5–170.5 142.5 8
171–226 170.5–226.5 198.5 6
227–282 226.5–282.5 254.5 5
283–338 282.5–338.5 310.5 2
339–394 338.5–394.5 366.5 1
395–450 394.5–450.5 422.5 3
You can see that more than half of the GPS navigators are
priced below $226.50.
Frequency Polygon
• A line graph that emphasizes the continuous change
in frequencies.
frequency
data values
frequency
relative data values
From this graph you can see that 27% of GPS navigators are
priced between $114.50 and $170.50.
© 2012 Pearson Education, Inc. All rights reserved. 28 of 149
Graphs of Frequency Distributions
cumulative
frequency
data values
© 2012 Pearson Education, Inc. All rights reserved. 29 of 149
Constructing an Ogive
From the ogive, you can see that about 25 GPS navigators cost
$300 or less. The greatest increase occurs between $114.50 and
$170.50.
© 2012 Pearson Education, Inc. All rights reserved. 33 of 149
Section 2.2 Summary
Class Frequency, f
1–4 4
Upper 5–8 5
Lower
Class
Class 9 – 12 3 Frequencies
Limits
Limits 13 – 16 4
17 – 20 2
Class Frequency, f
1–4 4
5–1=4 5–8 5
9–5=4 9 – 12 3
13 – 9 = 4 13 – 16 4
17 – 13 = 4 17 – 20 2
The class width is 4.
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 38
Constructing a Frequency Distribution
Example continued:
3. The minimum data entry of 18 may be used for the
lower limit of the first class. To find the lower class
limits of the remaining classes, add the width (8) to each
lower limit.
The lower class limits are 18, 26, 34, 42, and 50.
The upper class limits are 25, 33, 41, 49, and 57.
Midpoint = 1 + 4 = 5 = 2.5
2 2
Relative
Class Frequency, f
Frequency
1–4 4 0.222
∑f = 18
Relative frequency = f = 4 ≈ 0.222
n 18
Larson & Farber, Elementary Statistics: Picturing the World, 3e 43
Relative Frequency
Example:
Find the relative frequencies for the “Ages of Students”
frequency distribution.
Relative Portion of
Class Frequency, f Frequency students
18 – 25 13 0.433 f = 13
26 – 33 8 0.267 n 30
34 – 41 4 0.133 ≈ 0.433
42 – 49 3 0.1
50 – 57 2 0.067
f 1
∑f =
30 ∑ =
n
Larson & Farber, Elementary Statistics: Picturing the World, 3e 44
Cumulative Frequency
The cumulative frequency of a class is the sum of the
frequency for that class and all the previous classes.
Ages of Students
Cumulative
Class Frequency, f Frequency
18 – 25 13 13
26 – 33 +8 21
34 – 41 +4 25
42 – 49 +3 28
Total number
50 – 57 +2 30 of students
∑f =
30
14 13 Ages of Students
12
10
8
8
f 6
4
4 3
2 2
0
17.5 25.5 33.5 41.5 49.5 57.5
Broken axis
Age (in years)
Larson & Farber, Elementary Statistics: Picturing the World, 3e 47
Frequency Polygon
A frequency polygon is a line graph that emphasizes the
continuous change in frequencies.
14
Ages of Students
12
10
8 Line is extended
to the x-axis.
f 6
4
2
0
13.5 21.5 29.5 37.5 45.5 53.5 61.5
Broken axis
Age (in years) Midpoints
0.5
0.433
(portion of students)
Relative frequency
30 Ages of Students
Cumulative frequency
(portion of students)
24
18
The graph ends
at the upper
12 boundary of the
last class.
6
0
17.5 25.5 33.5 41.5 49.5 57.5
Age (in years)
Larson & Farber, Elementary Statistics: Picturing the World, 3e 50
Section 2-4
Statistical Graphics
51
Section 2-4 Objectives
Stem-and-leaf plot
• Each number is separated into a stem and a leaf.
• Similar to a histogram.
• Still contains original data values. 26
From the display, you can conclude that more than 50% of the
cellular phone users sent between 110 and 130 text messages.
© 2012 Pearson Education, Inc. All rights reserved. 56 of 149
Graphing Quantitative Data Sets
Dot plot
• Each data entry is plotted, using a point, above a
horizontal axis.
Data: 21, 25, 25, 26, 27, 28, 30, 36, 36, 45
26
20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45
From the dot plot, you can see that most values cluster
between 105 and 148 and the value that occurs the
most is 126. You can also see that 78 is an unusual data
value.
© 2012 Pearson Education, Inc. All rights reserved. 59 of 149
Graphing Qualitative Data Sets
Pie Chart
• A circle is divided into sectors that represent
categories.
• The area of each sector is proportional to the
frequency of each category.
Relative
Type of degree Frequency, f frequency Central angle
Associate’s 728 0.24 360º(0.24)≈86º
Relative Central
Type of degree frequency angle
Associate’s 0.24 86º
Bachelor’s 0.51 184º
Master’s 0.20 72º
First professional 0.03 11º
Doctoral 0.02 7º
From the pie chart, you can see that over one half of the
degrees conferred in 2007 were bachelor’s degrees.
Pareto Chart
• A vertical bar graph in which the height of each bar
represents frequency or relative frequency.
• The bars are positioned in order of decreasing height,
with the tallest bar positioned at the left.
Frequency
Categories
© 2012 Pearson Education, Inc. All rights reserved. 66 of 149
Example: Constructing a Pareto Chart
Millions of dollars
Admin. error 5.4 15
Employee 10
15.9
theft
5
Shoplifting 12.7
0
Vendor fraud 1.4 Employee Shoplifting Admin. Error Vendor
Theft Cause fraud
Interpretation
From the scatter plot, you can see that as the petal
length increases, the petal width also tends to
increase.
Time Series
• Data set is composed of quantitative entries taken at
regular intervals over a period of time.
e.g., The amount of precipitation measured each
day for one month.
• Use a time series chart to graph.
Quantitative
data time
© 2012 Pearson Education, Inc. All rights reserved. 73 of 149
Example: Constructing a Time Series
Chart
• 58, 30, 37, 36, 34, 49, 35, 40, 47, 47,
• 39, 54, 47, 48, 54, 50, 35, 40, 38, 47,
• 48, 34, 40, 46, 49, 47, 35, 48, 47, 46
79
Make a line plot of the ages.
80
• This graph shows all the ages of the
people who live in the apartment
building. It shows the youngest
person is 30, and the oldest is 58.
Most people in the building are over
46 years of age. The most common
age is 47.
81
• Line plots allow several features of
the data to become more obvious. For
example, outliers, clusters, and gaps
are apparent.
82
• Outliers are data points whose values are
significantly larger or smaller than other
values, such as the ages of 30, and 58.
• Clusters are isolated groups of points, such as
the ages of 46 through 50.
• Gaps are large spaces between points, such as
41 and 45.
83
Dot Plot.
Cluster
Outliers Outliers
84
Dot Plot
18 20 21 27 29 20
19 30 32 19 34 19
24 29 18 37 38 22
30 39 32 44 33 46
54 49 18 51 21 21
Continued.
85
Dot Plot
Ages of Students
15 18 21 24 27 30 33 36 39 42 45 48 51 54 57
61, 61, 64, 67, 70, 71, 71, 71, 72, 73, 74, 74, 75,
77, 79, 80, 81, 81, 83
= 1 person
61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83
Time in Seconds
88
Are there any clusters?
61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83
Time in Seconds
89
Are there any gaps?
= 1 person
61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83
Time in Seconds
90
Stem-and-Leaf Plot
In a stem-and-leaf plot, each number is separated into a
stem (usually the entry’s leftmost digits) and a leaf (usually
the rightmost digit). This is an example of exploratory data
analysis.
Example:
The following data represents the ages of 30 students in a
statistics class. Display the data in a stem-and-leaf plot.
Ages of Students
18 20 21 27 29 20
19 30 32 19 34 19
24 29 18 37 38 22
30 39 32 44 33 46
54 49 18 51 21 21 Continued.
Stem-and-Leaf Plot
Ages of Students
Key: 1|8 = 18
1 888999
2 0011124799 Most of the values lie
3 002234789 between 20 and 39.
4 469
5 14
This graph allows us to see
the shape of the data as well
as the actual values.
92
Stem-and-Leaf Plot
Example:
Construct a stem-and-leaf plot that has two lines for each
stem.
Ages of Students
1 Key: 1|8 = 18
1 888999
2 0011124
2 799
3 002234
3 789 From this graph, we can
4 4 conclude that more than 50%
4 69 of the data lie between 20
5 14 and 34.
5
93
Pie Chart
A pie chart is a circle that is divided into sectors that represent
categories. The area of each sector is proportional to the frequency of
each category.
Accidental Deaths in the USA in 2002
Type Frequency
Motor Vehicle 43,500
Falls 12,200
Poison 6,400
Drowning 4,600
Fire 4,200
Ingestion of Food/Object 2,900
(Source: US Dept. of Firearms 1,400 Continued.
Transportation)
94
Pie Chart
To create a pie chart for the data, find the relative frequency
(percent) of each category.
Relative
Type Frequency
Frequency
Motor Vehicle 43,500 0.578
Falls 12,200 0.162
Poison 6,400 0.085
Drowning 4,600 0.061
Fire 4,200 0.056
Ingestion of Food/Object 2,900 0.039
Firearms 1,400 0.019
n = 75,200
Continued.
Pie Chart
Next, find the central angle. To find the central angle,
multiply the relative frequency by 360°.
Relative
Type Frequency Angle
Frequency
Motor Vehicle 43,500 0.578 208.2°
Falls 12,200 0.162 58.4°
Poison 6,400 0.085 30.6°
Drowning 4,600 0.061 22.0°
Fire 4,200 0.056 20.1°
Ingestion of Food/Object 2,900 0.039 13.9°
Firearms 1,400 0.019 6.7°
Continued.
Pie Chart
Ingestion Firearms
3.9% 1.9%
Fire
5.6%
Drowning
6.1%
Poison
8.5% Motor
vehicles
Falls 57.8%
16.2%
97
Pareto Chart
A Pareto chart is a vertical bar graph is which the height of
each bar represents the frequency. The bars are placed in
order of decreasing height, with the tallest bar to the left.
Accidental Deaths in the USA in 2002
Type Frequency
Motor Vehicle 43,500
Falls 12,200
Poison 6,400
Drowning 4,600
Fire 4,200
Ingestion of Food/Object 2,900
(Source: US Dept. of Firearms 1,400 Continued.
Transportation)
Pareto Chart
Accidental Deaths
45000
40000
35000
30000
25000
20000
15000
10000
5000
Poison
Motor Firearms
Falls Poison Drowning Fire
Vehicles
Ingestion of
Food/Object
Times Series Chart
A data set that is composed of quantitative data entries taken at regular intervals over a period
of time is a time series. A time series chart is used to graph a time series.
Example:
Month Minutes
The following table lists the number of
minutes Robert used on his cell phone
for the last six months. January 236
February 242
March 188
April 175
May 199
Construct a time series chart for the
number of minutes used.
June 135
Continued.
Times Series Chart
Robert’s Cell Phone Usage
250
200
Minutes
150
100
50
0
Jan Feb Mar Apr May June
Month