You are on page 1of 16

Module 3

Organizing and Presentation of


Data

Prepared by:
EDWARD B. PESCUELA
Instructor

“Most people use statistics like a drunk man uses a lamppost; more for support than
illumination”
― Andrew Lang

MC MATH 13: ELEMENTARY STATISTICS & PROBABILITY |PESCUELA s. 2021


MODULE 3
Organizing and Presentation of Data

Introduction
This is a learning module intended to explain how to organize data by constructing frequency
distributions and how to present the data by constructing charts and graphs. The charts and graphs
illustrated here are histograms, frequency polygons, ogives, pie graphs, Pareto charts, and time series
graphs. A graph that combines the characteristics of a frequency distribution and a histogram, called a
stem and leaf plot, is also explained. Moreover, this course will enhance students’ ability in using statistical
software such as MS Excel to automate data processing. Before each activity, fast facts and discussions
are given to help you understand the concepts and processes involved as well as to solve problems in
each activity. The activities will be done individually. Answers in every assessment must be written or
encoded on a short bond paper following the given format. Please do not forget to write your significant
learning experience at the last part of your output. The submission of Module 3 outputs will be on March
15, 2021. If you have queries, you may reach me through FB Group Chat. Thank you and have fun!

Format

Name: ______________________________________ Year & Section: ________________


Date: _______________________________________ Instructor: Mr. Edward B. Pescuela

MC Math 13: Elementary Statistics & Probability


Module 3: Organizing and Presentation of Data
Pretest/Exercise1/Activity 1
1.)
2.)
3.)
.
.
.

My Significant Learning Experience


In this module, I have learned that …
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________.

_____________________________
Signature over Printed Name

MC MATH 13: ELEMENTARY STATISTICS & PROBABILITY |PESCUELA s. 2021


Objectives
At the end of the week, the pre-service teacher (PST) should be able to:
1. present data through texts, tables, and graphs
2. organize data sets using frequency distribution table
3. represent data in frequency distributions graphically using histograms, frequency
polygons, and ogives
4. use appropriate digital technology in organizing data
5. represent data using graphs, Pareto charts, time series graphs, and pie graphs
6. draw a stem and leaf plot and scatter plot
7. interpret a stem and leaf plot and scatter plot
8. use appropriate digital technology in organizing data

Pretest
Solve the problems being specified in each of the item.
1. Grams per Food Serving The data shown are the number of grams per serving of 30 selected
brands of cakes. Construct a frequency distribution using 5 classes.
32 47 51 41 46 30
46 38 34 34 52 48
48 38 43 41 21 24
25 29 33 45 51 32
32 27 23 23 34 35
Source: The Complete Food Counts.
2. Number of College Faculty The number of faculty listed for a variety of private colleges that offer
only bachelor’s degrees is listed below. Use these data to construct a frequency distribution with
7 classes, a histogram, a frequency polygon, and an ogive. Discuss the shape of this distribution.
What proportion of schools have 180 or more faculty?
165 221 218 206 138 135 224 204
70 210 207 154 155 82 120 116
176 162 225 214 93 389 77 135
221 161 128 310
Source: World Almanac and Book of Facts.
3. Average Global Temperatures The average global temperatures for the following years are
shown. Construct a time series graph and comment on the trend.

Source: National Oceanic and Atmospheric Administration.

4. Twenty Days of Plant Growth The growth (in centimeters) of two varieties of plant after 20 days is
shown in this table. Construct a back-to-back stem and leaf plot for the data, and compare the
distributions.

MC MATH 13: ELEMENTARY STATISTICS & PROBABILITY |PESCUELA s. 2021


9/1/2020

2.1: Categorical (Qualitative) Data Tables and Graphs


Frequency tables
Car Example
Frequency = Count in each category Relative
Frequency
Category Frequency
Relative Frequency = Proportion in each (Count)
(Proportion)
category Ford 5 0.1
• The sum of the relative frequencies Chevy 12 0.24
always = 1.00 Honda 6 0.12
Toyota 12 0.24
How to compute relative frequencies: Nissan 10 0.2
• Compute the total counts of all the bins. Other 5 0.1
• Divide the frequency (count) in each bin by the Total 50 1
total.
• Example: The total frequencies of all the bins = 24
• The “8 to 11” bin frequency = 7, so 7/24 = 0.292

frankieleon / Creative Commons 2.0


Slides by Carey Smith, with content from Kate Kozak, Statistics sing Technology, 2nd https://showmeinstitute.org/blog/regulation
edition (http://www.opentextbookstore.com/details.php?id=21), and from David /car-wars-ii-revenge-cronies
Straayer and elsewhere, as noted.
Licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. 1

Example Problem
Frequency and Relative Frequency
50 students were asked how many courses Number Frequency Relative
they were taking this semester. of Courses (Count) Frequency
The (incomplete) results are shown here.
1 13 0.3
Fill in the blank cells to complete the table.
Round the relative frequencies to the 2 16
nearest tenth. 3
Steps to solve this: 50 1.00
1. Enter the totals.
The problem states the total number of students = 50
The total of relative frequencies is always 1.00
2. Compute the missing frequency for students with 3 courses
13 + 16 + x = 50➔x = 50 – (13+16)=50-29 = 21
Number Frequency Relative
3. Compute the relative frequencies:
of Courses (Count) Frequency
13/50 = 0.26 Rounded to the nearest tenth = 0.3 1 13 0.3
16/50 = 0.32 Rounded to the nearest tenth = 0.3 2 16 0.3
21/50 = 0.42 Rounded to the nearest tenth = 0.4
3 21 0.4
2

1
9/1/2020

Bar Graphs (or Bar Charts)


Cars These can be made with
15 Excel. (See the next page.)
Frequency They can also be sketched by
10 hand on graph paper, which
may good enough to choose
5 the matching graph in a
homework or test problem.
0
Ford Chevy Honda Toyota Nissan Other

• There should be labels on the X & Y axes and a title for the graph.
• There should be a scaling on the frequency axis
• The categories should be listed on the category axis.
• Start at 0 unless you have a good reason not to!
You can also draw a bar graph using relative frequencies.
This is useful when you want to compare two samples with different sample sizes.
The relative frequency graph and the frequency graph should look the same,
except for the scaling on the frequency axis.
3

Pareto Charts

A Pareto chart is just a bar chart with the bars sorted with the
highest frequencies on the left.

Pareto Chart for Car Type


14
12
• Useful for quality control
10 – Example: Defects in manufacturing
Frequency

8 • Only for categorical


6 (qualitative) data
4
2
0
1 2 3 4 5 6
Car Type

2
9/1/2020

How to Make a Bar Graph with Excel


1. Enter the data in a column 2. Select the “Insert” menu.
& highlight the portion to plot. In the “Recommended Plots” box,
(Don’t include the total) click the down arrow for the 1st one
& choose “Column Chart” in the top box.
Relative
Frequenc y Frequenc y
Oil c hange 20 0.4000
Brak es 5 0.1000
Tires 8 0.1600
Belts 2 0.0400
Battery 6 0.1200
Alternator 3 0.0600
Starter 2 0.0400
Lights 4 0.0800
Total 50 1.0000
(The data can be copied from
homework problems, so you
don’t have to type each number.

Excel creates this chart


Use Design->“Add Chart Element”
for axis labels
It’s similar with Google sheets.

Pie Charts Show % of Each Category


• Pie charts are common
• Best when the value of each slice is shown on the chart
• Otherwise it is hard to extract numbers from them
• Hard to draw by hand accurately
Car Proportions

• Homework has multiple-


0.1 0.1
choice questions
• Select the correct pie chart.
0.2
0.24 •Start by matching the largest
and smallest percentages (i.e.,
relative frequencies)
0.24 0.12

1 2 3 4 5 6

Excel, with “Data labels”


Excel, Google Sheets, & StatCrunch can
create pie charts, but they will usually put
the slices in a different arrangement.

End of Section

3
9/1/2020

Pie chart Question Example


Kara categorized her spending for this month into
four categories:
• Rent $361
• Food $289
• Fun $217
• Other $433

What percent of her total spending did she spend


on Fun? (Answer to the nearest whole percent.)
Solution: MyOpenMath
• Compute the total = $1300 question id: 1053
• Proportion on Fun = $217/$1300 = 0.1669
• Convert to a percent: 100*0.1669 = 16.69
• Rounded to the nearest whole percent = 17%

4
9/1/2020

2.2 Organizing & Graphing Quantitative Data


Popular Displays
• Frequency distribution tables & histograms for
quantitative data
• Histograms are bar charts for quantitative data
• The data is grouped into classes
• The width of a bin (class) is selectable
• When doing a homework or test problem, choose
the starting value and bin (class) width that match
the problem.
• When creating your own histogram, choose a
convenient starting value and bin (class) width.
• Book’s example computes a starting value of 315. I would
choose 300 or 400 & change the number of classes &
starting value, if needed. Histogram Example
• Dot plots = a type of histogram

(Skip Ogive charts)

Frequency Distribution Table for Quantitative Data


3 Num. of Frequency
The class width is the Students (Count)
4
Example: Data difference between lower 0 to 3 1
5
limits of adjacent classes.
= Number of students in 5 4 to 7 7
In this frequency table, the
the Math Center at 24 5
class width = 4
8 to 11 7
6 12 to 15 5
random times Difference between 0 & 4 is 4
6 16 to 19 4
This list has been sorted. 7 Difference between 4 & 8 is 4
8
9
9 Method:
10 • Count the number of data values
10 (frequency) in each bin.
10 • Example: There are 7 data items in the
11 “8 to 11” bin.
12 • Optional: Sort the numbers first
13 • This can be done in Excel, if the data is
14 in a single column. Instruction are at the
15 end of these notes.
15 (If not given to you by the problem, choose
16 a convenient bin width & starting value
17 from the min & max. values.)
18
19

1
9/1/2020

Histogram for Discrete Quantitative Data


Number of Students in the Math Center
• A Histogram is like This histogram was made with StatCrunch
a bar chart for Graph -> Histogram. Select “var1”
quantitative data, Bins: Start at 0
but the x-axis is Width: 4
numbers. “Value above bar”
•The x-axis
starting point
may or may not
be 0.
•The y-axis
starting point
should always
be 0.
• In this example,
the starting value
is 0 and the class
width is 4

Frequency and Relative Frequency Tables


for Quantitative Data
The relative frequencies are the proportions in each bin.
Number of Students Relative Frequency of Students
8 0.350
7 7
7 0.292 0.292
0.300
6
5 0.250
5 0.208
4 0.200 0.167
4
0.150
3
2 0.100
1 0.042
1 0.050
0 0.000
0 to 3 4 to 7 8 to 11 12 to 15 16 to 19 0 to 3 4 to 7 8 to 11 12 to 15 16 to 19

How to compute relative frequencies:


• Compute the total counts of all the bins.
• Divide the frequency (count) in each bin by the total.
• Example: The total frequencies of all the bins = 24
• The “8 to 11” bin frequency = 7, so 7/24 = 0.292

2
9/1/2020

Histogram Using a Frequency Table


& Bar Chart in Excel
• Enter your frequency table in an Excel spreadsheet
• Highlight the bins and frequencies (counts)
• Then insert a bar chart. (See instructions in section 2.1)
• Put in an appropriate title Number of Students in the Math Center
8
Num. of Frequency
7
Students (Count)
6
0 to 3 1 5
4 to 7 7 4
8 to 11 7 3
12 to 15 5 2
1
16 to 19 4
0
1 2 3 4 5

Technically, the bars of a histogram should be touching


Excel can create a histogram from raw data, but it doesn’t let you set the starting value.
Google Sheets can create a histogram from raw data, but it doesn’t let you set the bin width
Instructions for both are at the end of these charts.

Histogram Using a Relative Frequency Table in Excel


•Enter your frequency table in an Excel spreadsheet
•Use a 3rd column for the relative frequencies
relative frequency = frequency / total
•Highlight the bins and relative frequencies & create a bar chart

Relative Frequency of Students


0.350
Num. of Frequency Relative 0.292 0.292
0.300
Students (Count) Frequency
0.250
0 to 3 1 0.042 0.208
0.200 0.167
4 to 7 7 0.292
0.150
8 to 11 7 0.292
12 to 15 5 0.208 0.100
0.042
16 to 19 4 0.167 0.050
total 24 0.000
1 2 3 4 5

Right click on any bar & click “Add Data Labels”


to print the values above all the bars.

3
9/1/2020

Histogram with Continuous Quantitative Data


40 Females’ White Blood Cell Counts
4.1
Continuous
6 data
8 include
10 decimals
13
4.4 6.3 8.1 10.2 13.5
4.6 6.4 8.6 10.3
4.7 6.6 8.9
4.9 6.7 9.1 StatCrunch
4.9 6.8 9.2 Graph->Histogram
5.1 7 9.6 Column: var1
5.3 7 9.8 Bins:
5.3 7.1 Start at: 0
5.3 7.5 Width: 2
5.4 7.9 Value above bar: check
5.4
5.6 The bin width is the increment
5.7 between starting values
5.7 Since the bin width =2, the bins are:
5.9 4.00 to 5.99
6.00 to 7.99
8.00 to 9.99
10.00 to 11.99
https://en.wikipedia.org/wiki/White_blood_cell
12.00 to 13.99 Public Domain

A “Dot Plot” is a Histogram


that Uses Dots Instead of Bars

4
9/1/2020

Common Distribution Shapes

or “Bell-Shaped” or “Normal Distribution”

Also: a Bi-Modal shape has 2 peaks End of Section 9

5
5/18/2020

2.3 Other Graphical Representations of Data


Scatter Plot in Excel

• Enter data in Excel rows or columns


• Insert -> Scatter
(or Line to connect the dots)

(Skip stem & leaf plots)

Time Series Plot


Rural Population % vs. Time
https://www.census.gov/population/censusdata/table-4.pdf

100
90
Rural %
80
70
Rural %

60
Excel:
50
Use “Scatter Plot” connected by lines
40
30
20 A time-series graph shows trends vs. time
10
0
1750 1800 1850 1900 1950 2000

How is this time-series plot better than just a table of numbers?

End of Section

1
5/18/2020

How to Sort Data in Excel & Google Sheets


To make a frequency table with quantitative data, it helps to sort the
data first.
1. Highlight the data and copy it into a spreadsheet (e.g. Excel or
Google Sheets).
• If the data is in more than 1 column, put all the data into a single column.
• Method: Highlight and copy the data in column 2 and paste it underneath the data in
column 1. Repeat for the remaining columns.
3. Highlight the spreadsheet column with the data.
4a. Excel: Click Data-> Sort.
• Since you do not need to expand the selection to include other
columns, choose "Continue with the current selection."
• Then click "Sort". Accept the default, which is smallest to largest.
4b. Google sheets:
• Since you do not need to expand the selection to include other
columns, Click Data-> Sort range (not sheet) by column A: A->Z
3

2
Evaluation

Direction. Below are set of data that you need to organize by constructing frequency distribution
table. In able to make your data presentable, you should make an appropriate graph for the given
data. You may aid your work with a Statistical Tool such as MS Excel.

1. The distribution below shows marks of a test of 50 students that was marked out of 50 marks.
Make a frequency distribution table using classes of width 5 starting from 10–14.

2. Use the frequency distribution table in number 1 to draw (a) frequency polygon, (b)
frequency polygon and (c) ogive graph.

3. Roller Coaster Mania. The World Roller Coaster Census Report lists the following number
of roller coasters on each continent. Represent the data graphically, using a Pareto chart and
a horizontal bar graph.
Africa 17
Asia 315
Australia 22
Europe 413
North America 643
South America 45
Source: www.rcdb.com

4. Top 10 Airlines During a recent year the top 10 airlines with the most aircraft are listed.
Represent these data with an appropriate graph.
American 714 Continental 364
United 603 Southwest 327
Delta 600 British Airways 268
Northwest 424 American Eagle 245

MC MATH 13: ELEMENTARY STATISTICS & PROBABILITY |PESCUELA s. 2021


U.S. Airways 384 Lufthansa (Ger.) 233
Source: Top 10 of Everything.
5. A group of statistics students carried out a survey in 72 major towns in a region. They wished
to establish how serious the spread of HIV and AIDS was. The table below shows the
number of people infected, giving their figures to the nearest 10 persons. Construct a stem
and leaf plot for the data.

6. Jason saves a portion of his salary from his part-time job in the hope of buying a used car.
He recorded the number of pesos he was able to save over the past 15 weeks.
Pesos saved:
950, 600, 450, 350, 850, 500, 300, 900, 450, 700, 400, 250, 850, 450, 950
Construct a box and whisker plot for this set of data.

References

Bluman, Allan G. Elementary Statistics: a step-by-step approach / Allan Bluman. —


8th ed.

MC MATH 13: ELEMENTARY STATISTICS & PROBABILITY |PESCUELA s. 2021

You might also like