Professional Documents
Culture Documents
Descriptive Statistics
Part I
Learning Objective
Data
We would like to study the
gender distribution
Graph
Table
INFOGRAPHICS
Raw Data
Data in the sequence in which they are collected and before they are processed
or ranked.
Arrays
An arrangement of numerical raw data in ascending order or descending
order of magnitude.
Ungrouped data
Contains information on each member of a sample or population
individually.
Grouped data
Data presented in classes or intervals
Example 2.1
This is a ________
Time 0-30 31-60 61-90 >=91
Frequency 1 1 2 1
Tabulating and Graphing
Frequency Distribution
Relative Frequency and Percentage Distributions
Graphical Presentation
1. Frequency Distribution for
Qualitative Data
Course Frequency
Biotech 8
Business 6
Engineering 4
Infotech 3
Others 4
Total 25
What do you think can be done to this table to make it a little more informative?
2. Relative Frequency and
Percentage Distributions
What do you think can be done to this table to make it a little more informative?
Graphical Presentation of Qualitative Data
2. Pie Chart
A circle divided into portions that represent the relative
frequencies or percentages of a population or a sample belonging
to different categories.
Starts at 12 o’clock
Moves clock wise from largest sector to the smallest.
Example 2.3
HANDS ON SESSION ONE
1. Tabulation
Single-Valued Classes
Frequency Distribution for quantitative data
Relative frequency and percentage distributions
Cumulative frequency distribution
Organizing and Graphing Quantitative
Data
2. Graphing
Histogram
Polygon
Ogive / Cumulative frequency curve
Stem-and-leaf displays
1. Single-Valued Classes
5 1 1 2 0 1 1 2 1 1
1 3 3 0 2 5 1 2 3 4
2 1 2 2 1 2 2 1 1 1
4 2 1 1 2 1 1 4 1 3
Frequency Distribution for vehicles owned by
households
5 2
Grand Total 40
2. Frequency Distribution for quantitative
data (continuous)
Class
› An interval that includes all the values that falls within two numbers,
the lower and upper limits.
Class limits
› Endpoints of each interval.
Class Boundary
› The dividing line between two classes. It is given by the midpoint of
the upper limit of one class and the lower limit of the next higher class.
Frequency Distribution for
quantitative data (cont'd)
Fourth class = 30-39 Lower boundary of 3rd class = (19 +20) = 19.5
2
Lower limit of 3rd class = 20 Upper boundary of 3rdclass = (29 +30) = 29.5
2
Upper limit of 3rd class = 29 19.5-12.5 = 7 (13 + 19) / 2 = 16
Constructing frequency distribution tables
Determine the lower limit of the first class or the starting point.
Any convenient number that is equal to or less than the smallest value in the data set
can be used as the lower limit of the first class.
Constructing frequency
distribution tables (cont'd)
Solution:
Part 1- Tabulation
1.Download the score data from WBLE
2.Create a frequency table for the data.
Histogram
14
12
Frequency
10
8
Frequency
6
4
2
0
90 0 0 0 0 0 e
1 0 1 1 1 2 1 3 1 4 or
M
Bin
Polygon
Polygon is a line graph formed by joining the midpoints of the tops of successive bars
in a histogram.
Next, we mark two more classes (with zero frequencies), one at each end, and mark the
midpoints.
Three types of polygon:
Frequency polygon
Relative frequency polygon
Percentage polygon
Histogram and Polygon (cont'd)
Example 2.6
Reconsider the data in Example 2.4 and draw
i) the frequency histogram and frequency polygon
ii) the relative frequency histogram and relative frequency polygon
iii) the percentage histogram and percentage polygon
Histogram and Polygon (cont'd)
Solution:
14
25
12
20
10
Frequency
Percent
15
6
10
4
5
2
0 0
80 90 100 110 120 130 140 150 80 90 100 110 120 130 140 150
Birth-weights (oz) Birth-weights (oz)
Histogram and Polygon (cont'd)
Solution:
14 0,3
12
Relative Frequency
0,25
10
0,2
Frequency
8
0,15
6
0,1
4
2 0,05
0 0
74,5 84,5 94,5 105 115 125 135 145 155 74,5 84,5 94,5 105 115 125 135 145 155
Birth-weights (oz) Birth-weights (oz)
Histogram and Polygon (cont'd)
12 – 17 6 10 5
18 – 20 3 3 3
21 – 29 9 12 4
Cumulative frequency distribution
A table that presents the total number of values that fall below
the upper boundary of each class.
It is constructed for quantitative data only.
2. If relative cumulative frequency is used in place of cumulative frequency, the graph is called
relative cumulative frequency curve or percentage ogive.
Cumulative frequency curve /
Ogive (cont'd)
Example 2.10
Draw an ogive for the data in Example 2.4. Estimate from the
ogive, determine
Ogive
Cumulative Frequency
60
50
40
30
20
10
0
79,5 89,5 99,5 109,5 119,5 129,5 139,5 149,5
Birth-weights (oz)
Cumulative frequency curve /
Ogive (cont'd)
Solution:
a) From the graph, there are 8 deliveries with birth-weights less
than 95 oz.
b) If 20% were X or more then 80% were less than X, 80%
50=40
From the graph, X= 130.5
Stem-and-leaf displays
75 52 80 96 65 79 71 87 93 95
69 72 81 61 76 86 79 68 50 92
83 84 77 64 71 87 72 92 57 98
Key 5|0 =50
Explore and Learn
1.You can use the score data for
this exercise.
2.Create a Histogram for the data.
HANDS ON 3.Explore excel for various type
graphs that can be used to
SESSION visualize continuous data.
4.Assume that score >30
THREE indicates not satisfied. How can
you show the rate of satisfied vs
unsatisfied customers among
females and males?
The End
Some Excel Instruction…
Frequency Distribution for Qualitative Data (Excel)
Frequency Distribution for
Qualitative Data (Excel)
Removing the space between bars to form a histogram
Example 2.8 (Excel)
Knowledge enables you to make lemonade!