Professional Documents
Culture Documents
SUMMARIZING
AND
ORGANIZING
DATA
Data Cleaning
Summarization and presentation
Data Analysis
Data Handling
Data Collection
Data Storage
Data Protection
Data Retention
Data Analysis
Data Sharing
Data Reporting
4
5
WHAT SENSE CAN BE MADE OUT OF
THIS DATA?
Tables
Tallies
Stem and leaf
Grouping
displays data
Frequency
distributions
Summarizing data
Measures of dispersion
Box-and-whiskers plots
Methods
9 for Organizing Data
Displaying data
Tables
Histograms, diagrams
bar
Pie charts
Scatterplots
Graphs
10 Ordering Data
Example: ages of graduate students (n=10)
Suppose the unordered data were:
35, 40, 52, 27, 31, 42, 43, 28, 50, 35
Data could be ordered by hand:
27, 28, 31, 35, 35, 40, 42, 43, 50, 52
Ordering data by hand can be tedious, especially
when there is a large number of observations
Advantage
Provide information regarding the frequency of
observations in groups or categories
Disadvantage
The actual values of observations within groups are
not
retained
12 Stem-and-Leaf Displays
City Population
Rawalpindi 100,000
Islamabad 80,000
15Lahore 120,000
Simple table
FREQUENCY DISTRIBUTION
Value Frequency
140 1
141 4
142 6
143 8
144 7
18
145 3
146 1
Total 30
ANOTHER EXAMPLE
68 63 42 27 30 36 43
28 32 79 27 22 23 49
24 25 44 65 43 25 12
74 51 36 42 28 31
28 25 45 12 57 51
12 32 49 38 42 27
19 31 50 38 21 16 24
69 47 23 22 43 27
49 28 23 19 46 30
STEPS
Data Arrangement
o Arrange data into array of observation from smallest to largest
o Range of values divided into ordered classes (CI) & number of
observation into frequencies (f)
Class Intervals
o It is division of range into number of arbitrary but usually equal
& non overlapping segments
20
STEPS
10 – 19 5
20 – 29 19
30 – 39 10
40 – 49 13
50 – 59 4
60 – 69
22 4
70 – 79 2
Total 57
STEPS
Interval Width
o Number of units between upper & lower limits (Real??)
o Two types of limits:
Class limits/apparent limits (10 – 19, 20 – 29)
Class Boundaries/Real limits : 9.5 – 19.4, 19.5 – 29.4
o Class Boundaries/Real limits
Determine point at which lowest class interval should
begin
Can include value not included in data
23
Do not leave intervals with zero observations
Record total observation to frequencies column
ANOTHER TERMINOLOGY
Class Mark
o Mid point
o Denoted by X
UL+LL/2
24
MERITS OF FREQUENCY DISTRIBUTION
It shows at glance how many individual observation are in a
group & where is main concentration
Shows range & shape of distribution
It showed proportion of population or sample with certain
characteristics
25
DEMERITS
Simple Table
Relative Frequency
It is fraction of items (out of total number) which is belonging
to that class
Divide absolute frequency of that class by total number of
observations
Can be used for comparing two different frequency
distributions for two or more groups of individuals
The sum of all the relative frequencies will always be ….. ?
RELATIVE FREQUENCY
29
60-69 4 0.07 (4/57)
57 0.998
OTHER FEATURES OF FREQUENCY
DISTRIBUTION
Percentage Frequency
It is proportion of number of items (out of total number) which is
belonging to that class (% age relative to total cases)
Obtained by multiplying relative frequency with 100
The sum of all the percentage frequencies is always 100
Also shows 1/3rd of cases or 1/4th of cases lie in which group
PERCENTAGE DISTRIBUTION
Cumulative Frequency
It is obtained by adding up frequencies of succeeding classes
First frequency will be same and next will be obtained by
adding up next class
It can be tabulated for absolute frequency, relative frequency &
percentage frequency
The sum of all cumulative frequencies will be equal to total
number i.e. absolute frequency total
CUMULATIVE FREQUENCY
Cum %
Cum Cum Rel Percentage age Freq
Class Int Freq Rel Freq
Freq Freq Freq
THANK YOU