Professional Documents
Culture Documents
of Data
• A statistical enquiry has four phases, viz.,
– Collection of data;
– Classification and tabulation of data;
– Analysis of data;
– Interpretation of data.
Classification and tabulation of data
• The collected data are always in un organized form and need to be
organized and presented in meaningful and comprehensible form
in order to facilitate further statistical analysis.
• The process of grouping into different classes or sub classes
according to some characteristics is known as classification.
• Tabulation is the systematic arrangement and presentation of
classified data
Sex
Manifold Classification
• The classification, where two or more attributes are
considered
– Classification of population simultaneously with respect to two attributes, e.g
sex and employment, then Population are classified into four classes namely:
Male employed , Male , unemployed, Female employed, Female unemployed
Quantitative classification
• Quantitative classification refers to the classification of data
according to some characteristics that can be measured in
numerical size
– Height, weight etc
• Here we classify the data by assigning arbitrary limits known as
class-limits.
• The quantitative phenomenon under study is called a variable.
– Population of the whole country may be classified according to
different variables like age, income, wage, price
• Hence this classification is often called ‘classification by
variables’.
Quantitative classification
We find that the lowest value is 56 and the highest value is 73. Thus for
approximately 10 classes, the difference of values between two
consecutive classes will be
Grouped Frequency Distribution
X 14 20 33 25 41 18 24 29 38 45
Y 148 242 296 312 518 196 214 340 492 568
X 23 32 37 19 28 34 38 29 44 40
Y 282 400 288 292 431 440 500 512 415 514
X 22 39 43 44 12 27 39 38 17 17
Y 282 481 516 598 122 200 451 387 245 245
Construct the bi-variate distribution table for the above data and assume class
interval of X as 10-20, 20-30 etc and for Y take class interval as 100-200, 200-300
TABULATION
DEFINITION
According to Tuttle, “A statistical table is the logical listing
of related quantitative data in vertical columns and
horizontal rows of numbers, with sufficient explanatory
and qualifying words, phrases and statements in the form
of titles, heading and footnotes to make clear the full
meaning of the data and their origin”
OBJECTIVES OF TABULATION
1. To simplify the complex data
2. To economize space
3. To facilitate comparison
5. To save time
6. To depict trend
7. To help reference
Components Of Table
1. Table number
4. Stub
5. Body / Field
6. Head note
7. Foot note
8. Source data
Stub Caption Total
headings (rows)
Subhead Subhead
Stub
Entries
Total
(columns)
Foot note :
Source note:
REQUIREMENTS OF GOOD
STATISTICAL TABLES
1. Suit the purpose
2. Scientifically prepared
3. Clarity
4. Manageable size
5. Columns and rows should be numbered
6. Suitably approximated
7. Attractive get-up
8. Units
9. Average and totals
10.Logical arrangement of items
11.Proper lettering
Types of tables
1. Simple and Complex tables.
3. Economic space
5. It helps in references
3. Lack of flexibility
Practical Exercise 4
In certain data, the following four main characteristics
with their sub-characteristics are present
Main Characteristics Sub Characteristics
1. Locality Urban / Rural
2. Religion Hindus / Muslims/Christians / Sikhs
3. Sex Males / Females
4. Age 0-30 / 30-60 / above 60