Professional Documents
Culture Documents
IN THE CLASS OF
ECONOMICS
Organisation of Data
Organisation of data is the second
statistical tool under which data are
arranged in such a form that comparison
of masses of similar data may be
facilitated a further analysis may be
possible. The most popular way of
organization of data is classification of
data.
Meaning of Classification of Data
Classification is the of arranging data in various
groups or classes according to their characteristics.
There are two features of classification of data
1- Data are classified in various groups are classes.
2- The basis of classification of data is their
characteristics (resemblances and affinities).
Objectives of Classification:
a] To simplify complex data
b] To facilitate understanding
c] To facilitate comparison
2 BIHAR 12899.89
3 KARNATAKA 1834.78
4 PUNJAB 21788.20
1 1941 31.87
2 1951 36.11
3 1961 43.91
4 1971 54.82
5 1981 68.33
Qualitative Classification-
When data are classified on the basis of quality is known
as qualitative classification of data.
QUALITATIVE CLASSIFICATION
We may first divide the population to
male and female on the basis of the
attribute “ sex” each of this class may
be further subdivide into “literate and
‘illiterate’ on the basis of attribute
‘literacy’ further classification can be
made on the basis of same other
attribute , say , employment .
QUANTITATIVE CLASSIFICATION
When data are classified on the basis of quantity is known
as quantitative classification of data.
WEIGHT (Kg) NO. OF STUDENTS
40-50 60
50-60 50
60-70 28
70-80 20
80-90 12
90-100 170
Characteristics of a Good
Classification
1.Comprehensiveness
2.Clarity
3.Homogeneity
4.Suitability
5.Stability
6.Elastic
Concept of Variable
Types of Variable
A) Discrete Variable
Discrete variables are those variables that increase in jumps or
in complete numbers. (No fraction is possible)
Eg. Number of students in a class, Number of cars in a show
room etc. (1,2, 10,or 15 etc.)
B) Continuous Variables
Variables that assume a range of values or increase not in
jumps but continuously or in fractions are called continuous
variables.
Eg. Height of the boys –5’1’’ , 5’3’’ and so on, Marks in any range
0-10, 10-15, 15-20
Meaning of Discrete Variable
Which are measured in complete numbers like numbers
of students, teachers, office staff etc.
EMPLOYEE/STU NO
DENTS
STUDENTS 500
TEACHERS 32
OFFICE STAFF 6
D-GROUP EMPLOYEE 4
Meaning of Continuous Variable
Which are not measured in complete numbers always
like height in meter, weight in Kg etc.
Height in Cm No of Students
110-120 10
120-130 12
130-140 11
140-150 8
Types of Statistical Series-
Types of Series
1 10
2 15
3 18
4 20
5 22
Frequency Distribution
Series-
Types Frequency Distribution Series-
Value Frequency
10 4
11 6
12 6
13 4
14 3
Continuous Series-
It is that series in which item cannot be exactly
measured; they are placed in a class.
Class Frequency
Interval
0-10 10
10-20 15
20-30 20
30-40 18
40-50 15
50-60 9
Types of Continuous Series-
1- Exclusive Series
2- Inclusive Series
3-Open End Series
4- Mid Value Frequency Series
5- Cumulative frequency series
In constructing continuous series we come across terms
like:
a] Class : Each given internal is called a class e.g., 0-5, 5-10.
b] Class limit: There are two limits upper limit and lower
limit.
c] Class interval: Difference between upper limit and lower
limit.
d] Range: Difference between upper limit and lower limit.
e] Mid-point or Mid Value: Upper limit -Lower limit
2
f] Frequency: Number of items [observations] falling within
a particular class.
1 Exclusive Series
It such types of statistical series in which upper
limit of a class are the lower limit of just next class
Obtained No of
Marks Students
00-10 8
10-20 9
20-30 10
30-40 9
40-50 8
2 Inclusive Series
It is such types of statistical series in which all
frequencies of class are included in the same class.
C.I. Frequency
1-10 8
11-20 9
21-30 10
31-40 9
41-50 8
Conversion of Inclusive into
Exclusive Series
11-20 9 10.5-20.5 9
21-30 10 20.5-30.5 10
31-40 9 30.5-40.5 9
41-50 8 40.5-50.5 8
Open End Series
The lowest value of highest value of the
distribution are not defined.
00-10 8 10 8
10-20 9 20 8+9=17
20-30 10 30 17+10=27
30-40 9 40 27+9=36
40-50 8 50 36+8=44
More Than Cumulative Frequency Series
C.I Frequency More No of
Than Items
0 44
00-10 8
10 44-8=36
10-20 9
20 36-9=27
20-30 10
30 27-10=17
30-40 9
40 17-9=8
40-50 8
50 8-8=0
LOSS OF INFORMATION
•The frequency distribution summarizes the raw data by
making it concise and comprehensible. However, it does not
show the details that are found in your data and leads to loss
of information.
•When the raw data is grouped into classes, an individual
observation has no significance in further statistical
calculations.
•For example, suppose class 10-20 contains 6 values: 12,
15,16,18,14,19. When such data is grouped as a class 10-20,
study material and individual values have no significance and
only frequency, i.e.6 is recorded and not their actual values.
•Statistical calculations are based only on the values of the
class mark instead of the actual values. As a result, it leads to
considerable loss of information.
FREQUENCY ARRAY