• After collection and editing of data, the first step
towards further processing the same is classification • Classification is the grouping of related facts into classes • Facts in one class differ from those of the other • Sorting facts based on one basis of classification and on another basis is called as cross classification • When students submit application in a college ,they submit application to the office • The application form contains information such as performance in previous examinations, date of birth, gender, nationality, etc., • If one is interested in finding how many 1st, 2nd and 3rd class students have joined the college, one may find out how by looking into every form and note the details • Then they may conclude like out of 1000 students, 50 had first class, 800 second class and 150 third class • Process with the help of which this information in a summary form is obtained is called data classification Objects of classification • To condense the mass of data in such a manner that similarities and dissimilarities can be readily apprehended • Millions of figures can thus be arranged in few classes having common features • To facilitate comparison • To pinpoint most significant features of data at a glance • To give prominence to important information while dropping out unnecessary elements • To enable a statistical treatment of the material collected Types of classification • Geographical • Countries • States • Cities • Regions • Zones • Areas Geographic classification Geographic region South North Northeast East Central Southwest Northwest Parti 13,128 13,938 11,989 13,823 20,991 8,388 15,801 cipants
Diabetes prevalen ce
1,461 2,185 1,972 2,852
n (%) 1,181 (9.9) 1,155 (13.8) 1,431 (9.1) (11.1) (15.7) (14.3) (13.6) Chronological classification Qualitative classification • Data are classified on the basis of some attribute or quality such as gender, colour of hair, literacy, religion etc., • In this type of classification, attribute cannot be measured • One can find whether it is present or absent • Attribute – population
urban rural
• When one attribute is studied, 2 classes are formed
• One possesses the attribute, other not possessing it • Simple classification- nominal data, dichotomous • Instead of forming only 2 classes, we further divide the data on the basis of some attributes, to form several classes, classification is manifold