DATA CLASSIFICATION AND TYPES Surya S 19pbo022 II M.Sc. Botany DATA Data is any information in raw or organized form using alphabets , numbers or symbols that refers to or represents preferences, ideas, objects, traits, categories etc. IMPORTANCE OF DATA AND DATA Data is one of theCOLLECTION most important and vital aspect of any research study. The methodology of researches conducted in different fields of study may be different but every research is based on data which is analyzed and interpreted to get information. Data is the basic unit in statistical studies. Statistical information like census, population variables, health statistics etc, are all developed from data. DATA COLLECTION Data collection is the process of gathering and measuring information on variables of interest, in an established systematic fashion that enables one to answer stated research questions, test hypotheses, and evaluate outcomes PRIMARY AND SECONDARY DATA Data sources are broadly classified into primary and secondary data. TYPES: PRIMARY DATA - collected by the researcher himself SECONDARY DATA - collected by others to be"re-used" by the researcher PRIMARY DATA Data that has been collected from first-hand-sources by surveys, observation or experimentation is known as primary data. Primary data has not been published yet and is derived from a new or original research study thus is more reliable, authentic and objective. Primary data has not been changed or altered by human beings, therefore its validity is greater than secondary data IMPORTANCE AND SOURCES OF PRIMARY DATA Research based on only secondary data is least reliable and may have biases because secondary data has already been manipulated by human beings. In statistical surveys it is necessary to get information from primary sources and work on primary data. Sources are limited and at times it becomes difficult to obtain data from primary source either because of scarcity of population or lack of cooperation. SOURCES OF PRIMARY DATA COLLECTION Observation method Experiment Interviews Questionnaire Schedule Through agencies Survey Observation method Observation method is a method under which data from the field is collected with the help of observation by the observer or by personally going to the field. Experiments Experiments require an artificial or natural setting to perform logical study to collect data. These are more suitable for medicine, psychological studies, nutrition etc. In experiments the experimenter has to keep control over the influence of any extraneous variable on the results. Interview This method of collecting data involves presentation of oral-verbal stimuli and reply in terms of oral- verbal responses. Interviewer asks questions (which are aimed to get information required for study) to respondent. Direct Personal Interview – Data is personally collected by the interviewer. Telephonic interviews - Data is collected through an interview over the telephone with the interviewer Questionnaire It is the most commonly used method. Questionnaires are a list of questions for which the respondent give answers. It can be conducted via telephone, mail, live in a public area, or in an institute. Mailed questionnaire – Data is collected through questionnaire mailed to the informant. Questionnaire filled by enumerators – Data is collected by trained enumerators who fill questionnaires. Schedule Very similar to Questionnaire method main difference is that a schedule is filled by the enumerator who is specially appointed for the purpose. Enumerator goes to the respondents, asks them the questions from the Questionnaire in the order listed, and records the responses. Enumerator must be trained in administering the schedule. Information through agencies The local agents or correspondents will be appointed, they collect the information and transmit it to the office or person. This system is adopted by newspapers, agencies, etc., when information is needed in different fields.The informants are generally called correspondents. Survey A detailed study of geographical area to gather data attitudes, impressions, opinions, satisfaction level etc., by polling a section of the population. Survey is most commonly used method in social sciences, management, marketing and psychology to some extent. Surveys can be conducted in different methods. TYPES Census Survey 2.Continuous or Regular Survey Advantages Data interpretation is better Addresses Specific Research Issues Greater Control Disadvantages High Cost Time Consuming More number of resources is required SECONDARY DATA Data collected from a source that has already been published in any form is called as secondary data. Secondary data is not collected by the person who is doing research. The review of literature in any research is based on secondary data mostly from books, journals and periodicals. IMPORTANCE AND SOURCES OF SECONDARY DATA Sometimes it is difficult to obtain primary data; in these cases getting information from secondary sources is easier and possible. Sometimes primary data does not exist in such situation one has to confine the research on secondary data. Sometimes primary data is present but the respondents are not willing to reveal it in such case too secondary data can suffice. After the expense of electronic media and internet the availability of secondary data has become much easier. SOURCES OF SECONDARY DATA COLLECTION Published printed source Books Journals/periodicals Magazines/Newspapers Published Electronic Sources e-journals General websites Weblogs Unpublished Personal Records Diaries Letters Government Records Census Data/population statistics Health records Educational institutes records Public Sector Records NGOs's survey data Other private companies records Published Printed Sources There are varieties of published printed sources. Their credibility depends on many factors. For ex:, on the writer, publishing company and time and date when published. # Books – Books are secondary source but most authentic one in secondary sources. # Journals and periodicals – journals provide up-to-date information which at times books cannot and It can give infomation on the very specific topic rather than about more general topics. # Magazines/Newspapers - Magazines are effective but not very reliable. Newspaper is more reliable and in some cases the information can only be obtained from newspapers as in the case of some political studies. Published Electronic Sources # E-journals - e-journals are more commonly available than printed journals. # General Websites - informations are not very reliable, content should be checked for the reliability before quoting from them. # Weblogs - they are actually diaries written by different people. These diaries are as reliable to use as personal written diaries. Unpublished Personal Records Some unpublished data may also be useful in some cases. # Diaries – Diaries are personal records and are rarely available but they might be very useful for a descriptive research. # Letters - Letters like diaries are also a rich source but should be checked for their reliability before using them. Government Records # Government records are very important for marketing, management, humanities and social science research. Advantages Ease of Access Low Cost to Acquire Clarification of Research Question Disadvantages Quality of Research Not Specific to Researcher's Needs Not Timely CLASSIFICATION OF DATA There are four types of data classification. They are Geographical classification Chronological classification Qualitative classification Quantitative classification Geographical classification When data are classified on the basis of location or areas, it is called geographical classification Example: Classification of production of food grains in different states in India. Chronological classification Chronological classification means classification on the basis of time, like months, years etc. Example: Profits of a company from 2015 to 2020. Quantitative classification Refers to the classification of data according to some characteristics, which can be measured such as height, weight, income, profits etc. Qualitative classification data are classified on the basis of some attributes or quality such as colour of hair, literacy etc., In this the attribute under study cannot be measured. It can only be found out whether it is present or absent.