Professional Documents
Culture Documents
11 - Stat - 2 - Collection of Data PDF
11 - Stat - 2 - Collection of Data PDF
2 Collection of Data
vary, they are called variable. The 2. WHAT ARE THE SOURCES OF DATA?
variables are generally represented by
Statistical data can be obtained from
the letters X, Y or Z. The values of
two sources. The enumerator (person
these variables are the observation.
who collects the data) may collect the
For example, suppose the food grain
data by conducting an enquiry or an
production in India varies between
investigation. Such data are called
100 million tonnes in 1970–71 to 220
Primary Data, as they are based on
million tonnes in 2001–02 as shown
first hand information. Suppose, you
in the following table. The years are
want to know about the popularity of
represented by variable X and the
a film star among school students. For
production of food grain in India (in
this, you will have to enquire from a
million tonnes) is represented by
large number of school students, by
variable Y:
asking questions from them to collect
TABLE 2.1 the desired information. The data you
Production of Food Grain in India get, is an example of primary data.
(Million Tonnes) If the data have been collected and
X Y processed (scrutinised and tabulated)
1970–71 108 by some other agency, they are called
1978–79 132 Secondary Data. Generally, the
1979–80 108 published data are secondary data.
1990–91 176 They can be obtained either from
1996–97 199 published sources or from any other
1997–98 194 source, for example, a web site. Thus,
2001–02 212 the data are primary to the source that
collects and processes them for the
Here, these values of the variables first time and secondary for all sources
X and Y are the ‘data’, from which we that later use such data. Use of
can obtain information about the secondary data saves time and cost.
trend of the production of food grains
For example, after collecting the data
in India. To know the fluctuations in
on the popularity of the film star
the output of food grains, we need the
among students, you publish a report.
‘data’ on the production of food grains
If somebody uses the data collected
in India. ‘Data’ is a tool, which helps
by you for a similar study, it becomes
in understanding problems by
secondary data.
providing information.
You must be wondering where do
3. HOW DO WE COLLECT THE DATA?
‘data’ come from and how do we collect
these? In the following sections we will Do you know how a manufacturer
discuss the types of data, method and decides about a product or how a
instruments of data collection and political party decides about a
sources of obtaining data. candidate? They conduct a survey by
COLLECTION OF DATA 1 1
Advantages Disadvantages
• Highest Response Rate • Most expensive
• Allows use of all types of questions • Possibility of influencing
• Better for using open-ended respondents
questions • More time taking.
• Allows clarification of ambiguous
questions.
Sample Survey
Population or the Universe in statistics
means totality of the items under
According to the Census 2001, study. Thus, the Population or the
population of India is 102.70 crore. It Universe is a group to which the
was 23.83 crore according to Census results of the study are intended to
1901. In a period of hundred years, apply. A population is always all the
the population of our country individuals/items who possess certain
increased by 78.87 crore. Census characteristics (or a set of characteris-
1 6 STATISTICS FOR ECONOMICS
tics), according to the purpose of the • Sample: Ten per cent of the
survey. The first task in selecting a agricultural labourers in Chura-
sample is to identify the population. chandpur district.
Once the population is identified, the Most of the surveys are sample
researcher selects a Representative surveys. These are preferred in
Sample, as it is difficult to study the statistics because of a number of
entire population. A sample refers to reasons. A sample can provide
a group or section of the population reasonably reliable and accurate
from which information is to be information at a lower cost and
obtained. A good sample (represen- shorter time. As samples are smaller
tative sample) is generally smaller than than population, more detailed
the population and is capable of information can be collected by
providing reasonably accurate conducting intensive enquiries. As we
information about the population at need a smaller team of enumerators,
a much lower cost and shorter time. it is easier to train them and supervise
Suppose you want to study the their work more effectively.
average income of people in a certain Now the question is how do you
region. According to the Census do the sampling? There are two main
method, you would be required to find types of sampling, random and non-
out the income of every individual in random. The following description will
the region, add them up and divide make their distinction clear.
by number of individuals to get the
average income of people in the region. Activities
This method would require huge • In which years will the next
expenditure, as a large number of Census be held in India and
enumerators have to be employed. China?
Alternatively, you select a represent- • If you have to study the opinion
ative sample, of a few individuals, from of students about the new
the region and find out their income. economics textbook of class XI,
what will be your population and
The average income of the selected
sample?
group of individuals is used as an
• If a researcher wants to estimate
estimate of average income of the the average yield of wheat in
individuals of the entire region. Punjab, what will be her/his
population and sample?
Example
• Research problem: To study the Random Sampling
economic condition of agricultural As the name suggests, random
labourers in Churachandpur district sampling is one where the individual
of Manipur. units from the population (samples)
• Population: All agricultural are selected at random. The
labourers in Churachandpur district. government wants to determine the
COLLECTION OF DATA 1 7
Recap
• Data is a tool which helps in reaching a sound conclusion on any
problem by providing information.
• Primary data is based on first hand information.
• Survey can be done by personal interviews, mailing questionnaires
and telephone interviews.
• Census covers every individual/unit belonging to the population.
• Sample is a smaller group selected from the population from which
the relevant information would be sought.
• In a random sampling, every individual is given an equal chance of
being selected for providing information.
• Sampling error arises due to the difference between the actual
population and the estimate.
• Non-sampling errors can arise in data acquisition, by non-response
or by bias in selection.
• Census of India and National Sample Survey Organisation
are two important agencies at the national level, which collect,
process and tabulate data.
EXERCISES