Professional Documents
Culture Documents
The more information the researcher has about the phenomenon the better
Just as a building needs bricks and mortar for its construction, similarly, the
The more valid is the sources of information, the more reliable will be the
information received, which in turn, will lead to correct and reliable conclusion.
According to W. A. Bagley, the sources of information in the field of social science may
be classified into:
i. Primary source: Include the actual information received from individuals directly
The information obtained from primary sources is often called as primary data.
Are those information gathered by researcher himself and are gathered for the first time,
ii. Secondary sources of information: These sources of information are called “secondary
Are those which have been collected by someone else and which have already been passed through
The methods of collecting primary and secondary data differ since primary
data are to be originally collected, while in case of secondary data the nature of
experimental research.
i. Observation method
ii. Interview
iii. Questionnaires
behavioral science.
Under this method the researcher should personally and directly observe the condition
• Direct observation is the most reliable method for gathering information related to the
Participant observation: The researcher lives in the group or in the community as a member
of it and participates in their life.
Non-participant observation: The researcher does not participate in the group life but
observe as an external spectator.
Under such approach the presence of the researcher is unknown to the people.
B. Control Vs. uncontrolled observation
Uncontrolled observation: Is the observation, which takes place in natural setting. Here no
attempt is made to use precision instruments.
Controlled observation: Observation takes place according to definite pre-arranged plans,
including experimental procedure and generally done in laboratory under controlled
condition.
Structured Vs. Unstructured
• Such observation is very much flexible and commonly used in exploratory research
2. Interview Method.
to respondent.
to questioning.
B. Telephone Interview
itself
Prerequisites of interview
Interviewer should be honest, sincere, hardworking, impartial and must posses the
Occasional field checks should be also made in advance so that appropriate action must
be taken if some of the selected respondent refuse to cooperate or are not available when
Interviewer has to try to crate friendly atmosphere of trust and confidence so that the
respondent may feel at ease while talking and discussing with the interviewer.
interview
The objective is to find out what a selected group of respondents do, think or
fell.
This method of data collection is used when the subject of study is very wide
It is also used for such things, which cannot be known through direct
• Purpose: to identify issues and themes, not just interesting information, and
not “counts”
Phase Action
2 Warm-up Relate experience; stimulate group interaction; start with least threatening
and simplest questions
3 Main Move to more threatening or sensitive and complex questions; elicit deep
body responses; connect emergent data to complex, broad participation
4 Closure End with closure-type questions; summarize and refine; present theories,
etc; invite final comments or insights; thank participants
17
IPDET © 2009
COLLECTION OF SECONDARY DATA
e. reports and publications of various associations connected with business and industry,
banks, stock exchanges, etc.
g. public records and statistics, historical documents, and other sources of published
information.
6.3. Sampling Design
only a few items but representative of the total technically called a ‘sample’
i. Type of universe: The first step in developing any sample design is to clearly
ii. Sampling unit: can be a geographical unit such as state, district, village, etc., or a
construction unit such as house, flat, etc., or a social unit such as family, club,
school, etc.
iii. Source list: It is also known as ‘sampling frame’ from which sample is to be
iv. Size of sample: This refers to the number of items to be selected from the
Vii. Budgetary constraint: Cost considerations have a major impact upon decisions
relating to not only the size of the sample but also to the type of sample.
vii. Sampling procedure: Finally, finally researcher must decide about the
technique to be used in selecting the items for the sample.
viii. Sampling errors: are the random variations in the sample estimates around the
true population parameters.
Sampling error decreases with the increase in the size of the sample, and it happens to be of a
Sample designs are basically of two types viz., non-probability sampling and
probability sampling.
Representation
Element selection Probability sampling Non-Probability Sampling
Unrestricted Sampling Simple random sampling Convenience sampling
Restricted Sampling Complex random (such as cluster Purposive sampling (such as
sampling, systematic sampling, quota sampling, judgment
stratified sampling, etc.) sampling,
In such a design, personal element has a great chance of entering into the selection
of the sample. However, in such a sampling, there is no assurance that every element
has some specifiable chance of being included.
i. Judgmental sampling-in this type sampling units or elements in the population are
purposely selected
ii. Convenience sampling- based on the convenience of the statisticians who is select
are included in it merely on account of their being available on the spot where the
survey is in progress.
• Under quota sampling the interviewers are simply given quotas to be filled from
the different strata, with some restrictions on how they are to be filled.
2. Probability sampling: 'random sampling’ or ‘chance sampling’. Under this
sampling design, every item of the universe has an equal chance of inclusion in
the sample.
There are number of probability sampling some of them are discussed bellow
v. Multi-stage Sampling
i. Simple Random Sampling
Each element in the population has an equal chance of being included in the
Each element in the sample frame is assigned a number and each number is
If say the sample size is 45, then the selection procedure is repeated 45 times.
the sample interval k(N/n)), where N is total number of units in the population
iv. Stratified sampling –Under this technique population is divided into several
sub-populations that are individually more homogeneous than the total population
and then simple random technique is used to select sample within each strata.
o How to form strata? on the basis of common characteristic(s) of the items in such a
way as to ensure elements being most homogeneous within each stratum and most
sampling
o How many items be selected from each stratum or how to allocate the sample size of
𝑛
• Example: Proportionate stratified sample size determination for 4 strata:– 𝑁1 =
1
𝑛2 𝑛3 𝑛 𝑛
= = 𝑁4 = 𝑁
𝑁2 𝑁3 4
• In case of disproportionate sample is that the sample size should be more in stratum
v. Cluster sampling
In cluster sampling instead of selecting individual units from the population, entire
If the total area of interest happens to be a big one, a convenient way in which a
sample can be taken is to divide the area into a number of smaller non-overlapping
areas and then to randomly select a number of these smaller areas (usually called
clusters).
vii. Multistage sampling –involves the selection of units in more than one stage.
Example suppose a sample of 5000 urban households from all over the country is
to be selected . In first stage we select districts, second we may select cities form
each district, sub cities/kebels from each cities and then selection of households.
6.4. sample size and its determination
• In sampling analysis the most ticklish question what should be the size of the
sample or how large or small should be „n’? If the sample size (‘n’) is too
small, it may not serve to achieve the objectives and if it is too large, we may
• A researcher is worried about sample size because of the fact that sample size
(number of elements in sample) and precision of the study are directly related.
statistical knowledge.
• There are a number of sample size determination methods.
researcher in some cases can be used as a base to determine the size of the sample.
• Under this approach the sample size is determined by the available fund for the
proposed study.
• E.g., if cost of surveying of one individual or unit is 30 birr and if the total
available fund for survey is say 1800 birr , the sample size then will be
determined as,
• Sample size (n) = total budget of survey /Cost of unit survey, accordingly, the
have information about the estimated variance of the population, the magnitude of
population parameters
1500 and 24000 birr, using the rule of the thumb the standard deviation will
indicates how precise the estimate must be. For best estimate, the range of error has
to be small. Small range of error can be obtained if the sample size is large.
The researcher makes subjective judgment about the desired magnitude of error.
confident a researcher can be about being correct. In most case (research) 95%
o That is, it is assumed that 95 times out of 100 the estimate from sample will
Once the above concepts are understood and determined the size of sample is quite simple.
i) For mean n = (ZS/E)2 and Where E = Magnitude of acceptable error , Z = Confidence level, S=
1. standard deviation = 18
3. range of error to be 3
• Note: for 95% confidence level ( 0.95/2 = 0.0475), the Z = 1.96 (from normal tables)
population parameter.
Suppose an individual student has scored the following grades in 10 subjects (Consider
these subjects as population); 55, 60, 65, 90, 55, 75, 88, 45, 85, 82.
Say, a sample of four grades 55, 65, 82, and 90 are selected at random from this
population to estimate the average grade of this student. The mean of this sample is 73.
However, the variation due to random fluctuation (sampling error) decreases as the
sample size increases though it is not possible to completely avoid sampling error.
Systematic Error (non-sampling error)
• Systematic sampling is also called sampling bias and such error can be created from errors in the sampling
procedure, and it cannot be reduced or eliminate by increasing the sample size.
• Such error occurs because of human mistakes and not chance variation.
• The possible factors that contribute to the creation of such error include
1. Inappropriate sampling: If the sample units are a misrepresentation of the population; it will result in
sample bias.
It occurs when there is a failure of all units in the population to have some probability of being selected for
the sample.
2. Accessibility bias: When all members of the population are not equally accessible, the researcher must
provide some mechanism of controlling in order to ensure the absence of over and under-representation of some
respondents.
3. Non-response bias: This is an incomplete coverage of sample or inability to get complete response from all
individuals initially included in the sample.
This is due to the failure in locating some of the individuals of the sample element or due to their refusal to
respond.
In some cases, respondents may intentionally give false information in response to some sensitive question.
For instance, people may not tell the truth of their bad habit and income.
Total error = sampling error + Non-sampling Error
Total error is usually measured as total error variance, also known as mean square (MSE) and (TE) 2
= (SE) 2 + (NE) 2.
6.5. Central Limit Theorem and standard error
The sampling distribution of a mean of sample taken from a normal population shows two
important properties. First, the sampling distribution has a mean that is equal to
Second, the sampling distribution has a standard deviation that is equal to population
𝜎
standard deviation divided by the square root of the sample size (𝜎𝑥 = √𝑛).
The central limit theorem states that as sample size increases, the sample distribution of
the mean approaches normal distribution regardless of the distribution of population from
𝜎
deviation is 𝜎𝑥 =
√𝑛
The significance of the central limit theorem lies in that it permits us to use
But when sampling is not from a normal population, the size of the sample
When n is small, the shape of the distribution will depend largely on the shape
of the parent population, but as n gets large (n > 30), the shape of the sampling
distribution will become more and more like a normal distribution, irrespective
The standard error gives an idea about the reliability and precision of a sample.
The (S.E) helps in testing whether the difference between observed and expected frequencies could arise due
to chance.
The smaller the S.E., the greater the uniformity of sampling distribution and hence,
• Conversely, the greater the S.E., the greater the difference between observed and expected
frequencies.
• The standard error enables us to specify the limits within which the parameters of the
the actual value will fall within the stated precision limits.
Thus, if we take a confidence level of 95%, then we mean that there are 95
chances in 100 (or .95 in 1) that the sample results represent the true