Professional Documents
Culture Documents
Chapter V
Sampling Design
1
The Concept of Sampling
2
Content of the lecture
Sampling Design
Census and sample survey
The Need for sampling
Steps in sampling Design
Criteria for selecting a sampling procedures
Types of sample Design
Probability sampling Design
Non probability sampling Design
3
What is Sampling?
Some studies involve only small number of people and thus all
of them can be included.
But when the population is large, it is usually not possible
to undertake a census.
4
What is Sampling?
The best sample is a representative sample, or is a model of
the population.
5
What is Sampling?
6
Reasons for Sampling
Reduced cost: since data are secured from a small fraction of
the population, cost will be reduced.
7
Reasons for Sampling
Feasibility: some investigations could only be addressed by
sample surveys: for example
8
Reasons for Sampling
Representativeness
There are two principles in representativeness:
The need to avoid bias and the need to gain maximum
precision.
9
Reasons for Sampling
Representativeness is important particularly if you want to
make generalization about the population.
10
Steps in Sampling Design
The sampling Process
Representative samples are generally obtained by
following a set of well-defined procedures:
1. Defining the target population
2. Choosing the sampling frame
3. Selecting the sampling method
4. Determining the sample size
5. Implementing the sampling plan
11
Steps in Sampling Design
a) Identifying the relevant population:
Determine the relevant population from which the sample
is going to be drawn.
Example: if the study concerns income, then the definition
of the population as individuals or households can make a
difference.
12
Steps in Sampling Design
c) Securing a sampling frame:
A list of elements from which the sample is actually drawn
is important and necessary.
13
Determining the sample size
One of the questions researchers tend to ask is ‘how many
people should I speak to?’
This is not an easy question.
14
Determining the sample size
This obviously depends on the type of research.
The sample size will also depend on what you want to do with
your results.
16
Determining the sample size
The general rule in quantitative research is that the larger the
sample the better it is.
17
Determining the sample size
Also, you have to account for non-responses and you may
need to choose a larger sample size to overcome this
problem.
18
Determining the sample size
In general, the sample size depends on several factors.
i) Degree of homogeneity: The size of the population variance is the
most important parameter.
The greater the dispersion in the population the larger the
sample must be to provide a given estimation precession.
iv) Cost: All studies have some budgetary constraint and hence
cost dictates the size of the sample.
20
Determining the sample size
v) Practicality: Of course the sample size you select must make
sense.
Therefore the sample size is usually a compromise between
what is DESIRABLE and what is FEASIBLE.
FEASIBLE
21
Determining the sample size
VI) Other Considerations:
22
Determining the sample size
(VII) Rule of Thumb: is based on past experience with samples
that have met the requirements of the statistical methods.
For small populations (under 1000 a large sampling ratio
(about 30%). Hence, a sample size of about 300 is required.
23
Determining the sample size
(VIII) Using Cochran’s Formula: You need to determine a few
things about the sample you need.
24
Determining the sample size
Confidence Level — How confident do you want to be that the
actual mean falls within your confidence interval?
The most common confidence intervals are 90% confident,
95% confident, and 99% confident.
The safe decision is to use 0.5 – this is the most commonly used
number and ensures that your sample will be large enough.
25
Determining the sample size
Cochran (1963) has developed the following formula to
determined sample size - infinite population when we are to
estimate the proportion in the universe.
26
Determining the sample size
Example 1: What should be the size of the sample if a simple random sample
from a population of 4000 items is to be drawn to estimate the per cent
defective within 2 per cent of the true value with 95.5 per cent
probability? What would be the size of the sample if the population is
assumed to be infinite in the given case?
Solution: In the given question we have the following:
N = 4000;
e = .02 (since the estimate should be within 2% of true value);
z = 2.005 (as per table of area under normal curve for the given confidence
level of 95.5%).
As we have not been given the p value being the proportion of defectives in
the universe, let us assume it to be p = .02 (This may be on the basis of our
experience or on the basis of past data or may be the result of a pilot study).
27
Determining the sample size
28
Determining the sample size
Example 2: Suppose a certain hotel management is interested in
determining the percentage of the hotel’s guests who stay for more than
3 days. The reservation manager wants to be 95 per cent confident that
the percentage has been estimated to be within ± 3% of the true value.
What is the most conservative sample size needed for this problem?
Solution: We have been given the following: Population is infinite;
e = .03 (since the estimate should be within 3% of the true value);
z = 1.96 (as per table of area under normal curve for the given
confidence level of 95%). As we want the most conservative sample size
we shall take the value of p = .5 and q = .5. Using all this information, we
can determine the sample size for the given problem as under:
29
Determining the sample size
Sample Size in Qualitative Studies
Sample size is not big issue in qualitative research since the
emphasis is obtaining new information to have deep
understanding of a phenomenon.
i.e., no fixed rules for sample size in qualitative research.
The sample size depends on WHAT you try to find out, and
from what different informants or perspectives you try to
find that out.
the sample size is therefore estimated, but not
determined.
30
Types of sampling
31
Probability and non-probability sampling
We can arrange the different approaches to the sampling
process in a spectrum.
32
Probability and non-probability sampling
At the other end of the spectrum are approaches to sampling
that are concerned with selecting cases that will enable the
researcher to explore the research questions in depth,
33
Probability sampling
Probability sampling is based on the concept of random
selection that assures that each population element is given a
known non zero chance of being selected.
35
Probability sampling
In summary:
probability samples are more representative than any other
type of sample.
36
Probability sampling
Several types of probability sampling methods could be
identified:
Simple Random Sampling Technique
Systematic sampling Technique
Stratified Sampling Technique
Cluster Sampling Technique.
Hybrid Sampling
37
Probability sampling
1. Simple Random Sampling (SRS): each person in a population
has the same chance of being included in a sample as every
other person.
each element of the population has an equal chance of being
selected into the sample.
38
Probability sampling
E.g., simple random sampling for household surveys
Population = all households in the country
Sampling frame = the list of all households (20 million in
Ethiopia?)
Sample size = say we have resources to cover only 20,000
households
Sampling fraction 20,000/20,000,000 or 0.1%
Select randomly 20,000 households from the long list of
20,000,000 households
39
Probability sampling
Merits of SRS
No investigator bias or discretion
Help us to obtain a more representative sample
Can produce better estimates for the population
Limitations
It needs up-to date list of the population units
Units selected might be scattered geographically, hence,
high cost of data collection
40
Probability sampling
2. Systematic Sampling Technique
In systematic sampling individuals are chosen at regular
intervals (for example every kth) from the sampling frame.
41
Probability sampling
Steps to draw a systematic sample:
Calculate K (the sampling interval).
Select a number between 1 and K at random, say that
number is r.
Then this means we select the rth element at random.
Then the rth, (r+K) th, (r+2K) th, …, [r+(n-1)K] th elements of
the population will be selected in our sample .
42
Probability sampling
E.g., a systematic sample is to be selected from 1200 students
of a school.
The sample size to be selected is 100.
The sampling fraction is: 100/1200= sample size/study
population = 1/12
The sampling interval is therefore 12.
The first student in the sample is chosen randomly, for
example by blindly picking one out of twelve pieces of paper,
numbered 1 to 12.
If number 6 is picked - every twelfth student will be
included –i.e. 6, 18, 30, 42, etc.
43
Probability sampling
3. Stratified Sampling
Useful when we have heterogeneous populations.
44
45
Probability sampling
The reasons for stratifying
1. To increase a sample’s statistical efficiency (smaller
standard errors-less variation).
2. To provide adequate data for analyzing the various
subpopulation.
3. To enable different research methods and procedures to be
used in different strata.
4. The absence or poor quality of a sampling frame makes it
necessary to first select a sample of geographical units, and
then to construct a sampling frame only within those
selected units.
46
Probability sampling
multiple stage stratified random sampling could also be
considered.
E.g., in the household survey we may be interested to have
sufficient number of households from each region of
Ethiopia;
So stratify by region!
How to Stratify
Three major decisions must be made in order to stratify the
given population into some mutually exclusive groups.
47
Probability sampling
(1) What stratification base to use: stratification would be based
on the principal variable under study such as income, age,
education, sex, location, religion, etc.
48
Probability sampling
(3) What strata sample size to draw: different approaches could be
used:
One could adopt a proportionate sampling procedure.
49
Probability sampling
4. Cluster Sampling:
It may be difficult or impossible to take a simple random
sample because a complete sampling frame does not exist, or
Due to logistical difficulties
E.G: interviewing people who are scattered over a
large area may be too time-consuming.
51
Probability sampling
52
Non-probability sampling
Non-Probability Sampling: selection is non random i.e., sampling
units/elements have unequal chance of being selected
53
Non-probability sampling
Three conditions to use non-probability sampling.
First, if there is no desire to generalize to a population
parameter.
54
Non-probability sampling
(1) Convenience or accidental sampling: The method selects anyone
who is convenient.
Units that are convenient for the investigator are selected
(e.g. volunteers)
It can produce highly un-representative samples.
55
Non-probability sampling
(2) Quota Sampling: subgroups are identified and a specified
number of individuals from each group are included in the
research – based on certain criteria.
57
Non-probability sampling
(4) Snowball (Network) Sampling – chain sampling
This is a method for identifying and selecting the cases in a
network.
You start with one or two information-rich key
informants and ask them if they know persons who know
a lot about your topic of interest.
Contact the first few and ask them for names of others, and so on.
58
Problems in Sampling
Sampling errors occur randomly and are equally likely to be in either direction.
The magnitude of the sampling error depends upon the nature of the universe; the
more homogeneous the universe, the smaller the sampling error. Sampling error
is inversely related to the size of the sample i.e., sampling error decreases as the
sample size increases and vice-versa.
59
Problems in Sampling
Survey errors: The discrepancy between statements from
survey estimates and the reality (the true value) is called
survey errors.
60
Problems in Sampling
Sampling errors are errors which are attributable to sampling,
and which therefore, are not present in information gathered in
a census.
It is not a mistake
Can be controlled by well developed sampling theory
This error arises because it is unlikely that one will end up with a
truly representative sample, even when probability sampling is
employed
61
Problems in Sampling
Sampling error is related to confidence intervals.
63
Problems in Sampling
Non-Coverage sampling error: This refers to sample frame defect.
Omission of part of the target population (e.g., soldiers,
students living on campus, people in hospitals, prisoners,
households without a telephone in telephone surveys, etc.).
64
Problems in Sampling
Non response error – Common in self-administered surveys
This error occurs when you are not able to find those whom
you were supposed to study.
65
Problems in Sampling
Instrument error
The instrument in sampling survey is the device in which
we collect data- usually a questionnaire.
When a question is badly asked or worded, the resulting
error is called instrument error.
66
Problems in Sampling
Interviewer Error :
Enumerators can distort the results of a survey by in-
appropriate suggestions, word emphasis, tone of voice and
question rephrasing.
Cheating by enumerators -with only limited training and
under little direct supervision.
Perceived social distance between enumerator and
respondent also has a distorting effect.
68
The Concept of Sampling
69