Professional Documents
Culture Documents
Data Collection,
Determination of Sample
Size, Basic Concepts in
Sampling Design
Learning Outcomes:
❖Determine the
appropriate sample size
❖Differentiative various
sampling techniques
❖Know the sources of
errors in sampling
Sample Size
Determination
SAMPLE SIZE
-is typically denoted by n and it is always a
positive integer
How many should be chosen
for a study?
Non-Statistical Considerations
❖Level of Precision
❖Confidence Level
❖Degree of Variability
Criteria need to be specified to
Determine the appropriate Sample Size
LEVEL OF PRECISION
- is the range in which the true value of the population is
estimated to be
true.
-also called sampling error.
𝜎
Sampling Error = Z ×
𝑛
Criteria need to be specified to
Determine the appropriate Sample Size
CONFIDENCE LEVEL
- it is statistical measure of the number of times out of 100 that results can
be expected to be within a specified range.
Z- Table
1-(a/2) where a is a level or
significance
1-0.9=0.10
1-(0.10/2)=0.95
1-0.99=0.01
1-(0.01/2)=0.995
Criteria need to be specified to
Determine the appropriate Sample Size
DEGREE OF VARIABILITY
- Depending upon the target population and attributes under consideration,
the degree of variability varies considerably. The more heterogeneous a population
is the larger the sample size is required to get an optimum level of precision.
Methods in Determining the Sample
Size
𝑧𝜎 2
𝑛≥
ⅇ
Where:
Z is the z-score corresponding to level of confidence
e is the level of precision.
Methods in Determining the Sample
Size
Estimating the Mean or Average
Take Note:
If When σ is unknown, it is common practice to conduct a preliminary survey to determine s
and use it as an estimate of σ or use results from previous studies to obtain an estimate of
σ. When using this approach, the size of the sample should be at least 30. The formula for
the sample standard deviation s is
𝑥 − 𝑥ҧ 2
𝑠=
𝑛−1
Methods in Determining the Sample
Size
Example:
A soft drink machine is regulated so that the amount of drink
dispensed is approximately normally distributed with a standard
deviation equal to 0.5 ounce. Determine the sample size needed if
we wish to be 95% confident that our sample mean will be within
+.03 ounce from the true mean.
𝑧𝜎 2 1.96(0.5 2 0.98 2
Z=1.96 n= n= n=
ⅇ 0.03 0.03 n= 32.67 2
n= 1067
Methods in Determining the Sample
Size
Estimating Proportion (infinite Population)
The sample size required to obtain a confidence interval for p with
specified margin of error e is given by
𝑧 2
𝑛≥ 𝑃 1−𝑃
ⅇ
Where:
Z is the z-score corresponding to level of confidence
e is the level of precision.
P is population proportion
Methods in Determining the Sample
Size
Example
Suppose we are doing a study on the inhabitants of a large town, and want to find out
how many households serve breakfast in the mornings. We don’t have much information
on the subject to begin with, so we’re going to assume that half of the families serve
breakfast: this gives us maximum variability. So p = 0.5. We want 99% confidence and at
least 1% precision.
𝑧 2 2
Z= 2.58 2.58
𝑛≥
ⅇ
𝑃 1−𝑃 𝑛≥ 0.25 𝑛 ≥ 258 2 0.25
0.01
-is used to calculate the sample size n given the population size and
error. It is computed as
𝑁
𝑛≥
1 + 𝑁ⅇ2
Where:
N is the total population
e is the level of precision.
Methods in Determining the Sample
Size
Example:
𝑁 1000 1000
𝑛≥ 𝑛≥ 𝑛≥
1 + 𝑁ⅇ2 1 + 1000(0.05)2 1 + 1000(0.05)2
1000
𝑛≥ 𝑛 ≥ 286
1 + 2.5
Methods in Determining the Sample
Size
Infinite Population Correction
The conservative formula using the strong law of large number.
1 𝑧 2
𝑛= ≈ 385
4 ⅇ
2 1
Where: 1 1.96 2
𝑛= 𝑛 = 39.2
Confidence level is 95% 4 0.05 4
1
The level of precision is 0.05 𝑛 = 1536.64 𝑛 = 384.16
4
Methods in Determining the Sample
Size
Finite Population Correction
If the population is small then the sample size can be reduced slightly.
𝑛0
𝑛= 385
𝑛 −1 𝑛= 𝑛 = 278
1+ 0 385 − 1
𝑁 1+
1000
Where:
𝑛0 is Cochran’s sample size recommendation
N is the population size
Sampling Design
Two Types of Samples
❖ Probability Samples
❖ Non-probability Samples
Probability Samples
• Samples are obtained using some objective chance mechanism, thus
involving randomization.
• They require the use of a complete listing of the elements of the
universe called the sampling frame.
• The probabilities of selection are known.
• They are generally referred to as random samples.
• They allow drawing of valid generalizations about the
universe/population.
Basic Sampling Techniques of
Probability Samples
A = (50/500) * 200 = 20
B = (50/500) * 300 = 30
Basic Sampling Techniques of Probability
Samples
Cluster Sampling
• It is a way to randomly select
participants from a list that is too
large for simple random sampling.
• The clusters are constructed such
that the sampling units are
heterogeneous within the cluster
and homogeneous among the
clusters.
Basic Sampling Techniques of Probability
Samples
Obtaining a Cluster Sampling
1. Divide the population into non-overlapping clusters.
2. Number the clusters in the population from 1 to N.
3. Select n distinct numbers from 1 to N using a randomization mechanism.
The selected clusters are the clusters associated with the selected
numbers.
4. The sample will consist of all the elements in the selected clusters.
Basic Sampling Techniques of Probability
Samples
Multi-Stage Sampling
Selection of the sample is done in two
or more steps or stages, with
sampling units varying in each stage.
Basic Sampling Techniques of Probability
Samples
1. Non-responses
2. Interviewer Error
3. Misrepresented Answers
4. Data entry errors
5. Questionnaire Design
6. Wording of Questions
7. Selection Bias