Professional Documents
Culture Documents
1 - 1 - 5 Survey Sample Calculator Template 3
1 - 1 - 5 Survey Sample Calculator Template 3
PLANNING
survey resources
SAMPLE SIZE SAMPLE SIZE PROPORTIONATE
basic formula comparison formula stratification
and feasibility
This tool can be used in survey This tool can be used to This tool can be used to This tool can be used to break
design to calculate the feasible calculate the sample size calculate the sample size up a sample over a number of
number of interviews given the required for a statistically required for a statistically different locations or populations
information requirements, relevant (probabilistic) sample relevant (probabilistic) sample types using proportionate
resources and time available. using the basic formula for for comparison surveys (e.g. stratification.
This tool can also be used for proportions. either analysing the evolution of
planning on the resources an indicator, comparing sub-
required for a certain sample groups, etc.) using the formula
size one wishes to achieve. for proportions.
Total number of interviews which can be completed in one day by one enumerator interviews nQ = qd / (qt + qb) 9
Time each day to administer interviews minutes qd 240
Time to administer one interview minutes qt 20
Time required between interviews (e.g. to move from one interview to another) minutes qb 5
Total number of enumerators for data collection people nE = e a / e r 5
Available enumerators people ea 5
# enumerators required per interview people er 1
Total number of days for interviews days nD = d f - d t 5
Total days for the field mission (including travel days) days df 7
Number of travel days days dt 2
Population size The number of units (households, individuals, etc.) from which the sample will be drawn. Essentially the N
population in your sample frame. If this is unknown, leave blank, and the calculator will use the basic sample
unadjusted for a finite population.
Confidence interval The level of confidence of the statistics made on the sampled population. For example, a 95% confidence level Z 90%
indicates that you can be 95% sure that the responses/measures you get from the sampled population match
what would be the responses/measures if you had collected data on the entire population. The standard in most
surveys is 0.95. If resources do not permit or the variable of study is expected to be homogeneous, 0.90 could
be used with small additional risk. Recommended value: 95%
Margin of error The plus or minus (+/-) figure reported in the results. For example, if you have a confidence interval of 0.05 and e 0.05
47% of the sample selected response "A" to a question, you can state with XX% confidence that between 42%
and 52% of the population would choose "A". Recommended value: 0.05 (for low baseline or low expected
prevalence levels, e.g. <10%, a smaller margin of error such as 0.02 or 0.01 should be used).
Prevalence The proportion of the target population that are known to or thought to have the characteristic being measured p 0.50
(e.g. engaged in farming, poor food consumption, displaced, etc.). Recommended value: 0.5 If unknowm 50%
or 0.5 is accepted as the most conservative. If the prevalance is known, use the prevalance rate.
Design effect (DEFF) When using complex sampling methods, such cluster sampling, the sample size should be adjusted to account D 1.00
for the design effect (DEFF) which is the extent to which the sampling error departs from the sampling error used
in simple random sampling. For simple random or stratified random samples, the value here is 1.0 as we do not
account for DEFF. For cluster samples, 2.0 is commonly used as default where cluster sizes are small and
numerous. Recommended value: 1.0 for simple random sampling; 2.0 for cluster sampling.
Expected response rate If you expect there to be some non-responses, the sample size can be adjusted to help ensure you collect R 1.00
sufficent data for the analysis. Recommended value: Use the response rate observed in previous exercises on
similar topics in the region. 1.0 for 100%, 0.8 for 80%, etc.
CALCULATIONS
First calculation First estimate, where population size is not known Z2*p*(1-p) / e2 270.60
Second calculation Finite population correction (for known population size) n1/(1+((n1-1)/N) 270.60
Third calculation Adjust for the DEFF n*D 270.60
Z-score for statisticial Z-score corresponding to the degree of confidence desired to be able to conclude that an observed difference 90%
significance (α) (P2-P1) would not have occurred by chance (α is statistical significance). The standard in most surveys is 0.95.
If resources do not permit, 0.90 could be used with small additional risk (Magneni, 1997). Recommended value: Zα
95%
Z-score corresponding to Z-score corresponding to the degree of confidence desired to be certain of detecting a difference (P2-P1) if one 80%
statistical power (β) actually occurred (β is statistical power). The minimum value of 80% is generally accepted. Recommended
value: minimum 80% Zβ
Prevalence round/group 1 Estimated proportion of the target population with the characteristics being measured in the first round or group 0.50
of the survey (baseline). Recommended value: Expected prevalence OR 0.5 If unknowm 50% or 0.5 is
accepted as the most conservative. If the prevalance is known, use the prevalance rate. P1
Prevalence round/group 2 Expected proportion of the target population with the characteristics being measured in the second round or 0.40
group of the survey (comparison), such that the quantity (P2-P1) is the size of the magnitude of difference it is
desired to be able to detect. If the prevalence is unknown, simply add or subtract the real % difference you
would like to detect (e.g. if P1 is 0.5 or 50% and you would like to detect differences when the real difference is P2
15%, then enter 0.65 or 65% here). The smaller the difference, the larger the sample size required to find a
difference. Recommended value: Expected prevalence OR P1 minus the minimum % difference you would like
to detect
One-sided or two-sided test One-sided (also known as directional hypothesis) tests the significance of the relationship between variables in determines value of α 2
one direction (e.g. hypothesis that sample mean is lower than population mean). Two-sided (also known as non-
directional hypothesis) tests the significance of the relationship between the variables in either direction (e.g.
sample mean is different, either lower or higher, than the population mean). Recommended value: One-sided if
the direction is thought to be known, more conservative two-sided if the direction is unknown.
Design effect (DEFF) When using complex sampling methods, such cluster sampling, the sample size should be adjusted to account 1.00
for the design effect (DEFF) which is the extent to which the sampling error departs from the sampling error used
in simple random sampling. For simple random or stratified random samples, the value here is 1.0 as we do not D
account for DEFF. For cluster samples, 2.0 is commonly used as default where cluster sizes are small and
numerous. Recommended value: 1.0 for simple random sampling; 2.0 for cluster sampling.
Expected response rate If you expect there to be some non-responses, the sample size can be adjusted to help ensure you collect R 1.00
sufficent data for the analysis. Recommended value: Use the response rate observed in previous exercises on
similar topics in the region. 1.0 for 100%, 0.8 for 80%, etc.
CALCULATIONS
Base calculation (Zα+Zβ)^2 *(P1*(1-P1) + P2*(1-P2)) / (P2-P1)^2 302.55
Adjust for DEFF n*D 302.55
Adjust for non-response n/R 302.55
Home
PROPORTIONATE STRATIFICATION
This tool can be used to break up the sample over a number of different locations or populations types using proportionate stratification. Fill out all values in
LIGHT GREEN at right and see the result in adjacent box (in DARK GREEN).
Population size (N) Sample size (n) Population size (N) Sample size (n)
TOTAL TOTAL
Place 1 Population 1
Place 2 Place 1 Population 2
Place 3 Population 3
Place 4 Population 1
Place 5 Place 2 Population 2
Place 6 Population 3
Place 7 Population 1
Place 8 Place 3 Population 2
Place 9 Population 3
Place 10 Population 1
Place 11 Place 4 Population 2
Place 12 Population 3
Place 13 Population 1
Place 14 Place 5 Population 2
Place 15 Population 3
Place 16 Population 1
Place 17 Place 6 Population 2
Place 18 Population 3
Place 19 Population 1
Place 20 Place 7 Population 2
Place 21 Population 3
Place 22 Population 1
Place 23 Place 8 Population 2
Place 24 Population 3
Place 25 Population 1
Place 26 Place 9 Population 2
Place 27 Population 3
Place 28 Population 1
Place 29 Place 10 Population 2