Professional Documents
Culture Documents
Design your experiment. The sample size will depend on what type of research you conduct,
such as performing interviews, doing surveys, reporting voting patterns or measuring molecules.
Calculate the population size. Your research likely has the goal of finding something out about a
population, and in order to determine how many observations you need to make (your sample
size), it's necessary to know how many total possible observations are available.
Specify the level of accuracy you want from your research. The sample size directly determines
the margin of error or the width of the confidence interval, two statistical measurements that can
be used to judge how accurately your research tracks to the larger population.
Calculate your ideal sample size. Do this by using a formula or an estimate. Statistical software
often provides formulas for calculating sample size. You can use such software, or you can
estimate your sample size given your research design, size of population and level of accuracy.
A number of formulae are available for working out sample size. We will introduce two of them
here to highlight some of the issues involved in calculating sample size for a quantitative project.
The proportion of the population (p) may be known from prior research or other sources; if it is
unknown use p = 0.5 which assumes maximum heterogeneity (i.e. a 50/50 split). The degree of
precision (d) is the margin of error that is acceptable. Setting d = 0.02, for example, would give a
margin of error of plus or minus 2%. We apply this formula in the example in the worked
example below.
To determine the minimum sample size you then apply the formula:
z 2 pq
nr
d2
1.96 0.5 0.5
2
0.052
384.16 385
To calculate the sample size based on the sample required to estimate a population mean with an
approximate 95% confidence level, you can use the following formula:
z 2 2
nr 2
d
Where nr = required sample size, ó (the Greek letter sigma) = the population standard deviation,
a measure of the variation in the population and d = the degree of precision required by the
researcher. A drawback with this formula is the need to know the population standard deviation.
This may be known from prior research; if a good estimate is unavailable the formula will not be
reliable. We apply this formula in the example below.
z 2σ 2
nr
d2
1.96 1.5
2 2
0.252
138.29 139
Note that as with the proportion formula, accepting a higher margin of error will decrease the
We are given Margin of Error d = 50, Standard deviation s = 255 and the reliability co-efficient
=1.96.
zs 1.96 255
2 2
nr 99.92 100
d 50
The population size, N=1000
nr 1 99
Therefore 0.099
N 1000
Since the sampling fraction is considerable we have to recalculate adjusted sample size using the
formula,
nr
na
n 1
1 r
N
100
1
100 1
1000
100
91
1.099
If, N = 10000
na 1 90
0.009
N 10000
Since the sampling fraction is negligible we can take the previously calculated n (=100).
A preliminary sample of 100 farmers was selected from a population of 5000 farmers by simple
random sampling. It was found that 40 of the selected farmers opt for a new variety of paddy.
How large a sample must be selected to have a precision of 5% with 95% confidence?
40
We are given that p 0.4; d 0.05 and the reliability coefficient = 1.96.
100
Hence,
pq 0.4 0.6
n z2 (1.96) 2
d2 0.0025
0.93
369
0.0025
368
The sampling fraction 0.074. it is considerable.
5000
Hence
nr 369 369
na
n 1 369 - 1 1.074
1 r 1
N 5000
344