Professional Documents
Culture Documents
ScienceDirect
Research Methods
Ajit Kumar, Shivani Dogra, Avneet Kaur, Manoj Modi, Anup Thakur, Satish Saluja*
Department of Neonatology, Sir Ganga Ram Hospital, New Delhi, India
abstract
Keywords: Adequate sample size is of paramount importance in medical research. Inadequate number of
Sample size subjects in a study may lead to inconclusive results and erroneous interpretation. This ne-
a Error cessitates estimation of sample size for a research project. There are formulae for calculation of
b Error sample size for different study designs. A researcher can manually compute sample size using
Power these formulae. Alternatively, there are several statistical softwares and online calculators,
which can compute sample size for various research designs. Common types of study designs
in medical research are estimation of a proportion or a mean in a defined population; and
hypothesis testing for a difference in two qualitative (proportions) or two quantitative out-
comes (means). For each type of study design, investigator needs to have some a priori infor-
mation like precision, acceptable limits of a & b error, confidence level and effect size.
Copyright ª 2014, Sir Ganga Ram Hospital. Published by Reed Elsevier India Pvt. Ltd. All
rights reserved.
Before we start a research project, it is important to know the Power of a study or the b error is not applicable for
minimum number of subjects required to be enrolled to obtain calculation of sample size in these type of study design.
a meaningful answer to our research question. Certain 2. Comparison studies (hypothesis testing)
essential parameters are required to calculate sample size.1 In a. Categorical outcomes
this section, we shall discuss calculation of sample size using i. Expected proportion of outcome/event rates in two
either statistical formulae, software, web based online calcu- groups (assumed from pilot observation or reported
lators or downloadable smartphone applications. For the from other studies)
calculation of sample size, following information is required: b. Continuous outcomes
i. Expected means and standard deviations of quanti-
1. Estimation studies tative outcomes in two groups (assumed from pilot
a. If the outcomes are observation or reported from other studies)
i. Categorical: expected prevalence or proportion of an c. Type I (a error): Usually a error of 5% is acceptable. The
event/disease in a population corresponding z values for a error or for 95% confidence
ii. Continuous: standard deviation of parameter of level are displayed in Table 1.
interest d. b (beta) error is conventionally kept at 20%. The power of
b. Precision (d): absolute/relative expressed as fraction e.g. a study is 1b. For a b error of 20% (0.2) the power of study
a 10% precision shall be expressed as 0.1 is 80% (10.2), its corresponding z value is 0.842 (Table 1).
c. Confidence level: Usually a confidence level of 95% is
acceptable. This also gives us the estimate of ‘a error’, Values of Z1a and Z1b corresponding to different level of a
which is 5% or 0.05 for 95% confidence level. error and power (1b) are listed in Table 1. Z1a value is
* Corresponding author.
E-mail address: satishsaluja@gmail.com (S. Saluja).
http://dx.doi.org/10.1016/j.cmrp.2014.04.001
2352-0817/Copyright ª 2014, Sir Ganga Ram Hospital. Published by Reed Elsevier India Pvt. Ltd. All rights reserved.
88 c u r r e n t m e d i c i n e r e s e a r c h a n d p r a c t i c e 4 ( 2 0 1 4 ) 8 7 e9 2
obese patients during the post-anesthesia care unit (PACU) (e.g. 50 mmHge60 mmHg) with an expected standard devia-
stay.4 They calculated sample size to detect an absolute tion of 10 in both groups with power 80% and a error 5%.
improvement of 10 mmHg in arterial oxygen partial pressure Sample size calculation for this study is as follows:
s21 þ s22 Z1a=2 þ Z1b This shows sample size calculation for hypothesis testing in
n¼ two population proportions. Formula is displayed in high-
ðm1 m2 Þ2
h i lighted ellipse at top row with value of sample size in high-
ð10Þ2 þ ð10Þ2 ½1:96 þ 0:8422 lighted circle. In this example to detect a difference of 20%
n¼ ¼ 15:7
ð10Þ2 (80%e60% or 0.8e0.6) with alpha 0.05 and power of 80%, 36
h i subjects would be required in each group.
2 ð10Þ2 ½1:96 þ 0:8422
n¼ ¼ 15:7
ð10Þ2
5.2. Online calculators & softwares
Here SD of 10 has been used for both groups and 2 (SD)2
has been used instead of (SD1)2 (SD2)2. In this study we need Many downloadable software and online calculators are
to enroll 16 patients in each group. available on internet, which can do sample size calculation for
Reader need not to go through these calculations manu- common study designs. Following is the list of few readily
ally. These calculations can be done with ease by various available websites or free download softwares:
methods e.g STATA (www.stata.com),
Power & Precision (www.PowerAnalysis.com),
Excel calculator Power and sample size (http://biostat.mc.vanderbilt.edu/
Online calculators wiki/Main/PowerSampleSize) Sealedenvelop.com (www.
Downloadable applications and softwares. sealedenvelope.com)
http://www.stat.ubc.ca/wrollin/stats/ssize/
5.1. Excel calculator
Sealedenvelop.com is one of the user friendly website,
We can create excel calculator by entering above formula in which can be accessed on a computer or a smartphone to
an excel file. Fig. 1 displays a snapshot of one of the excel file. calculate sample size online. Fig. 2 is showing snapshot of
sample size calculation using sealedenvelop.com. In this outcome of interest and is subject to high type II error. A large
example, sample size has been calculated to detect a decrease sample size than required is a waste of resources and would
in primary outcome from 30% in control group to 20% in be unethical. Sample size for various study designs can be
experimental group for 80% power with 5% level of signifi- calculated manually, using statistical formulae or with help of
cance. This shows that a sample size of 291 subjects in each statistical softwares or online calculators.
group is required for this study.
There are many statistical software which can be down-
loaded to computer for sample size estimation and other Conflicts of interest
statistical calculations. One of the commonly used software is
STATA. Step by step sample size calculation using STATA for 2 All authors have none to declare.
population proportion with alpha 0.05 and power 0.80 has
been displayed in Fig. 3 In the first window under the ‘Main’
tick type of populations based on the study design and under
references
the ‘Option’ tick ‘Compute sample size’ and enter confidence
level and power of study. On clicking submit a separate win-
dow opens up as in third frame indicating sample size. For a
1. Singh A, Soni A, Saluja S. Essentials of sample size calculation.
reduction in outcome from 30% to 20% a sample size of 313 GRJ. 2013;3:97e99.
subjects is required in each group. 2. Lawanga SK, Lemeshow S. Adequacy of Sample Size in Health
Studies. World Health Organization; 1990.
3. Morley CJ, Davis PG, Doyle LW, Brion LP, Hascoet JM, Carlin JB,
6. Summary COIN Trial Investigators. Nasal CPAP or intubation at birth for
very preterm infants. N Engl J Med. 2008;358:700e708.
4. Zoremba M, Kalmus G, Begemann D, et al. Short term non-
Sample size estimation is an essential step in designing and
invasive ventilation post-surgery improves arterial blood-
conducting medical research. It ensures recruitment of gases in obese subjects compared to supplemental oxygen
adequate number of subjects in the study. An inadvertent delivery e a randomized controlled trial. BMC Anesthesiol.
small sample size will not give accurate assessment of the 2011;11:10.