Professional Documents
Culture Documents
CALCULATION
♦ Using formulas
♦ Using tables of sample sizes
♦ Using statistical calculators
(StatCalc of EpiInfo, Open EPI)
Sample size calculation
Things to know:
type of the study: descriptive or analytic?
proportions or means? usual values?
amount of deviation from the true value?
Clinically important difference?
confidence level? power?
one-tailed or two-tailed hypotheses
Hypotheses testing
H0 : Patient is alive.
H1 : Patient is not alive (dead).
Hypotheses testing
State of Nature
Diagnosis No Yes
Assumption of innocence
= 0.05,
1- (confidence level) = .95
= 0.20,
1- (power) = 0.80
Confidence level, Power
The easiest ways to increase power are to:
p(100 p )
n f (1 )
2
where
n = computed sample size
p = estimate of the proportion
= the desired width of the confidence interval
1- = confidence level
Sample size for descriptive studies
1. Estimation of a population proportion
Table 1 Values for f(1-) for various
confidence levels 100 (1-) %
(1-) 0.8 0.9 0.95 0.99
f(1-)* 1.642 2.706 3.842 6.635
p(100 p )
n f (1 )
2
Sample size for descriptive studies
1. Estimation of a population proportion
Table 1 Values for f(1-) for various
confidence levels 100 (1-) %
(1-) 0.8 0.9 0.95 0.99
f(1-)* 1.642 2.706 3.842 6.635
p(100 p )
n f (1 )
2
15(100 15)
n 2
3.842
4
n 306
Sample size for descriptive studies
2. Estimation of a population mean
2
s
n 2 f (1 )
where
n = computed sample size
s = estimate of the standard deviation of the
observations
= the desired width of the confidence interval
1- = confidence level
Sample size for descriptive studies
2. Estimation of a population mean
Table 1 Values for f(1-) for various
confidence levels 100 (1-) %
(1-) 0.8 0.9 0.95 0.99
f(1-)* 1.642 2.706 3.842 6.635
2
s
n 2 f (1 )
Sample size for descriptive studies
2. Estimation of a population mean
Table 1 Values for f(1-) for various
confidence levels 100 (1-) %
(1-) 0.8 0.9 0.95 0.99
f(1-)* 1.642 2.706 3.842 6.635
where
n = computed sample size
p1, p2 = estimate of the sample proportion for each group
1- = confidence level
1- = power of the test
Sample size for analytic studies
1. Hypothesis testing between two proportions
Table 2 Values for f(,)*
Significance level,
Power, one-tailed two-tailed
1-
0.05 0.01 0.05 0.01
0.5 2.71 5.41 3.84 6.63
0.8 6.18 10.04 7.85 11.68
0.9 8.56 13.02 10.51 14.88
* f(,) is the square of the sum of the upper tail and
the upper tail point (for one tailed test) or 1/2 point
(for two-tailed test) of the std. Normal distribution
Sample size for analytic studies
1. Hypothesis testing between two proportions
A new antibiotic is to be compared to a standard drug
with respect to cure rate of urinary tract infection. The
new drug will be considered better than the standard
drug if it shows a 5% difference from the cure rate of
80%. How many patients are needed if the investigator
wants 90% power and 95% confidence?
Significance level,
Power, one-tailed two-tailed
1-
0.05 0.01 0.05 0.01
0.5 2.71 5.41 3.84 6.63
0.8 6.18 10.04 7.85 11.68
0.9 8.56 13.02 10.51 14.88
* f(,) is the square of the sum of the upper tail and
the upper tail point (for one tailed test) or 1/2 point
(for two-tailed test) of the std. Normal distribution
Sample size for analytic studies
1. Hypothesis testing between two proportions
A new antibiotic is to be compared to a standard drug
with respect to cure rate of urinary tract infection. The
new drug will be considered better than the standard
drug if it shows a 5% difference from the cure rate of
80%. How many patients are needed if the investigator
wants 90% power and 95% confidence?
where
n = computed sample size
s = estimate of the standard deviation of the
observations, assuming it is the same for
each group
= the true difference between the means
1- = confidence level
1- = power
Sample size for analytic studies
1. Hypothesis testing between two means
Table 2 Values for f(,)*
Significance level,
Power, one-tailed two-tailed
1-
0.05 0.01 0.05 0.01
0.5 2.71 5.41 3.84 6.63
0.8 6.18 10.04 7.85 11.68
0.9 8.56 13.02 10.51 14.88
* f(,) is the square of the sum of the upper tail and
the upper tail point (for one tailed test) or 1/2 point
(for two-tailed test) of the std. Normal distribution
Sample size for analytic studies
2. Hypothesis testing between two means
2
2s
n 2 f ( , )
Sample size for analytic studies
2. Hypothesis testing between two means
Table 2 Values for f(,)*
Significance level,
Power, one-tailed two-tailed
1-
0.05 0.01 0.05 0.01
0.5 2.71 5.41 3.84 6.63
0.8 6.18 10.04 7.85 11.68
0.9 8.56 13.02 10.51 14.88
* f(,) is the square of the sum of the upper tail and
the upper tail point (for one tailed test) or 1/2 point
(for two-tailed test) of the std. normal distribution
Sample size for analytic studies
2. Hypothesis testing between two means
2
2s
n f ( , )
2
2
2(10)
n 2
10.51 84
5
Sample size calculation using EPI-Info6
http://www.cdc.gov/epiinfo/Epi6/ei6.htm
STATCALC program
Sample size for analytic studies
2. Hypothesis testing between two means
Odds Ratio = 21
You need to have an estimate of the percentage of exposure among the controls
and either the odds ratio or the percentage of exposure among cases
Sample size for analytic studies
Cohort Study
Research question: Is Hib vaccine associated with the
development of leukemia among children in
Dasmarinas, Cavite ?
+ resolution
Flubendazole
group
(-) resolution
(Exposed)
+ resolution
Mebendazole
group (-) resolution
(Unexposed)
Calculate sample size: RCT
Example: Efficacy of flubendazole compared to mebendazole in
the treatment of trichiuriasis among pediatric patients.
+ resolution 75%
Flubendazole
group
(-) resolution
(Exposed)
+ resolution 50%
Mebendazole
group (-) resolution
(Unexposed)
50% with resolution in 75% with resolution
Mebendazole group in flubendazole group
General comments on estimation of sample size
Compute the sample size as early as possible
during the design phase, (to estimate the
resources required and the feasibility of the
study.
The rarer the condition being investigated, the
larger the sample size, all other things being
equal.
Complex data analysis generally requires larger
samples than simple analysis.
In general, longitudinal studies require a larger
sample size than case-control and cross
sectional studies.
General comments on estimation of sample size