Sample Size Determination

MED101x
Introduction to Applied Biostatistics
Sample Size Computation
A researcher conducted a study comparing the

effect of an intervention vs placebo on reducing
body weight, and found 5 lbs reduction among
the intervention group with P=0.01.
Another researcher conducted a similar study

comparing the effect of the same intervention vs
the same placebo on reducing body weight, and
found the same 5 lbs reduction with the
intervention group but could not claim that the
intervention was effective because P=0.35.
What do you think the crying researcher did differently from the smiling one?
1
MED101x
What impacts on p-value when comparing new drug v.s. placebo?
The effect of the new drug. ex: Larger reduction (10lbs) in weight by the
new drug!
Variation of data: Larger variation can result in larger p-value.

Source of variation:
Between-subject variation
Measurement error
And what else??????
Question: How can I make my P-value smaller.
Enroll as
many as
you can.
P-value!?
2
MED101x
What impacts on p-value when comparing new drug v.s. control?
The effect of the new drug. ex: Larger reduction (10lbs) in weight by the
new drug!
Variation of data: Larger SD can result in larger p-value,

Source of variation:
Between-subject variation
Measurement error
Sample size. Larger sample size can make p-value smaller!

Even a tiny, clinically meaningless
effect can become significant some
day if you keep enrolling patients
indefinitely. 5
As many as I can???????
How many can I ???????
So you need to justify

the enrollment of a
minimum number of
subjects enough to
prove that the drug is
effective.
Need
Sample size
Estimation!!!
Do I have a enough resource?
Does NIH agree to pay me that much????
6
Is it ethical to expose unnecessary large number of patient to a unproven drug?
3
MED101x
42% of R01 applications are criticized for sample size or power problems.
Inouye SK, Fiellin DA. An evidence-based guide to grant proposals for clinical research. Ann Intern Med. 2005;142:274-282
Example of reporting sample size estimation (１)
CONSORT statements provide a check list for required items for RCT,
and used by many journals such as NEJM, LANCET, JAMA, Anals Int
Med
• Title and Abstract
• Results
• Introduction
– Recruitment
– Background
– Baseline data
• Methods
– Numbers analyzed
– Participants
– Outcomes and estimation
– Interventions
– Ancillary analyses
– Objectives
– Adverse events
– Outcomes
• Comments
– Randomization
– Interpretation
– Blinding
– Generalizability
– Sample Size and Power
– Overall evidence
– Statistical methods
8
4
MED101x
Example of reporting sample size estimation (２)
Scientific approach of proving a hypothesis = Disproving a null hypothesis.
Null Hypothesis：There is no difference between the new drug and

control drug.
P-value: The probability of observing a difference as large or larger just

by change alone when the null hypothesis is true.
Reject The new drug is more effective than the control.
Fail to reject No evidence to support that the new drug is more

effective than the control.
When you make this inferential judgment, two type of errors can occur.
10
5
MED101x
Two critical errors involved in hypothesis testing:
Type 1 error (α) : falsely concluding that the drug is effective when the
drug actually is not effective.
Traditionally, you are allowed to make this error up to 5%
{i.e., significance level (α) = 5% }
Type 2 error (β): falsely concluding that the drug has no effect when the
drug is actually effective.
Traditionally, you are allowed to make this error up to 20%
Power of statistical tests:

Power of the test (1-β): the probability of correctly concluding that the
drug is effective, when the drug actually is effective.
A statistical test with a larger sample size can decrease both type I and II
errors. We try to get a sample large enough to ensure that 1-β is at a
reasonable level (80% or more).
11
Larger Sample Size
Greater power to detect a

true difference!
Smaller p-value !!!!

12
6
MED101x
Example: Estimation of sample size comparing 2 group means
Study Design:
2-arm RCT to compare HbA1c level among patients with type 2 diabetes.
Sample size computation:

A pilot data suggests that mean HbA1c level among patients without this
intervention is 8.7with standard deviation of 2.2. We believe that the
intervention will decrease patient’s HbA1c level by 1%. (? patients in
each group) are needed to achieve 80% power at two-sided 5%
significance level.
13
Free Software for Sample Size and Power – PS Software
PS （Power and Sample Size) software
PS is an interactive program for performing power and sample size

calculations and freely available on the internet. This program was
developed by my colleagues, Professor William Dupont and Dale
Plummer. You can download it from our website of the department of
Biostatistics, Vanderbilt University at:
http://biostat.mc.vanderbilt.edu/twiki/bin/view/Main/PowerSampleSize
14
7
MED101x
How to download PS software (1): How to find the download site for PS
http://biostat.mc.vanderbilt.edu/twiki/bin/view/Main/PowerSampleSize
15
How to download PS software (2): Download site
16
8
MED101x
Sample size estimation using PS software – Test Type
T-test
Comparing continuous
outcome variable
comparing between 2
groups
17
T-test
outcome variable
comparing between 2
groups
Dichotomous
Comparing binary
outcome variable
comparing between 2
groups
18
9
MED101x
T-test
outcome variable
comparing between 2
groups
Dichotomous
Comparing binary
outcome variable
comparing between 2
groups
Survival
Comparing binary
outcome variable
comparing between 2
groups where not all
patients are followed for
the same time period
19
Regression 1
Comparing binary
outcome variable
comparing between 2
groups where not all
patients are followed for
the same time period
20
10
MED101x
Sample size estimation using PS software - Parameters
21
Independent t-test
or Paired t-test?
22
11
MED101x
23
Parameters: Significance level
α = Significance level (Type I error probability)

Allowable probability falsely detecting difference when
there is none.
Typically set to 2-sided 5%
24
12
MED101x
Parameters: Power
Power =
The probability of correctly detecting that groups are
different, when groups are actually different.
Typically set to 80, or 90%
25
Parameters: Difference in means
δ = Difference in means (effect)

Typically you obtain this from a pilot study or a literature.
26
13
MED101x
Parameters: Standard Deviation
σ = Standarad Deviation
Showing variation of data
Typically you obtain this from a pilot study or a literature.
Histogram Histogram
120.0 120.0
80.0 80.0
Count
Count
40.0 40.0
0.0 0.0
0.0 0.5 0.9 1.4 0.0 0.5 0.9 1.4
log_Valsal log_Valsal
Exp Cont
27
Parameters: m
m=
the ratio of control to experimental patients.
28
14
MED101x
Sample size estimation using PS software for Student’s t-tests (1)
After entering all

parameters, click
here
29
Sample size estimation using PS software for Student’s t-tests (2):

Drawing a graph of statistical power by varying sample sizes (1)
2 Click here
for the graph
30
15
MED101x

1 Draw the graph
31

Copy and
1 paste into your
document
32
16
MED101x

0.8
0.6
0.4
0.2
0
0 20 40 60 80 100 120 140 160 180 200
Sample size
It requires about 77 patients in each group (total of 154) to achieve

80% power.
33
Improving analytical power (reducing required sample size):

Transformation
For skewed distribution, transformation may improve power.
Original value Log transformed value
15
10.0
10 7.5
Count
Count
5.0
2.5
0 0.0
6.0 8.0 10.0 12.0 14.0 1.80 2.00 2.20 2.40 2.60
12 Month HbA1c ln_ha1c12
34
17
MED101x
Required
sample size
reduced from
77 to 49 per
arm.
Transformation
improves power
only when
improving
normality
Parameters needed for sample size computation:

Power = 80%, 2-sided α = 5%, δ=mean(ln(HbA1c))=0.122
σ =0.21, m = 1
35
SD of ln(HbA1c)
Comparing 2 proportions (Pearson chi-square test)
Percent of patients who did not improve HbA1c level (defined as not
achieving < 7%) was 80% among patients without an intervention.
We anticipate 25% reduction with this intervention (80% x
0.75=60%). ?81 patients in each group are needed to achieve 80%
power with two-sided 5% significance level.
Parameters needed for sample size computation:

Power = 80%,
Significance level (α) = 2-sided 5%,
Anticipated difference (δ) =1
Standard deviation (σ)=2.2,
m (sample size ratio between the two groups) = 1 36
18
MED101x
Sample size computation for the question on the previous page
37
Impact of data categorization on analytical power
General Rule: Greater granularity in Data leads to greater power.

Loss of Data often leads to loss of power.
Two continuous variables (N=30) were generated assuming correlation = 0.5

o o
oo
oo
o o
o o oo
Outcome (Median=0.17)
Pearson’s correlation o
o
o
o oo o
o
Exposure (Median=1.2)
→ Power > 80%
Exposure
Independent Sample t-test ]
→ Power = 59%
75 ]
50
<1.2 >1.2
Pearson chi-square test 25 Exposure
→ Power = 12% 0
<1.2 >1.2 38
Exposure
19
MED101x
Concluding Remarks !!
Be familier with terminology in sample size computation !!
Never put power less than 80% in a grant proposal !!!!
Not enough sample size, there may be a way to get 80% !!!!
Think about what you want to compute, power, sample size, or effect?
Avoid categorization, go continuous !!!
Download PS now, and try it before midnight today !!!

39
20

Sample Size Determination

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Sample Size Determination

Uploaded by

Copyright:

Available Formats

MED101x

Introduction to Applied Biostatistics

Sample Size Computation

A researcher conducted a study comparing the

Another researcher conducted a similar study

What impacts on p-value when comparing new drug v.s. placebo?

Variation of data: Larger variation can result in larger p-value.

And what else??????

Question: How can I make my P-value smaller.

What impacts on p-value when comparing new drug v.s. control?

Variation of data: Larger SD can result in larger p-value,

Sample size. Larger sample size can make p-value smaller!

So you need to justify

Example of reporting sample size estimation (１)

Example of reporting sample size estimation (２)

Scientific approach of proving a hypothesis = Disproving a null hypothesis.

Null Hypothesis：There is no difference between the new drug and

P-value: The probability of observing a difference as large or larger just

Reject The new drug is more effective than the control.

Fail to reject No evidence to support that the new drug is more

Two critical errors involved in hypothesis testing:

Power of statistical tests:

Larger Sample Size

Greater power to detect a

Smaller p-value !!!!

Example: Estimation of sample size comparing 2 group means

Sample size computation:

Free Software for Sample Size and Power – PS Software

PS （Power and Sample Size) software

PS is an interactive program for performing power and sample size

How to download PS software (2): Download site

Sample size estimation using PS software – Test Type

Sample size estimation using PS software – Test Type

Sample size estimation using PS software – Test Type

Sample size estimation using PS software – Test Type

Sample size estimation using PS software - Parameters

Sample size estimation using PS software - Parameters

Sample size estimation using PS software - Parameters

Parameters: Significance level

α = Significance level (Type I error probability)

Typically set to 2-sided 5%

Typically set to 80, or 90%

Parameters: Difference in means

δ = Difference in means (effect)

Parameters: Standard Deviation

Typically you obtain this from a pilot study or a literature.

Sample size estimation using PS software for Student’s t-tests (1)

After entering all

Sample size estimation using PS software for Student’s t-tests (2):

for the graph

Sample size estimation using PS software for Student’s t-tests (3):

1 Draw the graph

Sample size estimation using PS software for Student’s t-tests (4):

Sample size estimation using PS software for Student’s t-tests (5):

It requires about 77 patients in each group (total of 154) to achieve

Improving analytical power (reducing required sample size):

For skewed distribution, transformation may improve power.

Original value Log transformed value

12 Month HbA1c ln_ha1c12

Parameters needed for sample size computation:

Comparing 2 proportions (Pearson chi-square test)

Parameters needed for sample size computation:

Sample size computation for the question on the previous page