Professional Documents
Culture Documents
2006
Writing a study protocol
introduction
research question
current knowledge
research hypothesis
research objective
Writing a study protocol
methodology
type of study
experiments
prospective (randomized, control)
observational
prospective
retrospective
audit
Writing a study protocol
methodology
sample size calculation
inclusion criteria
process of randomization
data collection)
control group
test group(s)
Writing a study protocol
methodology
monitoring of the patients during the period of
procedure
routine - heart rate, blood pressure, oxygen
saturation
data / observations
side effects of test protocol
rescue therapy
safety of patient
treatment plan
normality
statement about treatment of normal or
expressed as mean and standard deviation,
with 95% confidence interval
t-test for comparison of means obtained from
2 groups of data
analysis of variance test for comparison of
means obtained from more than 2 groups
Χ2 test for discrete data
Writing a study protocol
statistical analysis
non-normal distribution
expressed as median and range, with limits of
25th and 75th percentiles
Mann-Whitney U test for analysis of data
from 2 groups
Kruskal-Wallis test for analysis of data from
H0
2006
Null hypothesis
study hypothesis
investigator conducting a study usually has a
theory in mind
however, very difficult to prove the hypothesis
null hypothesis
differences observed is not due to exposure to
it is termed null
Types of research
Longitudinal studies
Cross-sectional studies
2006
Types of research - longitudinal study
investigates a process over time
the effect of external factors on human
subjects
3 types
clinical trial, a cohort study, case-control study
randomised disease severity
studies
retrospective studies
according to
outcome
determine
“exposure”
Randomised clinical trial
randomisation
is a procedure in which the play of chance
Assessment
Test
Control
Assessment
Assessment
Test Control
Control Test
problem of bias selection already occurring -
those who did not receive transplants may be
more ill or may not have satisfied the criteria
survival of patients who received heart
prospective study
often termed observational studies, since they
disease Exposed
Without disease
Without
disease
With disease
Not
exposed
Without disease
Time
Problems with cohort study
exposure to factor may be by natural selection or
up to the individuals’ decision
bias may influence the measure of interest
measure of interest
e.g. cohort study of cardiovascular risk in men
sterilised by vasectomy
e.g. incidence of breast cancer with
consumption of alcohol
Problems with cohort study
size of the study
the required size of a cohort study depends not
when cohort is made up of employed
individuals, the risk of dying in the first few
years of follow up is less than in general
population, this is known as healthy worker
effect; people who are sick are less likely to
be employed
incomplete representation
e.g. people who respond (to questionnaires)
and people who are lost to follow up
Case-control studies
case-control study
also known as a case-reference study or
retrospective study
starts with identification of persons with the
Exposed
Subjects with
disease (case)
Not exposed
Exposed
Subjects without
disease (control)
Not exposed
Time
(Retrospective)
Case-control studies
designs
matched design
control subjects can be chosen to match
individual cases for certain important
variables such as age, gender and weight
unmatched design
diseased population
Case-control studies
selection of controls
not required that the control group are alike in
to ensure that the sample is representative
of general population in say, age, gender and
social class structure
not recommended in medical research
only subjects who are available to the
interviewer can be questioned
Cross-sectional studies
problems
bias in the type of responders and non-
responders
exposures have to be determined by a
retrospective history
Calculation of sample size
2006
Calculation of sample size
consider
control group response
significance level
power
Control group response
it is first necessary to postulate the response of
the control group patients
denoted by π1, to distinguish it from the value that
will be obtained from the trial, denoted p1
experience of other studies may provide π1
The anticipated benefit
it is also necessary to postulate the size of the
anticipated response in treatment group patients,
denoted by π2, to distinguish it from the value
that will be obtained from the trial, denoted p2
anticipated benefit δ = π2 - π1
Type I error
the error of incorrectly rejecting the null
hypothesis when it (the hypothesis) is true
the error of concluding that the differences
variation
alternate hypothesis states that the difference is
real, further that it is due to some specific factor,
where no direction of change is specified both
μ=0
1.96σ
z
Significance level
the significance level, α, is the probability of
making a Type I error and is set before the test
is carried out
in most cases, it will be two-sided
Significance level
denoted by the letter P, represents the
probability of the observed value being due solely
to chance variation
can be interpreted as the probability of obtaining
the observed difference, or one more extreme, if
the null hypothesis is true
the smaller the value of P, the less likely the
variation is to be due to chance and the stronger
the evidence for rejecting the null hypothesis
Significance level
most scientific work, by accepted convention,
rejects the null hypothesis at P < 0.05
probability of the observed value being due
z0.05 = 1.96
zα is the value along the axis of a Normal
distribution
thus
0.025 is to the right of z = +1.96
Type II error
error in incorrectly accepting the null hypothesis
of no difference between treatments, when it
(the hypothesis) is in fact false (and should be
rejected)
accepting the null hypothesis when there should
unusual data sample which does not support a
difference
statistical methods can produce incorrect
conclusions
too small a sample size
the smaller n, the greater must be the real
difference before statistical difference may
be shown
usually set at a value of β = 0.2 or 20%
Power
the probability that a study can predict a
difference, when a real difference actually exists,
is termed the statistical power of the study
it is the probability of rejecting the null
hypothesis when it (the hypothesis) is false
first decide how much false negative or type II
error (β) rate is reasonable
power equals 1-β
δ = π2 - π1
e.g. response rate to placebo and treatment
drug
for Χ2 test, the number in each group should be
at least
m = (zα + z2β)2{π1(1-π1) + π2(1-π2)}
δ2
Calculation of sample size
comparison of means (unpaired data)
wish to detect a difference in means
δ = μ2 - μ1
the number in each group should be at least
m = 2(zα + z2β)2 σ2
δ2
where σ is presumed to be the same with both
drugs
Calculation of sample size
comparison of means (paired data)
with cross over trial
2006
Methods of randomisation
simple randomisation
either manually by tossing coin or throwing a
2006
Types of data
qualitative data (varying by description)
nominal data
data
numerical or quantitative data (varying by number)
numerical discrete data (differ only by fixed
amount)
numerical continuous data (differ by any
amount)
Qualitative data
nominal data
data that one can name or describe
not measured but simply counted
expressed in number (or frequency) or
2 groups - gender of patients, did or did not
get exposure to factor
> 2 groups - blood groups, racial groups,
anaesthetists, surgeons
Qualitative data
ordered categorical or ranked data
if there are more than two categories of
- x xxx
xx
Postoperative
bleeding (ml)
- xx xxxx
xx xxx
xxx
- xxxx xx
xxx
x xx
- xxx
x
A(1) B(2) C(3)
Vascular surgeon
Quantitative data
numerical discrete
data consists of counts
number of general anaesthetics and regional
anaesthetics performed in this hospital in a
year
number of anaesthetic trainees passing the
interval scale,
position of zero is arbitrary, a difference
ordinal
mild | moderate | severe
interval
meaningful distance between values
ratio
allows study of absolute magnitude
Summarising data
2006
Summarising data for the study
measurement of location or central tendency
mean or arithmetic average
interval or ratio of a quantitative variable
median and quartiles
mode
standard deviation
measures of symmetry
Measurement of location or
central tendency
2006
Mean or average
the mean (x, pronounced xbar) or average of n
observations is the sum of the observations, Σx
divided by their number, n (arithmetic average)
sum of all sample values Σx
x= =
size of sample n
advantage
mean uses all the data values, is statistically
efficient
disadvantage
vulnerable to outliers,
measurements fall
for odd number of observations, is the
disadvantage
not statistically efficient as it does not make
10,13,20,20,22,22,23,24,25,25,27,28,30,30,30,
31,31,32,32,33,34,35,35,36,37,38,38,39,39,41,4
1,41,42,42,43,43,44,44,46,47,48,50,50,51,52,54
2 1 03
10 2 0022345578
17 3 00011223455678899
12 4 111223344678
5 5 00124
46
Percentile
25th percentile
the value above which 75 percent of the
2006
Measure of dispersion or variability
range
difference between minimum & maximum values
interquartile range
(largest value 3rd quartile) - (largest value 1st
quartile)
standard deviation
degrees of freedom
Range or interquartile range
range
is given as the smallest and the largest
observations
vulnerable to outliers
interquartile range
the distance between the 25th and 75th
percentile -
Postoperative
bleeding (ml)
not vulnerable to outliers -
-
displayed as box-whisker plots -
-
-
A B C
Surgeon
Standard deviation, s
a measure of dispersion, i.e. how far variables are
away from their mean, often abbreviated as SD
expressed in the same units of measurement as
the observations
Σ(x-x)
√
2
s=
n-1
the value Σ(x-x)2 is interpreted as
from each x value subtract the mean x,
be large
variance
a measure of the dispersion of values about the
mean
the square of the standard deviation
s2 = Σ(x-x)2
sample mean
c.v. = s × 100%
x
Standard deviation, s
for data with a normal distribution, there is
a 68.26% chance that the actual value will be
skewed distribution
a distribution is skewed to the right (left) if
range
mean and standard deviation are sensitive to
the skewness
Skewness
an index of the degree to which a distribution is
not symmetric, or to which the tail of the
distribution is skewed or extends to the left or
right
calculation of skewness, sk
sk = 3(mean-median)
SD
normality can be confirmed if mean and median
are close
skewness is used, along with the kurtosis statistic,
to assess if a variable is normally distributed
Skewness
the normal distribution is
symmetric, and has a
skewness value of zero
a distribution with a
→ +1
significant positive
skewness has a long right
tail
a distribution with a
-1← significant negative
skewness has a long left
tail
Kurtosis
a measure of the extent to which observations are
clustered in the tails
kurtosis can be used, along with the skewness
statistic, to assess whether a variable is normally
distributed
for samples from a normal distribution, the values
of kurtosis will fluctuate around 0
Kurtosis
for a normal distribution,
the value of the kurtosis
statistic is 0
if a variable has a negative
kurtosis, its distribution
→ -ve -ve ←
has lighter tails than a
normal distribution
if a variable has a positive
kurtosis, a larger
+ve ← → +ve proportion of cases fall into
the tails of the distribution
than into those of a normal
distribution
Normal probability plot
normality of
observation can be
confirmed from the
Observation
x
x
Normal probability plot x
x
x
x
x
x
x
x
x
Normal ordinates, z
Normality
bell-shaped distribution of a continuous variable
symmetrical about its mean
median and mean will be close
the data
both give useful information
2006
Populations and samples
a population is a theoretical concept used to
describe an entire group (any collection of people,
objects, events, or observations)
this is usually too large and cumbersome to
error
the greater the variability of the observations,
size increases
result in bias or systematic distortion of the
results
Central Limit Theorem
definition: even when the variable is not normally
distributed the sample mean will tend to be
normally distributed
if random samples of n measurements are
repeatedly drawn from a population with a finite
mean μ, and a standard deviation σ, then when n is
large, the relative frequency histogram for the
(repeated) sample means will tend to be
distributed normally
Normal distribution
2006
Normal distribution
bell-shaped distribution of a continuous variable
symmetrical about its mean μ
described by
σ
population mean, μ
Frequency
μ=0
95%
0 1.96σ
Scoring of fine motor skills
impaired Frequency
severely borderline
impaired
average above average
+1 SD
-1.5 SD
+2 SD
-1 SD
-2 SD
68.26%
IQ Curve
Normal distribution
mathematical property of the Normal distribution
68.26% of the distribution lies between μ ± 1σ
95%)
99% of the distribution lies between μ ± 3σ
μ – 2.58 × σ and μ + 2.58 × σ
1-α
Normal distribution
in practice, the parameters μ and σ must be estimated
from the sample data
for this purpose, a random sample from the
estimated
SE(x) or SD(x) = SD(x)/√n or s/√n
Exercise: Calculation of SE
Mean (x) alanine aminopeptidase value for 25
subjects is 1.0 U; SD is 0.3 U
SE = SD/√n
= 0.3/√25
= 0.3/5
= 0.06
Comparing s and SE
standard deviation, s standard error of mean
is a measure of the is a measure of the
variability between uncertainty in the
individuals with sample statistic,
respect to the
always refers to an
measurement under
consideration estimate of a
describes the sample population parameter
the larger the sample
confidence interval
Confidence interval for a mean
define a range of values within which the
population mean μ is likely to lie
that is, a range of values that is likely to cover
and σ, respectively
sample size is already taken into account in the
2006
t distribution
t distribution with (n–1) degrees of freedom
introduced by WS Gossett, who used the pen-
2006
Non-parametric tests Use Parametric tests
Wilcoxon signed rank Test of Paired t test
test difference
between paired observations
Wilcoxon rank sum test Comparison of 2
Mann-Whitney U test groups 2-sample t test
Kruskal-Wallis one-way Comparison of
analysis of variance several groups One-way analysis of
Spearman rank Measure of variance
correlation association between 2 Pearson correlation
variables
Χ2 goodness of fit test Comparison of an
observed
frequency
distribution with a theoretical one
Tests statistics
a statistic derived from sample data, used to
measure the difference between the observed
data and what would be expected under the null
hypothesis:
z-statistics
t-statistics
freedom
Χ2-statistics
distributed
commonly the population σ is unknown,
becomes,
SE(x) = (s/√n)
under these circumstances the z-test can again
distribution
Small samples
if n < 30
the sample standard deviation, s, is not an
freedom
as for the normal distribution, these are
symmetrical with a mean μ = 0
Paired samples t-test
attributes and demographic data, disease
condition are matched to make two groups of
subjects as similar as possible
the two groups
can be two groups of subjects in a matched
case-control study
can be of the same subjects observed before
analysis of variance
2006
Analysis of variance
the t-test is generalised to more than 2 groups by
means of a technique termed analysis of variance
for this method, there are both between- and
within-groups degrees of freedom
the between-groups and within-groups degrees
for analysis
one-way analysis of variance
two-way analysis of variance
One-way analysis of variance
used to compare the means of several groups
one-way analysis of variance is used when the
subgroups to be compared are defined by just one
factor
e.g. comparison of means between different
non-parametric tests
2006
Non-parametric tests
non-parametric statistical tests are for analysing
numerical data that make no assumption about the
underlying normality of distribution
particularly useful when there is obvious non-
normality in a small data set which cannot be
corrected with a suitable transformation
Wilcoxon signed rank test
a non-parametric statistical test equivalent of the
paired t test, analysing differences between
paired observations
it makes no assumptions about the shapes of the
distribution of the two variables
the absolute values of the differences between
the two variables are calculated as (+/-) for each
case and ranked from smallest to largest
the test statistic is based on the sums of ranks
for negative and positive differences
Wilcoxon signed (+/-) rank test
procedure
the absolute values of the differences between the
sample size
compare this sum with the critical ranges in the
vomiting score 5-
Postoperative
nausea and
4-
3-
2-
1- A B C
Groups
Binomial distribution
2006
Binomial distribution
data (discrete variables) which can take only 0 or
1 response, such as treatment failure or
treatment success, follow the binomial
distribution providing the underlying population
response rate does not change
proportion (p) of respondents (to treatment) in
sample
p = R/n , R is the number of respondents and n
expressed by SD(R)
Binomial distribution
p can be used to estimate the population response
rate, π
number of successful response in the population
can be estimated to be nπ
standard error of p is SE(p) = √(pq/n) where q =
1 - p, is the proportion of unsuccessful responders
95% CI for π is p + 1.96 × SE(p)
The Chi-squared test for
contingency tables
2006
Contingency table
expected frequencies yes or no outcome
condition or
intervention A B
C (xC × yA)/n (xC × yB)/n xC
D (xD × yA)/n (xD × yB)/n xD
yA yB Total=n
Contingency table
can be used to represent
qualitative variables
total N
the Χ2 test is calculated from
, d.f.=21 for(O-E)
a 2 ×2 2 table
Χ =Σ E
the Χ2 test is valid (Cochran 1954) when n is > 40,
regardless of the expected values, and if less than
20% of the expected values are less than 5 and
none are less than 1
when n is between 20 and 40, the test is only valid
if all the expected values are at least 5
Yates’ correction for continuity
Factor A
Present Absent Total
Factor B Present a c m
Absent b d n
Total r s N
Yates’ correction for continuity
for converting the discrete data, as Χ2
N(|ad-bc|-½N)2
Χc =
2
mnrs
Yates’ correction for continuity
the use of the continuity correction is always
advisable although it has most effect when the
expected numbers are small
when the numbers are very small, the Χ2 test is not
a good enough approximation even with a
continuity correction and the Fisher’s exact test
for a 2× 2 table should be used
Fisher’s exact test for a 2 × 2 table
Factor A
Present Absent Total
Factor B Present a c m
Absent b d n
Total r s N
Fisher’s exact test to determine P value
to be used if any expected counts in a 2 × 2
is m!n!r!s!
N!a!b!c!d!
Larger tables
chi-squared test can also be applied to larger
tables, generally called r× c tables
r denotes rows and c denotes columns
(O-E) 2
, d.f.= (r-1)× (c-1)
Χ2 = Σ E
there is no continuity correction or exact test for
contingency tables larger than 2× 2
chi-squared test should not be applied to tables
showing only proportions or percentages
Goodness of fit
apart from using contingency tables, the Χ2-
statistic can be used to see if any observed set of
data follows a particular distribution
e.g. by calculating the expected frequencies
control studies
Correlation and linear
regression
2006
Correlation and linear regression
techniques for dealing with the relationship between 2
or more continuous variables
correlation
examines linear association between 2 variables
variable
strength of the association summarised by the
correlation coefficient
regression
examines dependence of one variable, the dependent
original observations
Spearman rank correlation coefficient
when it is calculated from the ranks of the data
Correlation coefficient, r
dimensionless quantity ranging from -1 to +1
a positive correlation is one in which both
y r=0 y r = -0.4
• •• • •
• • ••• • • •• ••
• • • •
• • •
•
x x
no correlation weak negative
Correlation coefficient, r
should not be used
if the relationship is non-linear
determined in advance
should be used with caution
in the presence of outliers
distribution
Multiple regression
95% confidence intervals
b ± t SE(b)
0.05
if the interval includes zero, the conclusion
should be that that relationship between y and
x remains the same whether or not x changes
Degrees of freedom
2006
Degrees of freedom
the number of degrees of freedom depends on 2
factors
the number of groups we wish to compare
minus 1
for unpaired data, df = n + n minus 2, that is
1 2
n –1 for each group
Degrees of freedom
for linear regression
given n independent pairs of observations, 2
parameters
Diagnostic tests and
implications
Predictions
2006
Sensitivity
what is the probability that the test result will be
positive when a disease is present? that is when
the test result is positive, how accurate is it in
relation to overall presence of the disease?
presence of acute myocardial infarction and
laryngoscopy
Prediction model
Occurrence of event/disease
Prediction by test present (D+) absent (D-) Total
positive test (T+) a b a+b
negative test (T-) c d c+d
Total e f g
as fraction or percentage:
prevalence of disease = e/g or P(D+)
sensitivity of a test = a/e or P(T+ given D+)
specificity of a test = d/f or P(T- given D-)
false negative rate = 1- sensitivity = c/e (Type II
error)
false positive rate = 1- specificity = b/f (Type I error)
Sensitivity and specificity
both are useful statistics because they will yield
consistent results for the diagnostic test in a
variety of patient groups with different disease
prevalences
important point
sensitivity and specificity are characteristics of
2006
Measure of Abbrevn Description No effect Total
effect success
Absolute risk ARR Absolute change in risk: the risk of an ARR=0% ARR=initial
reduction event in the control group minus the risk risk
of an event in the treated group; usually
expressed as a percentage
Relative risk RRR Proportion of the risk removed by RRR=0% RRR=100%
reduction treatment: the absolute risk reduction
divided by the initial risk in the control
group; usually expressed as a percentage
Relative risk RR The risk of an event in the treated group RR=1 or RR=0
divided by the risk of an event in the RR=100%
control group; usually expressed as a
decimal proportion, sometimes as a
percentage
Odds ratio OR Odds of an event in the treated group OR=1 OR=0
divided by the odds of an event in the
control group; usually expressed as a
decimal proportion
Number NNT Number of patients who need to be NNT=∝ NNT= 1/
needed to treat treated to prevent one event; this is the initial
reciprocal of the absolute risk reduction risk
(when expressed as a decimal fraction);
it is usually rounded to a whole number
Odds
given that a subject has or does not have a
disease,
odds always measures the incidence of event
= ad/bc
Odds Ratio 95% CI Interpretation
1 includes 1 No association
>1 does not Positive association between
include 1 exposure and outcome at the 5%
significance level (the odds of
exposure is greater in cases
Than in controls)
Negative association between
<1 does not
exposure and outcome at the 5%
include 1 significance level (the odds of
exposure is smaller in cases than
controls)
Association of exposure and
includes 1 outcome is not proven by the
study at the 5% significance
level
Risk
+ve cases –ve cases total
exposed a b a+b
not exposed c d c+d
total e f g
risk of an event is the probability that an event
will occur within a stated period of time
the risk of developing the disease within the
follow-up time is
a/(a+b) for the exposed population
http://ptwww.cchs.usyd.edu.au/Pedro/CIcalculato
r.xls.
Confidence interval around relative
risk reduction
n=1000/gp
RRR 0f 25%
n=100/gp
-38 9 41 59
-50 -25 0 25 50
2006
Number needed to treat …
the number of patients who must
receive an intervention of therapy
during a specific period of time
to prevent
one adverse outcome or
produce one positive outcome
Number needed to treat …
Risk of perioperative AMI NNT
without β-blocker with β-blocker* (1/ARR)
(ARR)
40 year old man 2% 1.8%
1/0.002
(0.2% or 0.002) = 500
70 year old man 40% 36% 1/0.04
(4% or 0.04) = 25
*assuming 10% relative risk reduction (RRR) with
preoperative administration of β-blocker
ARR = absolute risk reduction
NNT = number needed to treat
Number needed to treat …
if there is a higher probability that a patient will
experience an adverse outcome if we do not treat,
the more likely the patient will benefit from
treatment and
the fewer such patients we need to treat to