Professional Documents
Culture Documents
MEDICINE
LECTURE DELIVERED AT
THE 2008 WACP. FACULTY OF FAMILY MEDICINE
PART I REVISION COURSE.
BY
DR F A OLANIYAN
MBBS Ib., MSc. Epid. & Med Stat. Ib., FWACP, FMCGP
UNIVERSITY COLLEGE HOSPITAL IBADAN
INTRODUCTION
True diagnosis.
We can never be absolutely sure that the ‘true’ diagnosis
is correct.
We decide to accept one method as ‘true’: this is the gold
standard or reference standard.
It is often more invasive than the test, e.g. histopathology
compared to ultrasound image.
It is always possible that the reference standard is wrong
for some subjects.
Statistics of diagnostic test studies
-Sensitivity
-Specificity
-False positive rate
-False negative rate
-Likelihood ratio (LR) for positive test
-Likelihood ratio (LR) for negative test
- Positive predictive value (PPV)
-Negative predictive value (NPV)
-Receiver operating characteristic (ROC) curve
Kappa when responses are binary
TYPES OF DIAGNOSTIC TEST
Diagnostic Test Studies
Two designs
Prospective or cohort design, or cross-sectional
design:
take a sample of subject eligible for the test, test them
all and get true diagnosis on them all.
Retrospective or case-control design:
take a sample with true diagnosis established as
positive and another sample of controls. We may
have negative diagnosis established on controls and
we may not.
TWO GAUSSIAN CURVES DESCRIBING
THE DISTRIBUTION OF A TEST RESULT
Statistical Vs Systematic ERRORS
Statistical error: the diff btw measured value
and the true value that is caused by random,
and inherently unpredictable fluctuations in
the measurement apparatus.
Systematic error: the dif btw a measured
value and the true value that is caused by
non-random fluctuations from an unknown
source and which, once identified, can usually
be eliminated.
TYPES OF ERRORS
STATISTICAL ERROR: Type I and Type II
Type I error, also known as an "error of the
Actual condition
Innocent Not innocent
False Positive
Judged (i.e. guilty but
True Positive
'innocent' not caught)
Test Type I error
result False Negative
Judged (i.e. innocent
True Negative
'not innocent' but condemned)
Type II error
MEDICAL CONDITION
Actual condition
Present Absent
Condition absent + Positive result
Condition Present + Positive result
Positive = False Positive
= True Positive
Type I error
Test
result Condition present + Negative
Condition absent + Negative
result
Negative result
= False (invalid) Negative
= True (accurate) Negative
Type II error
DISEASE
Actual condition
Infected Not infected
False Positive
Test shows (i.e. infection reported
True Positive
'infected' but not present)
Test Type I error
result False Negative
Test shows (i.e. infection
'not infected' not detected) True Negative
Type II error
BASIC CONCEPTS
A test is useful (informative) if
A test is useful (informative) if
SENS + SPEC > 1
SENS + SPEC > 1
FALSE POSITIVE
The false positive rate is the proportion of negative instances that were erroneously
reported as being positive.
It is equal to 1 minus the specificity of the test. This is equivalent to saying it is equal to 1
minus the significance level.
In statistical hypothesis testing, this fraction is given the symbol α, and 1 − α is defined
as the specificity of the test.
A
A test
test is
is useful
useful (informative)
(informative)
PPV
PPV ++ NPVNPV >> 11
2 BY 2 TABLE
Example
100 children diagnosed with language
impairments (LI) and enrolled in language
intervention, and 100 same-age children with
no history of language impairment (LN), were
administered a new test of grammatical
morphology.
80 of the children with LI, and 30 of the
children with LN, scored in the disordered
range on the new measure.
Disorder Status (re: Gold Standard)
80 30
+ Disorder (LI)
a b
New Test
Result c d
-Disorder (LN) (20) (70)
LR- =1-sensitivity/specificity
=[1-{a/(a+c)}]/[d/(b+d)]
Calculating Likelihood Ratios
Sens = .80
Spec = .70
LR+ = sens/1-spec = .80/.30 = 2.67
LR- = 1-sens/spec = .20/.770 = 0.29
Several programs, some free on web, are set
up to allow entry in 2x2 table format
In addition to accuracy measures, they also
provide information on precision
RECEIVER OPERATIVE
CHARACTERISTIC CURVE
ROC analysis is part of a field called "Signal
Detection Theory" developed during World
War II for the analysis of radar images.
ROC analysis is now:
Sensitivity 186/190
Specificity 4753/4810
S
en
si
ti
vit
y 9
7.
9%
S
pe
ci
fi
cit
y 9
8.
8%
P
os
i
ti
vep
r
edi
cti
vev
al
ue 7
6.
5%
N
eg
at
ive
pr
edi
cti
vev
al
ue 9
9.
9%
L
i
kel
iho
odr
at
io+
te
st 8
2.
6
L
i
kel
iho
odr
at
io-t
est .
02
A
cc
ur
acy 9
8.
8%
QUESTION
A test to screen for a dx is performed on a population. 900
respondents have the dx. 920 in the population were
tested positive. The sensitivity of the test is 0.90 and
the specificity of the test is 0.80.
1. How many person do not have the dx?
2. What is FPR, FNR, PPV, NPV, LR+ & LR-?
Greenhalgh guidelines
1: Is this test potentially relevant to my practice?
2: Has the test been compared with a true gold standard?
3: Did this validation study include an appropriate spectrum
of subjects?
4: Has workup bias been avoided?(Was the reference standard group
originally identifiedbecause they were positive on the test?)
5: Has expectation bias been avoided?(I.e. was the reference standard blind to
the test?)
6: Was the test shown to be reproducible?
7: What are the features of the test as derived from this validation study?
8: Were confidence intervals given?
9: Has a sensible ‘normal range’ been derived? (Only relevant for continuous
test variables.)
10: Has this test been placed in the context of other potential tests in the
diagnostic sequence?
Other GL is QUADAS (Quality Assessment of Diagnostic
Accuracy Studies) tool
THANK YOU
FOR
LISTENING