Professional Documents
Culture Documents
com
Survival Analysis
A Brief Introduction
More Statistics tutorial at www.dumblittledoctor.com
2
More Statistics tutorial at www.dumblittledoctor.com
⎝ 0 ⎠
◦ t: a specific point in time. 3
More Statistics tutorial at www.dumblittledoctor.com
Hazard Function/Rate
y Hazard Function λ(t): instantaneous
failure rate at time t given that the
subject has survived upto time t.
P(t ≤ T < t + δ | T ≥ t ) P(t ≤ T < t + δ )
λ ( t ) = limδ →0+ =
δ P(T ≥ t ) × δ
S (t ) − S (t + δ ) 1 f (t )
= limδ →0+ × =
δ S (t ) S ( t )
d
f (t ) = F (t )
dt
y Here f(t) is the probability density
function of the survival time T. That
is, F ( t ) = 1 − S ( t ) = P (T ≤ t )
y where F(t) is the cumulative 4
di ib i i
More Statistics tutorial at www.dumblittledoctor.com
Key Assumption:
Independent Censoring
y Those still at risk at time t in
the study are a random sample of
the population at risk at time t,
for all t
y This assumption
p means that the
hazard function, λ(t), can be
estimated in a fair/unbiased/valid
/ /
way
6
More Statistics tutorial at www.dumblittledoctor.com
3A. Kaplan-Meier (Product-
Limit) Estimator of the
Survival Curve
y The Kaplan–Meier estimator is the
nonparametric maximum likelihood
estimate of S(t). It is a product of
th fformr − d
the r −d r −d
Sˆ (t ) = 1 1
× 2 2
× ... × i i
r1 r2 ri
rk
y is
t k the number of subjects alive
djust
k
before time tk
y d
denotes
t the
th number
b whoh died
di d att time
ti
7
More Statistics tutorial at www.dumblittledoctor.com
0 5 10 15 20
Survival Time
9
More Statistics tutorial at www.dumblittledoctor.com
0 75
0.75
High
Survival
Distribution 0.50
Function
0.25
Low
Medium
0.00
0 10 20 30 40 50 60
Time
11
More Statistics tutorial at www.dumblittledoctor.com
Log--Rank Test
Log
12
More Statistics tutorial at www.dumblittledoctor.com
Wilcoxon Test
13
More Statistics tutorial at www.dumblittledoctor.com
14
More Statistics tutorial at www.dumblittledoctor.com
4. Two Parametric
Distributions
y Here we p present two most notable
models for the distribution of T.
y Exponential distribution:λ (t ) = λ
y Weibull distribution:
λ (t ) = λp(λt ) = pλ × t
p −1 p p −1
16
More Statistics tutorial at www.dumblittledoctor.com
5 Regression Models
5.
y The Exponential
p and the Weibull
distribution inspired two
pparametric regression
g
approaches:
1. Parametric proportional hazard
model – this model can be
generalized to a semi-parametric
semi parametric
model: the Cox proportional
hazard model
2. Accelerated failure time model
17
More Statistics tutorial at www.dumblittledoctor.com
An Example
y In a recent p
paper
p (Reisberg
g et
al., 2010), we applied both
g
regression models to a dementia
study conducted at NYU:
Λ(T) = Λ0 (T)exp(α1 *Group +α2 * Age +α3 *Gender +α4 * Education +α5 * FollowUp)
22
More Statistics tutorial at www.dumblittledoctor.com
23
More Statistics tutorial at www.dumblittledoctor.com
24
More Statistics tutorial at www.dumblittledoctor.com
Parametric versus
Nonparametric Models
25
More Statistics tutorial at www.dumblittledoctor.com
Parametric versus
Nonparametric Models
Properties
p of nonparametric
p models are
• the distribution of survival time is unknown
• the hazard function is unspecified.
unspecified
An example is the Cox proportional hazards
model.
26
More Statistics tutorial at www.dumblittledoctor.com
...
Cox Proportional Hazards
Model
{ β1 X i 1 +...+ β k X ik }
hi (t ) = h0 (t )e
β = 0 → hazard ratio = 1
Two ggroups
p have the same
survival experience
27
More Statistics tutorial at www.dumblittledoctor.com
28
Partial Likelihood
29
More Statistics tutorial at www.dumblittledoctor.com
Partial Likelihood
30
More Statistics tutorial at www.dumblittledoctor.com
Partial Likelihood
hc (2)
Lc =
hc (2) + hb (2) + ha (2) + hd (2) + he (2)
hb (3)
Lb =
hb (3) + ha (3) + hd (3) + he (3)
hd (5)
Ld =
hd (5) + he (5)
31
More Statistics tutorial at www.dumblittledoctor.com
Partial Likelihood
hd (5)
Ld =
hd (5) + he (5)
β1 X d 1 +β 2 X d 2 + .... + β k X dk
ho (5)e
Ld = β1 X d 1 +β 2 X d 2 + .... + β k X dk β1 X e1 +β 2 X e 2 + .... + β k X ek
ho (5)e + ho (5)e
β1 X d 1 +β 2 X d 2 + .... + β k X dk
e
Ld =
e β1 X d 1 +β 2 X d 2 + .... + β k X dk + e β1 X e1 +β 2 X e 2 + .... + β k X ek
32
More Statistics tutorial at www.dumblittledoctor.com
Partial Likelihood
y The overall likelihood is the
product of the individual
likelihood. That is:
L = Lc * Lb * Ld
33
More Statistics tutorial at www.dumblittledoctor.com
Proc LIFETEST
y The Kaplan
Kaplan-
p -Meier(K-M) survival
curves and related tests (Log-
(Log-
Rank, Wilcoxon) can be generated
g
using SAS PROC LIFETEST
Proc PHREG
y The Cox (proportional
p p hazards)
regression is performed using SAS
PROC PHREG
36
More Statistics tutorial at www.dumblittledoctor.com
Proc LIFEREG
y The accelerated failure time
regression is performed using SAS
PROC LIFEREG
Selected References
y PD Allison (1995). Survival
Analysis Using SAS: A Practical
g
Guide. SAS Publishing.
y JD Kalbfleisch and RL Prentice
(2002).The Statistical Analysis
of Failure Time Data. Wiley-
Interscience.
38