Professional Documents
Culture Documents
SURVIVAL ANALYSIS
NEILY ZAKIYAH, PhD., Apt
SURVIVAL ANALYSIS
u To analyze data where the outcome variable is the time until the occurrence
of an event of interest (e.g. death, occurrence of a disease).
u TIME-TO-EVENT DATA
• Time from diagnosis of cancer to death due to cancer
• Time from HIV infection to AIDS
• Time from an asthma exacerbation until the next asthma exacerbation
u The time to event/survival time can be measured in days, weeks, years, etc.
SURVIVAL ANALYSIS
u TIME-TO-EVENT DATA
Event mortality
re-admission
SURVIVAL ANALYSIS
Objectives
u Estimate time-to-event for a group of individuals, e.g. time until
second heart-attack for a group of myocardial infarction patients.
u To compare time-to-event between two or more groups, e.g.
treated vs. placebo myocardial infarction patients in a randomized
controlled trial.
u To assess the relationship of co-variables to time-to-event, e.g.
does weight, insulin resistance, or cholesterol influence survival
time of myocardial infarction patients?
WHY USE SURVIVAL ANALYSIS?
u Why not compare mean time-to-event between the groups using linear
regression?
ignores censoring
u Why not compare proportion of events in the groups using risk/ adds
ratios or logistic regression?
ignores time
SURVIVAL ANALYSIS:
Terms
u Time-to-event: the time from entry into a study until a subject has a
particular outcome.
u Censoring: the time until the last known information for that
particular person.
For example
1. A person does not experience the event at last follow-up survey
2. A person is lost to follow-up/withdrawn during the study period
Two-variable outcome (dependent):
u Time variable: ti = time at last disease-free observation or time at
event
u Censoring variable: ci =1 if had the event; ci =0 no event by time ti
SURVIVAL ANALYSIS
Illustration
SURVIVAL AND HAZARD FUNCTION
u The survival and hazard functions are key concepts in survival analysis
for describing the distribution of event times.
u Survival function: the probability of surviving (or not experiencing the
event) up to the given time
Notation:
– T ≡ survival time of a randomly selected individual
– t ≡ a specific point in time.
– S(t) = P(T > t) ≡ Survival Function
– λ(t) ≡ instantaneous failure rate at time t : hazard function
u Hazard function: the potential that the event will occur, per time
unit, given that an individual has survived up to the specified time.
SURVIVAL FUNCTION (KAPLAN MEIER)
u The Kaplan – Meier (KM) estimator is the most widely used for estimating survival function
• product-limit estimator
• nonparametric maximum likelihood estimator.
• When there are no censored data, the KM estimator is simple estimator
u In the case of human longevity, Ti is unlikely to follow a normal distribution, because the
probability of death is not highest in the middle ages, but at the beginning and end of life.
u The survival function estimator is defined as
u S(t) = P r(T > t) = 1 − F(t).
u The hazard function estimator:
= 1 x 49/50 = 1 – 0.9800
= 0.9800 x 48/49 = 1 – 0.9860
= 0.9600 x 47/48 = 1 – 0.9460
Overall cumulative survival in RRT patients :
COMPARING SURVIVAL BETWEEN
GROUPS
(Log-Rank test)
u Non-parametric method, based on ranks
u No effect estimate, only test for differences
u No adjustment for confounders or interaction/ effect modification
u In application, it is a method for comparing the Kaplan-Meier curves
estimated for each group of subjects.
Cumulative survival in RRT patients (overall and by cause of ESRD)
h1 (t ) h0 (t )e bx1 b ( x1 - x2 )
HR = = = e
h2 (t ) h0 (t )e bx2