Professional Documents
Culture Documents
Survival Analyisis
Survival Analyisis
•Broadband Internet
Censoring Event
Study began
Study end
Event
C ._______________________________...............
D
._____________________________.......
▪A Is left censored.
▪B Is fully observed.
▪C Is right censored because the observation is lost to study.
This type of right censoring does not cause any problems if
the censoring is random
▪D is right censored because the observation period ends before
the event has occurred. This type of censoring does not cause any
problems for the analysis.
• This represents the instantaneous death rate, that is, the probability that an
individual experiences the event of interest at a time point given that the
event has not yet occurred.
✓To carry out calculation, we first set out for each year (X) :
➢ Number alive at start = nx
➢ Number withdrawn during year= wx Formulas for LT
➢ Number dying = dx 1. rx=nx-½wx,
➢ Number at risk = rx
2. qx=dx/rx,
3. px=1-qx,
➢ Prob. of death = qx
4. Px=pxPx-1
➢ Prob. of surviving =( px )
➢ Cumulative prob. of surviving = ( Px )
Year Number Withdrawn At risk Deaths Prob. of death Prob. of Cumulative prob.
at start during year ( rx ) ( dx ) ( qx ) surviving year X of surviving x
(x) ( nx ) ( wx ) ( px ) years ( Px )
1 20 2 1
2 17 2 0
3 15 0 1
4 14 0 0
5 14 1 0
6 13 1 0
7 12 1 2
8 9 0 1
9 8 1 0
10 7 0 2
11 5 2 0
12 3 0 1
13 2 0 0
14 2 0 0
15 2 0 1
16 1 0 0
17 1 0 0
18 1 1 0
Year Number Withdrawn At risk Deaths Prob. of death Prob. of Cumulative prob.
at start during year ( rx ) ( dx ) ( qx ) surviving year X of surviving x
(x) ( nx ) ( wx ) ( px ) years ( Px )
•The median survival time (call it τ ) is just the time where 50% of
the observations have experienced the event
•That means median survival time is the time where S(τ ) = 0.5
•In this case, the estimated median survival is the smallest time τ such
that: S(τ ) < 0.5
Year Number Withdrawn At risk Deaths Prob. of death Prob. of Cumulative prob.
at start during year ( rx ) ( dx ) ( qx ) surviving year X of surviving x
(x) ( nx ) ( wx ) ( px ) years ( Px )
• Death of infants within their first year is an important problem with these
data.
• We will therefore analyze the time from birth to death (in days).
• For infants still alive when these data were collected, time is the time from
birth to the time of data collection.
• The variable event is an indicator for whether time refers to death (1) or end of
study (0).
9/8/2022 BY BIRUK.S (BSC., MPH) 29
Data set- infant data (cont’d)
• Possible explanatory variables for time-to-death could be place of residence and
sex of infants, among others.
• These data can be described as survival data.
• Duration or survival data can generally not be analyzed by conventional
methods such as linear regression.
• The main reason for this is that some durations are usually right-censored.
• That is, the endpoint of interest has not occurred during the period of
observation.
• Another reason is that survival times tend to have positively skewed
distributions.
•KM estimator helps us to find S(t) when there are censored data
S(t) = p1p2p3…pt
✓Mainly descriptive
• The log rank test is the most well known and widely used.
Interpretation;
• Underweight infants are nearly 2.79 times more likely to die at any given time as
compared to that of normal.
• The hazard of death is increased by 2.79 times among underweight infants as compared
to that of normal.
Interpretation;
1. Keeping other variable constant, the hazard of death is
decreased by 14.2% among female infants as compared to
that of male infants.
2. Per one person increase in family size, the hazard of death is
decreased by 11.2% while holding other variable constant.
Methods
1. Hazard Plot (Log–log plot of survival)
2. Correlation Test using (test using Schoenfeld residuals)
0 2 4 6
ln(analysis time)
3. Construct the KM survival curve and Fit cox regression for the
variables martial status length (by creating categories(30 to 40, 41
to 50 and greater than 51) and catgravidity