Professional Documents
Culture Documents
Machine Learning For Healthcare Data
Machine Learning For Healthcare Data
Healthcare Data
Katherine A. Heller
Duke University
Outline
Electronic Health Records
Gaussian Process-based Models for:
Chronic Kidney Disease
Sepsis
Mobile apps
Graph-coupled HMMs for Predicting the Spread of Influenza
MS Mosaic
Chronic Kidney Disease
eGFR
(eGFR)
ED Visit
Hospital Admission
Acute MI
Death
Age
Age 47
eGFR
Untreated diabetes &
1st Nephrology Visit
high blood pressure.
Nephrology Visit
Death
eGFR
Nephrology Visit
PCP Visit
ED Visit
Hospital Admission
Age 49
Acute MI
Kidney function now
50% Death
eGFR
Nephrology Visit
PCP Visit
ED Visit
Hospital Admission
Acute MI
Death
Age 51
Referred to
kidney specialist
eGFR
Dialysis Begins
Nephrology Visit
PCP Visit
ED Visit
Hospital Admission
Acute MI
Death
Dialysis Begins
Nephrology Visit
PCP Visit
ED Visit
Hospital Admission
Acute MI
Death
eGFR
Nephrology Visit
failure Acute MI
Death
42%
starting dialysis have
no prior nephrology care
1
1
<10% with moderate CKD
<50% with severe CKD
even aware of illness!
12
Model for a single
trajectory
Population effect
Latent subpopulation
curve
Individual long-term
deviations
Individual transient
Curves
per
deviations (GP)
subtype
1
4
Experimental Setup
6 variables of interest: eGFR, 5 other labs relevant
to CKD
Cohort of 44,000 patients at Duke with at least
moderate stage CKD (Stage 3+) and 5+
measurements for eGFR
For each test patient: use data before t to predict
future labs
Evaluation for each lab:
average MAE across test patients, in future time windows
Baseline: [Schulam & Saria, 2015] trained independently
Quantitative Results
Chronic Kidney Disease (CKD)
1
Proposed Joint Model
Goal: Jointly model risk of future loss of kidney
function and cardiac events.
piecewise constant
baseline rate
coefficient baseline association between random effect
vector covariates event risk and (frailty term):
expected mean/slope
of eGFR
Data
23,450 patients with moderate stage CKD and 10+ eGFR
readings
CKD definition: 2 eGFR readings < 60mL/min, separated by 90+ days
Early Warning
Scoresin
widespread use
Duke uses
NEWS
Overly
simplistic
8 medication classes
Speeding up:
Stochastic
Gradient
Nose-Hoover
Thermostats
Sampling
Pre-operative
Framework Preoperative
Surgery Assessment: Preoperati
Scheduled Phone/PAT/POET/ ve Care
POSH
Standard
Standard
Data Phone
Phone Care
Care
Machine
Machine LOW Screen
pulls Screen Machine
Machine
Learning
Learning
every 24 Learning
Learning
RISK
RISK INTERMEDI
hours
EPIC PAT*
PAT* RISK
RISK
PREDICTIO
PREDICTIO ATE
Clarit PREDICTION
PREDICTION
NN
y POET
POET (MODEL 2)
(MODEL 2)
(MODEL 1)
(MODEL 1) HIGH **
** Optimizatio
Optimizatio
nn
T: Pre-operative Anesthesia Testing POSH
POSH Intervention
Intervention
ET: Peri-Operative Enhancement Team ***
*** ss
SH: Perioperative Optimization of Senior Health
Infectious Disease
Infection in a Social
Network
Goal: To model dynamical interactions between
agents in a social network and apply to inferring
the spread of infection.
Chaotic symptoms
Embedded Consent
Five Activities
MS Mosaic App
Weekly tasks no
more than 5
minutes
Initial Analyses
Develop a sparse logistic regression model for
predicting the likelihood of each symptom
experience
Incorporate a hierarchical layer based on Gaussian
processes for modeling time series data (e.g. sleep)
Discover hidden subpopulations within symptoms
(using clustering methodology, such as Dirichlet
Process mixture models)
Evaluate the efficacy of symptom interventions
using longitudinal models and clinical trials
Planned Dataset Evolution
MRI Sub-study
(Duke Pilot)
Symptom sub-
study “Omics” Sub-study
v1.0 - 4.0 (Duke Pilot)
Road-mapped
Sepsis
CKD
Thanks!
Surgery Influenza