You are on page 1of 2

EDITORIALS

Machine Learning and Electronic Health Records:


A Paradigm Shift
Daniel E. Adkins, Ph.D.

In this issue of the Journal, Barack-Corren et al. (1) use for improved prediction accuracy in Barack-Corren et al.’s
machine learning methods to build a highly predictive model approach, such gains would likely come at the expense of
of suicidal behavior using longitudinal electronic health interpretability and inference. Thus, their selection of the
records (EHRs). They do so using a well-established probability- naive Bayesian classifier has the further, unintended merit
based machine learning algorithm, the naive Bayesian of providing an unusually lucid, accessible introduction to
classifier, to mine through approximately 1.7 million patient machine learning for many researchers and clinicians.
records, spanning 15 years (1998–2012), from two large Boston Another limitation, perhaps strategic on Barack-Corren
hospitals. After training the naive Bayesian classifier model on a et al.’s part, is the use of a limited set of standard ICD-9
randomly selected half of the data, the predictive ability of the codes and search terms as predictors, versus performing
model was assessed on the second half, yielding accurate natural language processing of the full semistructured
(35%249% sensitivity at 90%295% specificity) and, critically, data of the EHR. This analytical decision is a significant
early (3–4 years in advance on average) prediction of patients’ limitation, as it drastically reduces the analysis feature space
future suicidal behavior. In this, the authors benefitted from (i.e., the number of predictors considered), which gener-
access to a large and high-quality EHR database and chose an ally results in poorer prediction given data of this size (5, 6).
appropriate, and powerful, analytical method in the naive While the authors do not
Bayesian classifier. Furthermore, the research has clear clinical give a precise number of The emergence of machine
applications in the potential for early detection warnings via predictors used in their learning EHR prediction may
physician EHR notices. Beyond such specifics, the study has analysis, we can safely be seen as analogous to the
broader significance in its demonstration of how the atheoretical assume it is at least an
development of imaging,
machine learning approaches popular in Silicon Valley can suc- order of magnitude less
genetic, or any other new
cessfully mine clinical insights from an exponentially growing than what would be possi-
body of EHR data. It also hints toward a future in which machine ble using natural language source of highly informative
learning of big medical data may become a ubiquitous component processing techniques. medical data.
of clinical research and practice—a prospect that some are un- However, this again raises
comfortable with. the issue of model interpretability, as natural language pro-
While the pace at which machine learning applications cessing approaches may identify highly predictive features
diffuse into clinical research and practice remains to be that offer no clear interpretation or clinical significance (6).
seen, methodological development in the machine learning Contrast that opacity with Barack-Corren et al.’s list of the top
field continues to accelerate. And this suggests one pri- 100 predictors in their naive Bayesian classifier (see Table S2
mary limitation of the current study. That is, while the naive in the article’s online data supplement), which summarizes
Bayesian classifier is well-suited to the current application, it a wealth of clinical insight, and we again see the precision
is an older and remarkably simple method by machine learn- advantages of more sophisticated approaches counterbalanced
ing standards. Fundamentally, the naive Bayesian classifier is a by the interpretability of simpler models like Barack-Corren
direct application of Bayes’ theorem, simply calculating the et al.’s naive Bayesian classifier. This tradeoff is not specific to
product of the prior probability of the outcome of interest the current topic, instead it is a pervasive aspect of machine
(e.g., suicidal behavior) and the probabilities for each predic- learning—a continuum of inference versus prediction that
tor in the data conditional on the outcome of interest (2). This is traversed when moving from simpler approaches, like
analytical simplicity contrasts sharply with more advanced Barack-Corren et al.’s naive Bayesian classifier, to more
machine learning techniques, including neural nets, deep advanced, opaque approaches, including neural nets and
learning, and ensemble methods, which achieve notable deep learning (7, 8).
increases in prediction compared with naive Bayesian clas- Stepping back from the technical aspects of machine
sifier, but are black boxes in terms of estimation, as their learning, this study provides an opportunity to reflect on
models are extremely large, complex, and characterized the trend of the field toward increasingly data-driven ap-
by “hidden layers” (3, 4). So, while there is ample room proaches. Regardless of the promise of machine learning of

Am J Psychiatry 174:2, February 2017 ajp.psychiatryonline.org 93


EDITORIALS

EHR, it would be unwise to endorse the approach without National Institutes of Health and Veterans Affairs regarding
first considering the various professional, ethical, and legal safeguarding, and maximizing benefit from, comparable ar-
issues accompanying the potential improvements in diagnosis chives (e.g., dbGaP [database of Genomics and Phenotypes]).
and treatment. From the perspective of praxis, it is noteworthy In summary, as demonstrated by Barack-Corren et al., the
that the approach, carried to its logical conclusion, is funda- application of machine learning methods to EHRs, and the
mentally atheoretical, which marks a stark departure from potential of extending such analyses to other sources of big
conventional clinical paradigms built primarily on evidence- medical data (e.g., genomics and imaging), could generate
based causal models (9). Furthermore, for some it may seem enormous—yes, even paradigm-shifting—returns in improved
like a slippery slope toward ceding power in the clinic to al- diagnosis and treatment. What remains unclear is the pace
gorithms and devaluing clinician experience and judgment. at which these benefits will be realized, as well as who the
But I would note that the majority of a clinician’s function primary beneficiaries will be.
would not, and indeed could not, be encroached upon by data-
driven analytics. Rather, increasing the role of machine AUTHOR AND ARTICLE INFORMATION
learning applications to EHRs would provide additional in- From the Departments of Psychiatry and Sociology, University of Utah,
puts for the clinician to consider in making diagnostic and Salt Lake City, Utah.
treatment decisions. In this way, the emergence of machine Address correspondence to Dr. Adkins (danieleadkins@gmail.com).
learning EHR prediction may be seen as analogous to the Supported by the National Institute of Mental Health (grant K01MH093731)
development of imaging, genetic, or any other new source of and the University of Utah, Consortium for Families and Health
highly informative medical data. Additionally, there are eth- Research.
ical and legal issues surrounding the mining of EHR, including The author reports no financial relationships with commercial interests.
protecting the patient population from adverse consequences The author thanks Jason D. Thomas and Anna R. Docherty for assistance
stemming from the analysis of their data. This suggests po- and critique.
tentially problematic dynamics if, for instance, EHR data and Accepted October 2016.
analytics are accessed by insurance companies, who may use Am J Psychiatry 2017; 174:93–94; doi: 10.1176/appi.ajp.2016.16101169
the data to discriminate against patients in the market-
place. This risk is compounded by the possibility of black REFERENCES
box machine learning methods inadvertently identifying 1. Barak-Corren Y, Castro VM, Javitt S, et al: Predicting Suicidal Be-
havior From Longitudinal Electronic Health Records. Am J Psychiatry
stratifying criteria that we as a society find unacceptable.
2017; 174:154–162
While ethical arguments for the use of participant data 2. Friedman N, Geiger D, Goldszmidt M: Bayesian network classifiers.
often take the form of efforts to limit access to data, as in the Mach Learn 1997; 29:154–162
well-justified attention paid to patient privacy and non- 3. LeCun Y, Bengio Y, Hinton G: Deep learning. Nature 2015; 521:
disclosure, a powerful argument for the opposite exists in 436–444
4. Schmidhuber J: Deep learning in neural networks: an overview.
regard to enhancing public benefit through the analysis of
Neural Netw 2015; 61:85–117
EHR data. That is, as the data are often collected using some 5. Jurafsky D, Martin JH: Speech and Language Processing, 2nd ed.
combination of patient permission and government funding, Upper Saddle River, NJ, Prentice Hall, 2008
it may be reasonable to consider public benefit as a goal, or 6. Lin J, Dyer C: Data-Intensive Text Processing with MapReduce. San
even an obligation, in the collection and analysis of the data. Rafael, Calif, Morgan and Claypool Publishers, 2010
7. Breiman L: Statistical modeling: the two cultures. Stat Sci 2001; 16:
Although this does not argue against private sector activity,
199–231
it does support a concerted effort to consolidate data and 8. Kelleher JD, Namee BM, D’Arcy A: Fundamentals of Machine Learn-
analyses funded by federal research dollars into a public ing for Predictive Data Analytics. Cambridge, Mass, The MIT Press, 2015
resource—and what a tremendous resource a centralized 9. Greenland S, Pearl J, Robins JM: Causal diagrams for epidemiologic
archive of EHR data staffed with a cadre of machine learning research. Epidemiology 1999; 10:37–48
10. Hall MA: Property, privacy and the pursuit of integrated electronic
analysts could be. Currently, this possibility is prevented by
medical records. Iowa Law Rev 2010; 2010:631–663
data fragmentation, as most EHR data are presently pro- 11. Jensen PB, Jensen LJ, Brunak S: Mining electronic health records:
prietary (10, 11), but this could change with leadership from towards better research applications and clinical care. Nat Rev Genet
federal entities. And we have good precedent from the 2012; 13:395–405

94 ajp.psychiatryonline.org Am J Psychiatry 174:2, February 2017

You might also like