Interpretation of machine learning predictions for patient outcomes in electronic health records.

Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium

Published Date: Mar 4, 2020

Abstract

Electronic health records are an increasingly important resource for understanding the interactions between patient health, environment, and clinical decisions. In this paper, we report an empirical study of predictive modeling of seven patient outcomes using three state-of-the-art machine learning methods. Our primary goal is to validate the models by interpreting the importance of predictors in the final models. Central to interpretation is the use of feature importance scores, which vary depending on the underlying methodology. To assess feature importance, we compared univariate statistical tests, information-theoretic measures, permutation testing, and normalized coefficients from multivariate logistic regression models. In general, we found poor correlation between methods in their assessment of feature importance, even when their predictive performance was comparable and relatively good. However, permutation tests applied to random forest and gradient boosting models showed the most agreement, and their importance scores matched the clinical interpretation most frequently.
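As a rough illustration of the kind of comparison the abstract describes, the sketch below scores the same features two ways (normalized logistic regression coefficients and permutation importance on a random forest) and measures how well the two rankings agree. It is a minimal sketch under stated assumptions, not the authors' pipeline: the data are synthetic, "normalized" is assumed to mean absolute coefficients on standardized inputs scaled to sum to one, and Spearman rank correlation is an assumed choice of agreement measure.

```python
import numpy as np
from scipy.stats import spearmanr
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for an EHR feature matrix (hypothetical data, not from the study).
X, y = make_classification(
    n_samples=500, n_features=10, n_informative=4, n_redundant=2, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)

# Method 1: logistic regression coefficients on standardized features.
# Assumption: importance = |coefficient|, rescaled so the scores sum to 1.
scaler = StandardScaler().fit(X_train)
logit = LogisticRegression(max_iter=1000).fit(scaler.transform(X_train), y_train)
coef_importance = np.abs(logit.coef_[0])
coef_importance = coef_importance / coef_importance.sum()

# Method 2: permutation importance of a random forest on held-out data.
# Each feature is shuffled in turn and the mean drop in accuracy is recorded.
forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X_train, y_train)
perm = permutation_importance(forest, X_test, y_test, n_repeats=10, random_state=0)
perm_importance = perm.importances_mean

# Agreement between the two rankings (assumed metric: Spearman rank correlation).
rho, _ = spearmanr(coef_importance, perm_importance)
print(f"Spearman correlation between importance rankings: {rho:.2f}")
```

A correlation near 1 would indicate that the two methods rank features similarly; the abstract reports that such agreement was generally poor across methods, with permutation-based scores on tree ensembles agreeing most often.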

Authors

William La Cava
University of Pennsylvania, Philadelphia, PA, USA.

Christopher Bauer
Biomedical and Translational Informatics Institute/Geisinger, Danville, PA, USA.

Jason H Moore
University of Pennsylvania, Philadelphia, PA, USA.

Sarah A Pendergrass
Biomedical and Translational Informatics Institute/Geisinger, Danville, PA, USA.

Keywords

Electronic Health Records; Female; Humans; Logistic Models; Machine Learning; Male; Models, Statistical; Models, Theoretical; Patient Outcome Assessment

External Resources

PubMed (32308851)
