Predicting severe clinical events by learning about life-saving actions and outcomes using distant supervision.

Journal: Journal of biomedical informatics
Published Date:

Abstract

Medical error is a leading cause of patient death in the United States. Among the different types of medical errors, harm to patients caused by doctors missing early signs of deterioration is especially challenging to address due to the heterogeneity of patients' physiological patterns. In this study, we implemented risk prediction models using the gradient boosted tree method to derive risk estimates for acute onset diseases in the near future. The prediction model uses physiological variables as input signals and the time of the administration of outcome-related interventions and discharge diagnoses as labels. We examine four categories of acute onset illness: acute heart failure (AHF), acute lung injury (ALI), acute kidney injury (AKI), and acute liver failure (ALF). To develop and test the model, we consider data from two sources: 23,578 admissions to the Intensive Care Unit (ICU) from the MIMIC-3 dataset (Beth-Israel Hospital) and 16,612 ICU admissions on hospitals affiliated with our institution (University of Washington Medical Center and Harborview Medical Center, the UW-CDR dataset). We systematically identify outcome-related interventions for each acute organ failure, then use them, along with discharge diagnoses, to label proxy events to train gradient boosted trees. The trained models achieve the highest F1 score with a value of 0.6018 when predicting the need for life-saving interventions for ALI within the next 24 h in the MIMIC-3 dataset while showing a median F1 score of 0.3850 from all acute organ failures in both datasets. The approach also achieves the highest F1 score of 0.6301 when classifying a patient's ALI status at the time of discharge from the MIMIC-3 dataset, with a median F1 score of 0.4307 in both datasets. This study shows the potential for using the time of outcome-related intervention administrations and discharge diagnoses as labels to train supervised machine learning models that predict the risk of acute onset illnesses.

Authors

  • Dae Hyun Lee
    Biomedical & Health Informatics, School of Medicine, University of Washington, Seattle, WA, USA. Electronic address: dhlee4@uw.edu.
  • Meliha Yetisgen
    Departments of Biomedical and Health Informatics, University of Washington Medical Center, Seattle2Departments of Linguistics, University of Washington Medical Center, Seattle.
  • Lucy Vanderwende
    Biomedical & Health Informatics, School of Medicine, University of Washington, Seattle, WA, USA.
  • Eric Horvitz
    Microsoft.