Two-Stage Approaches to Accounting for Patient Heterogeneity in Machine Learning Risk Prediction Models in Oncology.

Journal: JCO clinical cancer informatics
Published Date:

Abstract

PURPOSE: Machine learning models developed from electronic health records data have been increasingly used to predict risk of mortality for general oncology patients. But these models may have suboptimal performance because of patient heterogeneity. The objective of this work is to develop a new modeling approach to predicting short-term mortality that accounts for heterogeneity across multiple subgroups in the presence of a large number of electronic health record predictors.

Authors

  • Eun Jeong Oh
    Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA.
  • Ravi B Parikh
    Division of Hematology and Oncology, Perelman School of Medicine, University of Philadelphia, Philadelphia, Pennsylvania.
  • Corey Chivers
    Penn Medicine, University of Pennsylvania, Philadelphia.
  • Jinbo Chen
    Department of Urology, Xiangya Hospital, Central South University, Changsha 410008, China.