Applying Risk Models on Patients with Unknown Predictor Values: An Incremental Learning Approach.

Journal: Studies in health technology and informatics
Published Date:

Abstract

In clinical practice, many patients may have unknown or missing values for some predictors, causing that the developed risk models cannot be directly applied on these patients. In this paper, we propose an incremental learning approach to apply a developed risk model on new patients with unknown predictor values, which imputes a patient's unknown values based on his/her k-nearest neighbors (k-NN) from the incremental population. We perform a real world case study by developing a risk prediction model of stroke for patients with Type 2 diabetes mellitus from EHR data, and incrementally applying the risk model on a sequence of new patients. The experimental results show that our risk prediction model of stroke has good prediction performance. And the k-nearest neighbors based incremental learning approach for data imputation can gradually increase the prediction performance when the model is applied on new patients.

Authors

  • Enliang Xu
    IBM Research - China, Beijing, China.
  • Xiang Li
    Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, United States.
  • Jing Mei
    Ping An Technology, Shenzhen, China.
  • Shiwan Zhao
    IBM Research China, Beijing, China.
  • Gang Hu
    Ping An Health Technology, Beijing, China.
  • Eryu Xia
    IBM Research - China, Beijing, China.
  • Haifeng Liu
    IBM Research China, Beijing, China.
  • Guotong Xie
    Ping An Health Technology, Beijing, China.
  • Meilin Xu
    Pfizer Investment Co. Ltd., Beijing, China.
  • Xuejun Li