Integrated Machine Learning Approaches for Predicting Ischemic Stroke and Thromboembolism in Atrial Fibrillation.

Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium
PMID:

Abstract

Atrial fibrillation (AF) is a common cardiac rhythm disorder, which increases the risk of ischemic stroke and other thromboembolism (TE). Accurate prediction of TE is highly valuable for early intervention to AF patients. However, the prediction performance of previous TE risk models for AF is not satisfactory. In this study, we used integrated machine learning and data mining approaches to build 2-year TE prediction models for AF from Chinese Atrial Fibrillation Registry data. We first performed data cleansing and imputation on the raw data to generate available dataset. Then a series of feature construction and selection methods were used to identify predictive risk factors, based on which supervised learning methods were applied to build the prediction models. The experimental results show that our approach can achieve higher prediction performance (AUC: 0.71~0.74) than previous TE prediction models for AF (AUC: 0.66~0.69), and identify new potential risk factors as well.

Authors

  • Xiang Li
    Department of Radiology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, United States.
  • Haifeng Liu
    IBM Research China, Beijing, China.
  • Xin Du
    Beijing Hospital of TCM, Capital Medical University, Beijing 100010, China.
  • Ping Zhang
    Department of Computer Science and Engineering, The Ohio State University, USA.
  • Gang Hu
    Ping An Health Technology, Beijing, China.
  • Guotong Xie
    Ping An Health Technology, Beijing, China.
  • Shijing Guo
    IBM Research - China, Beijing, China.
  • Meilin Xu
    Pfizer Investment Co. Ltd., Beijing, China.
  • Xiaoping Xie
    Pfizer Investment Co. Ltd., Beijing, China.