An Interpretable Data-Driven Medical Knowledge Discovery Pipeline Based on Artificial Intelligence.

Journal: IEEE journal of biomedical and health informatics
Published Date:

Abstract

Difficulty in knowledge validation is a significant hindrance to knowledge discovery via data mining, especially automatic validation without artificial participation. In the field of medical research, medical knowledge discovery from electronic medical records is a common medical data mining method, but it is difficult to validate the discovered medical knowledge without the participation of medical experts. In this article, we propose a data-driven medical knowledge discovery closed-loop pipeline based on interpretable machine learning and deep learning; the components of the pipeline include Data Generator, Medical Knowledge Mining, Medical Knowledge Evaluation, and Medical Knowledge Application. In addition to completing the discovery of medical knowledge, the pipeline can also automatically validate the knowledge. We apply our pipeline's discovered medical knowledge to a traditional prognostic predictive model of heart failure in a real-world study, demonstrating that the incorporation of medical knowledge can effectively improve the performance of the traditional model. We also construct a scale model based on the discovered medical knowledge and demonstrate that it achieves good performance. To guarantee its medical effectiveness, every process of our pipeline involves the participation of medical experts.

Authors

  • Shaobo Wang
    College of Life Science and Bioengineering, Beijing University of Technology, Beijing, China.
  • Xinhui Du
  • Guangliang Liu
  • Hang Xing
  • Zengtao Jiao
    AI Lab, Yidu Cloud, No.35 of Huayuan North Road, Haidian District, Beijing, 100191, China.
  • Jun Yan
    Department of Statistics, University of Connecticut, Storrs, CT 06269, USA.
  • Youjun Liu
    College of Life Science and Bio-Engineering, Beijing University of Technology, No. 100 Pingleyuan, Chaoyang District, Beijing 100124, China. Electronic address: lyjlma@bjut.edu.cn.
  • Haichen Lv
    School of Science and Information Science, Qingdao Agricultural University, Qingdao, 266109, China.
  • Yunlong Xia
    Department of Cardiology, First Affiliated Hospital of Dalian Medical University, 116011 Dalian, Liaoning, China.