Pitfalls in Developing Machine Learning Models for Predicting Cardiovascular Diseases: Challenge and Solutions.

Journal: Journal of medical Internet research
Published Date:

Abstract

In recent years, there has been explosive development in artificial intelligence (AI), which has been widely applied in the health care field. As a typical AI technology, machine learning models have emerged with great potential in predicting cardiovascular diseases by leveraging large amounts of medical data for training and optimization, which are expected to play a crucial role in reducing the incidence and mortality rates of cardiovascular diseases. Although the field has become a research hot spot, there are still many pitfalls that researchers need to pay close attention to. These pitfalls may affect the predictive performance, credibility, reliability, and reproducibility of the studied models, ultimately reducing the value of the research and affecting the prospects for clinical application. Therefore, identifying and avoiding these pitfalls is a crucial task before implementing the research. However, there is currently a lack of a comprehensive summary on this topic. This viewpoint aims to analyze the existing problems in terms of data quality, data set characteristics, model design, and statistical methods, as well as clinical implications, and provide possible solutions to these problems, such as gathering objective data, improving training, repeating measurements, increasing sample size, preventing overfitting using statistical methods, using specific AI algorithms to address targeted issues, standardizing outcomes and evaluation criteria, and enhancing fairness and replicability, with the goal of offering reference and assistance to researchers, algorithm developers, policy makers, and clinical practitioners.

Authors

  • Yu-Qing Cai
    China Medical University, Shenyang, 110122, China.
  • Da-Xin Gong
    Smart Hospital Management Department, the First Hospital of China Medical University, Shenyang, 110001, China. gongdx@cmu1h.com.
  • Li-Ying Tang
    China Medical University, Shenyang, 110122, China.
  • Yue Cai
    Department of Anesthesiology, Shanxi Province Cancer Hospital/Shanxi Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences/Cancer Hospital Affiliated to Shanxi Medical University, Taiyuan, China.
  • Hui-Jun Li
    State Key Laboratory of Natural Medicines, China Pharmaceutical University, No. 24 Tongjia Lane, Nanjing 210009, China. Electronic address: cpuli@163.com.
  • Tian-Ci Jing
    Smart Hospital Management Department, the First Hospital of China Medical University, Shenyang, 110001, China.
  • Mengchun Gong
    Institute of Health Management, Southern Medical University, No. 1023-1063, Shatai South Road, Guangzhou 510515, People's Republic of China.
  • Wei Hu
    State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China.
  • Zhen-Wei Zhang
    China Rongtong Medical & Healthcare Co, Ltd, Chengdu, China.
  • Xingang Zhang
    Department of Cardiology, The First Hospital of China Medical University, Shenyang, China.
  • Guang-Wei Zhang
    Smart Hospital Management Department, the First Hospital of China Medical University, Shenyang, 110001, China. gwzhang@cmu.edu.cn.