Machine Learning-Based Predictive Models for Early Detection of Cardiovascular Diseases: A Study Utilizing Patient Samples from a Tertiary Health Promotion Center in Korea.

Journal: Studies in health technology and informatics
PMID:

Abstract

A machine learning model was developed for cardiovascular diseases prediction based on 21,118 patient checkups data from a tertiary medical institution in Seoul, Korea, collected between 2009 and 2021. XGBoost algorithm showed the highest predictive performance, with an average AUROC of 0.877. In survival analysis, XGBSE achieved an AUROC exceeding 0.9 for 2-9 year predictions, with a C-index of 0.878 across all diseases, outperforming Cox regression (C-index of 0.887). A high-performance prediction model for cardiovascular diseases using the XGBSE algorithm was successfully developed and is poised for real-world clinical application following external simplification and validation.

Authors

  • Kanghyuck Lee
    Department of Biomedicine & Health Sciences, College of Medicine, The Catholic University of Korea, 222 Banpo-daero, Seocho-gu, Seoul 06591, Republic of Korea.
  • Seol Whan Oh
    Department of Biomedicine & Health Sciences, College of Medicine, The Catholic University of Korea, 222 Banpo-daero, Seocho-gu, Seoul 06591, Republic of Korea.
  • Sung-Hwan Kim
    Department of Orthopedic Surgery, Gangnam Severance Hospital, Yonsei University College of Medicine, Seoul, Republic of Korea.
  • Taehoon Ko
    Office of Hospital Information, Seoul National University Hospital, 101 Daehak-ro, Jongno-gu, Seoul, 03080, South Korea.
  • In Young Choi
    Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea.