Prediction of cardiovascular disease based on multiple feature selection and improved PSO-XGBoost model.

Journal: Scientific reports
PMID:

Abstract

Cardiovascular disease is a common disease that threatens human health. In order to predict it more accurately, this paper proposes a cardiovascular disease prediction model that combines multiple feature selection, improved particle swarm optimization algorithm, and extreme gradient boosting tree. Firstly, the dataset is preprocessed, and an XGBoost cardiovascular disease prediction model is constructed for model training and compare it with other algorithms. Then, combined with two factor Pearson correlation analysis and feature importance ranking, multiple feature selection is performed, with the optimal feature subset as the feature input. Finally, the improved particle swarm optimization algorithm is used to adjust the hyperparameters of the extreme gradient boosting tree algorithm, and selecting the optimal hyperparameter combination to construct the MFS-DLPSO-XGBoost model. The recall, precision, accuracy, F1 score, and area under the ROC curve (AUC) of the MFS-DLPSO-XGBoost model reached 71.4%, 76.3%, 74.7%, 73.6%, and 80.8%, respectively, which increased by 3.6%, 3.2%, 2.7%, 3.2%, and 2.3% compared to XGBoost. The results indicate that the model proposed in this article has good classification performance and can provide assistance for doctors and patients in predicting and preventing heart disease.

Authors

  • Kerang Cao
    College of Computer Science and Technology, Shenyang University of Chemical Technology, Shenyang, 110142, China.
  • Chang Liu
    Key Lab of Cell Differentiation and Apoptosis of Ministry of Education, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
  • Siqi Yang
    College of Computer Science and Technology, Shenyang University of Chemical Technology, Shenyang, 110142, China.
  • Yuxin Zhang
    State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, West China Hospital of Stomatology , Sichuan University , Chengdu 610041 , People's Republic of China.
  • Lili Li
    Department of Gastroenterology, West China Hospital, Sichuan University, Chengdu, China.
  • Hoekyung Jung
    Computer Engineering Dept, Paichai University, Daejeon, 35345, Korea. hkjung@pcu.ac.kr.
  • Shuo Zhang
    Ph.D. Program in Computer Science, The City University of New York, New York, NY, United States.