Comparative performance of multiple ensemble learning models for preoperative prediction of tumor deposits in rectal cancer based on MR imaging.

Journal: Scientific reports
PMID:

Abstract

Ensemble learning can effectively mitigate the risk of model overfitting during training. This study aims to evaluate the performance of ensemble learning models in predicting tumor deposits in rectal cancer (RC) and identify the optimal model for preoperative clinical decision-making. A total of 199 RC patients were analyzed, with radiomic features extracted from T2-weighted and apparent diffusion coefficient images and selected through advanced statistical methods. After that, the bagging-ensemble learning model (random forest), boosting-ensemble learning model (XGBoost, AdaBoost, LightGBM, and CatBoost), and voting-ensemble learning model (integrating 5 classifiers) were applied and optimized using grid search with tenfold cross-validation. The area under the receiver operator characteristic curve, calibration curve, t-distributed stochastic neighbor embedding (t-SNE), and decision curve analysis were adopted to evaluate the performance of each model. The voting-ensemble learning model (VELM) performs best in the testing cohort, with an AUC of 0.875 and an accuracy of 0.800. Notably, Calibration plots confirmed VELM's stability and t-SNE visualization illustrated clear clustering of radiomic features. Decision curve analysis further validated the VELM's superior net benefit across a range of clinical thresholds, underscoring its potential as a reliable tool for clinical decision-making in RC.

Authors

  • Jiayi Wang
    Department of Statistics, Texas A&M University.
  • Fayong Hu
    Department of Gastrointestinal Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China.
  • Jin Li
    Mental Health Center, West China Hospital, Sichuan University, Chengdu, China.
  • Wenzhi Lv
    Department of Artificial Intelligence, Julei Technology, Wuhan, China.
  • Zhiyong Liu
    State Key Laboratory of Respiratory Disease , Guangzhou Institutes of Biomedicine and Health (GIBH) , Chinese Academy of Sciences (CAS) , Guangzhou-510530 , China . Email: zhang_tianyu@gibh.ac.cn ; ; Tel: (+86)20 3201 5270.
  • Liang Wang
    Information Department, Dazhou Central Hospital, Dazhou 635000, China.