Development and Validation of Time-to-Event Machine Learning Models for Predicting Disease-Free Survival in Patients with Locally Advanced Colorectal Cancer: A Multicenter Cohort Study.

Journal: Annals of surgical oncology
Published Date:

Abstract

BACKGROUND: The postoperative prognosis of locally advanced colorectal cancer (LACRC) exhibits significant heterogeneity. However, conventional models for predicting disease-free survival (DFS) often lack the necessary precision. Therefore, we aim to develop and validate time-to-event machine learning (ML) models for predicting DFS in patients with LACRC, ultimately improving prognostic accuracy. PATIENTS AND METHODS: This multicenter cohort study enrolled 456 patients with LACRC from three medical centers. A training cohort consisting of 350 patients was formed from centers 1 and 2, while an external validation cohort comprising 106 patients was sourced from center 3. Preoperative computed tomography (CT) images were segmented to extract radiomics features, and a radiomics score (radscore) was calculated through feature engineering. In addition, intratumor heterogeneity (ITH) scores were derived by integrating clustered mask regions with global pixel distribution patterns. To predict DFS, five time-to-event ML models were trained: Cox proportional hazards, FastKernelSurvivalSVM, GradientBoostingSurvival (GB-Survival), RandomSurvivalForest, and ExtraSurvivalTrees. Model performance was assessed using the concordance index (C-index), and Survival SHapley Additive exPlanations over time (SurvSHAP (t)) analysis was conducted for model interpretation. RESULTS: Among the models tested, GB-Survival demonstrated the highest predictive performance for DFS, achieving a C-index of 0.7823. SurvSHAP (t) analysis revealed that the key prognostic factors included the ITH score, pathological TNM stage, lymphovascular invasion, radscore, and the prognostic nutritional index. CONCLUSIONS: The GB-Survival model that integrates multimodal data outperforms other time-to-event ML models in predicting DFS for LACRC. This approach may facilitate the development of data-driven treatment strategies and personalized risk stratification for patients with LACRC.

Authors

  • Yongjie Zhou
    Department of Radiology, Jiangxi Cancer Hospital, Nanchang, China; The Second Affiliated Hospital of Nanchang Medical College, Nanchang, China; Jiangxi Clinical Research Center for Cancer, Nanchang, China.
  • Zhichao Zuo
    School of Mathematics and Computational Science, Xiangtan University, Xiangtan, Hunan, China.
  • Jinhong Zhao
    Department of Radiology, The Second Affiliated Hospital, Jiangxi Medical College, Nanchang University, Nanchang, China.
  • Yongming Tan
    Department of Radiology, The First Affiliated Hospital, Jiangxi Medical College, Nanchang University, Nanchang, China.
  • Jinqiu Deng
    School of Mathematics and Computational Science, Xiangtan University, Xiangtan, Hunan, China.
  • Xiang Wei
    School of Software Engineering Beijing Jiaotong University Beijing, China. Electronic address: [email protected].
  • Hao Li
    Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
  • Lianggeng Gong
    Department of Radiology, the Second Affiliated Hospital of Nanchang University, Nanchang, China.
  • Lan Liu
    School of Statistics, University of Minnesota at Twin Cities.
  • Linhua Zhong
    Department of Radiology, Jiangxi Cancer Hospital, Nanchang, China.

Keywords

No keywords available for this article.