Predicting Breast Cancer Relapse from Histopathological Images with Ensemble Machine Learning Models.

Journal: Current oncology (Toronto, Ont.)
PMID:

Abstract

Relapse and metastasis occur in 30-40% of breast cancer patients, even after targeted treatments like trastuzumab for HER2-positive breast cancer. Accurate individual prognosis is essential for determining appropriate adjuvant treatment and early intervention. This study aims to enhance relapse and metastasis prediction using an innovative framework with machine learning (ML) and ensemble learning (EL) techniques. The developed framework is analyzed using The Cancer Genome Atlas (TCGA) data, which has 123 HER2-positive breast cancer patients. Our two-stage experimental approach first applied six basic ML models (support vector machine, logistic regression, decision tree, random forest, adaptive boosting, and extreme gradient boosting) and then ensembled these models using weighted averaging, soft voting, and hard voting techniques. The weighted averaging ensemble approach achieved enhanced performances of 88.46% accuracy, 89.74% precision, 94.59% sensitivity, 73.33% specificity, 92.11% F-Value, 71.07% Mathew's correlation coefficient, and an AUC of 0.903. This framework enables the accurate prediction of relapse and metastasis in HER2-positive breast cancer patients using H&E images and clinical data, thereby assisting in better treatment decision-making.

Authors

  • Ghanashyam Sahoo
    Department of Computer Science and Engineering, Siksha 'O' Anusandhan (Deemed to Be University), Bhubaneswar 751030, India.
  • Ajit Kumar Nayak
    Department of Computer Science and Information Technology, Siksha 'O' Anusandhan (Deemed to Be University), Bhubaneswar 751030, India.
  • Pradyumna Kumar Tripathy
    Department of Computer Science and Engineering, Silicon University, Bhubaneswar 751024, India.
  • Amrutanshu Panigrahi
    Department of Computer Science and Engineering, Siksha 'O' Anusandhan (Deemed to be University), Bhubaneswar, Odisha, India.
  • Abhilash Pati
    Department of Computer Science and Engineering, Siksha 'O' Anusandhan (Deemed to be University), Bhubaneswar, Odisha, India.
  • Bibhuprasad Sahu
    Department of Information Technology, Vardhaman College of Engineering (Autonomous), Hyderabad, Telangana, India.
  • Chandrakanta Mahanty
    Department of Computer Science and Engineering, GITAM Deemed to Be University, Visakhapatnam 530045, India.
  • Saurav Mallik
    Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center, Houston.