Machine learning in colorectal cancer prediction and diagnosis: a systematic review of models' performance.
Journal:
International journal of medical informatics
Published Date:
Oct 28, 2025
Abstract
INTRODUCTION: Colorectal cancer (CRC) poses a significant global health burden, demanding early and accurate detection strategies. However, Machine Learning (ML) models are increasingly being applied for CRC prediction; yet their performance requires systematic evaluation to guide adoption. PURPOSE: This review evaluates the performance of ML models in predicting and diagnosing CRC, focusing on studies published between 2019 and 2024. It intends to identify the most frequently used ML models, determine their performance, and analyze the impact of different models' settings on their performance. METHODS: A comprehensive search was conducted using SCOPUS, PubMed, and Web of Science databases. Following PRISMA guidelines, studies evaluating ML models for CRC prediction were selected and reviewed. Study selection, data extraction, and risk-of-bias assessment were performed independently by multiple reviewers. Extracted data included study characteristics, model specifications, validation methods, and performance metrics. RESULTS: Thirty studies met the inclusion criteria. Random Forest (RF) was the most frequently evaluated model. At the same time, Ensemble Learning (EML), Neural Networks (ANN/DNN), and Support Vector Machines (SVM) consistently demonstrated the highest performance across multiple metrics. Most studies employed molecular datasets, and feature selection methods varied widely, significantly influencing model performance. CONCLUSIONS: ML models, particularly EML, ANN, DNN, and SVM, demonstrate high diagnostic performance in CRC prediction and diagnosis, suggesting substantial diagnostic potential; any effect on decision-making and outcomes through improved accuracy and early detection requires external and prospective validation. However, variability in datasets, methodologies, and reporting quality highlights significant research gaps, including the lack of standardized validation procedures and consistent performance reporting, which are crucial for facilitating clinical adoption and informing healthcare policy decisions.
Authors
Keywords
No keywords available for this article.