Automated feature learning and survival prognostication in grade 4 glioma using supervised machine learning models.
Journal:
Journal of neuro-oncology
Published Date:
Jun 16, 2025
Abstract
OBJECTIVE: WHO grade 4 glioma is the most common primary malignant brain tumor, with a median survival of only 14.6 months. Predicting survival outcomes remains challenging due to the tumor's heterogeneity and the influence of multiple clinical factors. Machine learning (ML) techniques have demonstrated superior predictive performance compared to traditional statistical models. Embedded feature-selection techniques such as Lasso shrinkage or Random-Forest importance scores are widely used, yet grade-4-glioma prognostic models still rely on an initial clinician-curated variable list and on ad-hoc cut-offs (e.g., "top X features" or "above certain threshold") when deciding how many ranked features to keep-choices that markedly influence model accuracy. We therefore developed a fully data-driven pipeline that begins with an unrestricted pool of clinical, functional, and biomarker variables, employs SHAP values for global importance ranking, and uses automated feature-subset optimization to identify the most optimal combination of predictors that maximizes survival-prediction performance in grade-4 glioma.
Authors
Keywords
No keywords available for this article.