Screening of Cellular Senescence (CS) Related Genes as Biomarkers and Therapeutic Targets for Glioblastoma (GBM) by Integrated Machine Learning (IML)

Journal: bioRxiv
Published Date:

Abstract

Glioblastoma (GBM) is an aggressive brain tumor with limited prognostic biomarkers and therapeutic targets. This study applied an integrated machine learning (IML) framework to discover cellular senescence (CS)-related gene biomarkers and candidate therapeutic targets in GBM. Using Gene Expression Omnibus (GEO) training and validation cohorts, 113 machine learning models across 11 algorithms were integrated to pinpoint CS-associated gene signatures. Differential expression analysis combined with overlap of a CellAge senescence gene set yielded 129 CS-related differentially expressed genes (DEGs). Functional enrichment of these DEGs highlighted pathways related to cellular senescence and cell cycle regulation (Gene Ontology and KEGG). A multivariate classifier constructed via stepwise generalized linear modeling (GLM) and LASSO achieved high diagnostic performance (area under the ROC curve 0.92 in training, each independent validation set is over 0.85). Seven top-ranked genes from this model were validated, with TGFβI emerging as the most robust biomarker (AUC > 0.85) and its elevated expression was associated with significantly shorter overall survival. Spatial transcriptomics and single-cell RNA sequencing localized TGFβI expression to tumor cell clusters harboring high copy number variation (CNV) burdens. Immune microenvironment profiling linked TGFβI expression with increased macrophage infiltration. Finally, single-cell gene set enrichment (scGSEA) and AUCell analyses indicated enrichment of ECM–receptor interaction signaling in TGFβI-expressing cells. In summary, IML combined with spatial and single-cell transcriptomics identified TGFβI as a potent CS-related biomarker and a promising therapeutic target in GBM.

Authors

  • Tianle Xue; Yufan Hu; Haoju Lyu; Tian Xie