Machine learning on multiple epigenetic features reveals H3K27Ac as a driver of gene expression prediction across patients with glioblastoma.

Journal: PLoS computational biology
Published Date:

Abstract

Epigenetic mechanisms play a crucial role in driving transcript expression and shaping the phenotypic plasticity of glioblastoma stem cells (GSCs), contributing to tumor heterogeneity and therapeutic resistance. These mechanisms dynamically regulate the expression of key oncogenic and stemness-associated genes, enabling GSCs to adapt to environmental cues and evade targeted therapies. Importantly, epigenetic reprogramming allows GSCs to transition between cellular states, including therapy-resistant mesenchymal-like phenotypes, underscoring the need for epigenetic-targeting strategies to disrupt these adaptive processes. Understanding these epigenetic drivers of gene expression provides a foundation for novel therapeutic interventions aimed at eradicating GSCs and improving glioblastoma outcomes. Using machine learning (ML), we employ cross-patient prediction of transcript expression in GSCs by combining epigenetic features from various sources, including ATAC-seq, CTCF ChIP-seq, RNAPII ChIP-seq, H3K27Ac ChIP-seq, and RNA-seq. We investigate different ML and deep learning (DL) models for this task and ultimately build our final pipeline using XGBoost. The model trained on one patient generalizes to other 11 patients with high performance. Notably, H3K27Ac alone from a single patient is sufficient to predict gene expression in all 11 patients. Furthermore, the distribution of H3K27Ac peaks across the genomes of all patients is remarkably similar. These findings suggest that GSCs share a common distributional pattern of enhancer activity characterized by H3K27Ac, which can be utilized to predict gene expression in GSCs across patients. In summary, while GSCs are known for their transcriptomic and phenotypic heterogeneity, we propose that they share a common epigenetic pattern of enhancer activation that defines their underlying transcriptomic expression pattern. This pattern can predict gene expression across patient samples, providing valuable insights into the biology of GSCs.

Authors

  • Yusuke Suita
    Laboratory of Cancer Epigenetics and Plasticity, Department of Neurosurgery, Brown University, Providence, Rhode Island, United States of America.
  • Hardy Bright
    Data Science Institute, Brown University, Providence, Rhode Island, United States of America.
  • Yuan Pu
    Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, United States of America.
  • Merih Deniz Toruner
    Laboratory of Cancer Epigenetics and Plasticity, Department of Neurosurgery, Brown University, Providence, Rhode Island, United States of America.
  • Jordan Idehen
    Department of Computer Science, Brown University, Providence, Rhode Island, United States of America.
  • Nikos Tapinos
    Laboratory of Cancer Epigenetics and Plasticity, Department of Neurosurgery, Brown University, Providence, Rhode Island, United States of America.
  • Ritambhara Singh

Keywords

No keywords available for this article.