Interpretable identification of cancer genes across biological networks via transformer-powered graph representation learning.

Journal: Nature biomedical engineering
Published Date:

Abstract

Graph representation learning has been leveraged to identify cancer genes from biological networks. However, its applicability is limited by insufficient interpretability and generalizability under integrative network analysis. Here we report the development of an interpretable and generalizable transformer-based model that accurately predicts cancer genes by leveraging graph representation learning and the integration of multi-omics data with the topologies of homogeneous and heterogeneous networks of biological interactions. The model allows for the interpretation of the respective importance of multi-omic and higher-order structural features, achieved state-of-the-art performance in the prediction of cancer genes across biological networks (including networks of interactions between miRNA and proteins, transcription factors and proteins, and transcription factors and miRNA) in pan-cancer and cancer-specific scenarios, and predicted 57 cancer-gene candidates (including three genes that had not been identified by other models) among 4,729 unlabelled genes across 8 pan-cancer datasets. The model's interpretability and generalization may facilitate the understanding of gene-related regulatory mechanisms and the discovery of new cancer genes.

Authors

  • Xiaorui Su
    Huaxi MR Research Center, Department of Radiology, West China Hospital of Sichuan University, Chengdu, China.
  • Pengwei Hu
    The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, China.
  • Dongxu Li
    School of Information and Communication, Guilin University of Electronic Technology, Guilin, China.
  • Bowei Zhao
    Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China.
  • Zhaomeng Niu
    Department of Health Informatics, Rutgers School of Health Professions, Piscataway, NJ, USA.
  • Thomas Herget
    Merck KGaA, Darmstadt, Germany.
  • Philip S Yu
    Department of Computer Science, University of Illinois at Chicago, Chicago, IL 60612 USA.
  • Lun Hu
    The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Ürümqi, China.