PRECOGx: exploring GPCR signaling mechanisms with deep protein representations.

Journal: Nucleic acids research
Published Date:

Abstract

In this study we show that protein language models can encode structural and functional information of GPCR sequences that can be used to predict their signaling and functional repertoire. We used the ESM1b protein embeddings as features and the binding information known from publicly available studies to develop PRECOGx, a machine learning predictor to explore GPCR interactions with G protein and β-arrestin, which we made available through a new webserver (https://precogx.bioinfolab.sns.it/). PRECOGx outperformed its predecessor (e.g. PRECOG) in predicting GPCR-transducer couplings, being also able to consider all GPCR classes. The webserver also provides new functionalities, such as the projection of input sequences on a low-dimensional space describing essential features of the human GPCRome, which is used as a reference to track GPCR variants. Additionally, it allows inspection of the sequence and structural determinants responsible for coupling via the analysis of the most important attention maps used by the models as well as through predicted intramolecular contacts. We demonstrate applications of PRECOGx by predicting the impact of disease variants (ClinVar) and alternative splice forms from healthy tissues (GTEX) of human GPCRs, revealing the power to dissect system biasing mechanisms in both health and disease.

Authors

  • Marin Matic
    Laboratorio di Biologia Bio@SNS, Scuola Normale Superiore, Piazza dei Cavalieri 7, 56126, Pisa, Italy.
  • Gurdeep Singh
    Heidelberg University Biochemistry Centre, 69120 Heidelberg, Germany.
  • Francesco Carli
    Department of Informatics, Università degli Studi di Torino, Torino, Piemonte, Italy.
  • Natalia De Oliveira Rosa
    Laboratorio di Biologia Bio@SNS, Scuola Normale Superiore, Piazza dei Cavalieri 7, 56126, Pisa, Italy.
  • Pasquale Miglionico
    Laboratorio di Biologia Bio@SNS, Scuola Normale Superiore, Piazza dei Cavalieri 7, 56126, Pisa, Italy.
  • Lorenzo Magni
    Laboratorio di Biologia Bio@SNS, Scuola Normale Superiore, Piazza dei Cavalieri 7, 56126, Pisa, Italy.
  • J Silvio Gutkind
    Department of Pharmacology and Moores Cancer Center, University of CA, San Diego, La Jolla, CA 92093, USA.
  • Robert B Russell
    Heidelberg University Biochemistry Centre, 69120 Heidelberg, Germany.
  • Asuka Inoue
    Graduate School of Pharmaceutical Sciences, Tohoku University, Sendai, Miyagi 980-8578, Japan.
  • Francesco Raimondi
    Laboratorio di Biologia Bio@SNS, Scuola Normale Superiore, Piazza dei Cavalieri 7, 56126, Pisa, Italy.