Biophysical prediction of protein-peptide interactions and signaling networks using machine learning.

Journal: Nature methods
PMID:

Abstract

In mammalian cells, much of signal transduction is mediated by weak protein-protein interactions between globular peptide-binding domains (PBDs) and unstructured peptidic motifs in partner proteins. The number and diversity of these PBDs (over 1,800 are known), their low binding affinities and the sensitivity of binding properties to minor sequence variation represent a substantial challenge to experimental and computational analysis of PBD specificity and the networks PBDs create. Here, we introduce a bespoke machine-learning approach, hierarchical statistical mechanical modeling (HSM), capable of accurately predicting the affinities of PBD-peptide interactions across multiple protein families. By synthesizing biophysical priors within a modern machine-learning framework, HSM outperforms existing computational methods and high-throughput experimental assays. HSM models are interpretable in familiar biophysical terms at three spatial scales: the energetics of protein-peptide binding, the multidentate organization of protein-protein interactions and the global architecture of signaling networks.

Authors

  • Joseph M Cunningham
    Laboratory of Systems Pharmacology, Department of Systems Biology, Harvard Medical School, Boston, MA, USA.
  • Grigoriy Koytiger
    Laboratory of Systems Pharmacology, Department of Systems Biology, Harvard Medical School, Boston, MA, USA.
  • Peter K Sorger
    Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA, USA peter_sorger@hms.harvard.edu.
  • Mohammed AlQuraishi
    Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA; Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA. Electronic address: alquraishi@hms.harvard.edu.