SGPPI: structure-aware prediction of protein-protein interactions in rigorous conditions with graph convolutional network.

Journal: Briefings in bioinformatics
Published Date:

Abstract

While deep learning (DL)-based models have emerged as powerful approaches to predict protein-protein interactions (PPIs), the reliance on explicit similarity measures (e.g. sequence similarity and network neighborhood) to known interacting proteins makes these methods ineffective in dealing with novel proteins. The advent of AlphaFold2 presents a significant opportunity and also a challenge to predict PPIs in a straightforward way based on monomer structures while controlling bias from protein sequences. In this work, we established Structure and Graph-based Predictions of Protein Interactions (SGPPI), a structure-based DL framework for predicting PPIs, using the graph convolutional network. In particular, SGPPI focused on protein patches on the protein-protein binding interfaces and extracted the structural, geometric and evolutionary features from the residue contact map to predict PPIs. We demonstrated that our model outperforms traditional machine learning methods and state-of-the-art DL-based methods using non-representation-bias benchmark datasets. Moreover, our model trained on human dataset can be reliably transferred to predict yeast PPIs, indicating that SGPPI can capture converging structural features of protein interactions across various species. The implementation of SGPPI is available at https://github.com/emerson106/SGPPI.

Authors

  • Yan Huang
    Department of Neurology, University of Texas Health Science Center at Houston, Houston, TX.
  • Stefan Wuchty
    Department of Computer Science, University of Miami, Miami, FL, 33146, USA.
  • Yuan Zhou
    Department of Pharmacy, Taihe Hospital, Hubei University of Medicine, Shiyan, China.
  • Ziding Zhang
    State Key Laboratory of Agrobiotechnology, College of Biological Sciences , China Agricultural University , Beijing 100193 , China.