-Complex-Based Machine Learning (HCML) for the Prediction of Protein-Protein Binding Affinity Changes upon Mutation.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Protein-protein interactions (PPIs) are involved in almost all biological processes in the cell. Understanding protein-protein interactions holds the key for the understanding of biological functions, diseases and the development of therapeutics. Recently, artificial intelligence (AI) models have demonstrated great power in PPIs. However, a key issue for all AI-based PPI models is efficient molecular representations and featurization. Here, we propose -complex-based PPI representation, and -complex-based machine learning models for the prediction of PPI binding affinity changes upon mutation, for the first time. In our model, various complexes (, ) can be generated for the graph representation of protein-protein complex by using different graphs , which reveal -related inner connections within the graph representation of protein-protein complex. Further, for a specific graph , a series of nested complexes are generated to give a multiscale characterization of the PPIs. Its persistent homology and persistent Euler characteristic are used as molecular descriptors and further combined with the machine learning model, in particular, gradient boosting tree (GBT). We systematically test our model on the two most-commonly used data sets, that is, SKEMPI and AB-Bind. It has been found that our model outperforms all the existing models as far as we know, which demonstrates the great potential of our model for the analysis of PPIs. Our model can be used for the analysis and design of efficient antibodies for SARS-CoV-2.

Authors

  • Xiang Liu
    College of Agricultural Science and Engineering, Hohai University, Nanjing 210098, China; Anhui Provincial Key Laboratory of Environmental Pollution Control and Resource Reuse, Anhui Jianzhu University, Hefei 230009, China.
  • Huitao Feng
    Chern Institute of Mathematics and LPMC, Nankai University, Tianjin, China, 300071.
  • Jie Wu
    Center of Disease Control of Qingdao, 175 Shandong Road, Qingdao, Shandong, 266001, China.
  • Kelin Xia
    Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, 637371, Singapore.