GeoNet enables the accurate prediction of protein-ligand binding sites through interpretable geometric deep learning.

Journal: Structure (London, England : 1993)
PMID:

Abstract

The identification of protein binding residues is essential for understanding their functions in vivo. However, it remains a computational challenge to accurately identify binding sites due to the lack of known residue binding patterns. Local residue spatial distribution and its interactive biophysical environment both determine binding patterns. Previous methods could not capture both information simultaneously, resulting in unsatisfactory performance. Here, we present GeoNet, an interpretable geometric deep learning model for predicting DNA, RNA, and protein binding sites by learning the latent residue binding patterns. GeoNet achieves this by introducing a coordinate-free geometric representation to characterize local residue distributions and generating an eigenspace to depict local interactive biophysical environments. Evaluation shows that GeoNet is superior compared to other leading predictors and it shows a strong interpretability of learned representations. We present three test cases, where interaction interfaces were successfully identified with GeoNet.

Authors

  • Jiyun Han
    School of Mathematics and Statistics, Shandong University, Weihai 264209, China.
  • Shizhuo Zhang
    School of Mathematics and Statistics, Shandong University, Weihai 264209, China.
  • Mingming Guan
    School of Mathematics and Statistics, Shandong University, Weihai 264209, China.
  • Qiuyu Li
    School of pharmacy, Southwest Medical University, Luzhou, China.
  • Xin Gao
    Department of Computer Science, New Jersey Institute of Technology, Newark, New Jersey, USA.
  • Juntao Liu
    State Key Laboratory of Transducer Technology, Aerospace Information Research Institute. Chinese Academy of Sciences, Beijing 100190, China.