A Highly Sensitive Model Based on Graph Neural Networks for Enzyme Key Catalytic Residue Prediction.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Determining the catalytic site of enzymes is a great help for understanding the relationship between protein sequence, structure, and function, which provides the basis and targets for designing, modifying, and enhancing enzyme activity. The unique local spatial configuration bound to the substrate at the active center of the enzyme determines the catalytic ability of enzymes and plays an important role in the catalytic site prediction. As a suitable tool, the graph neural network can better understand and identify the residue sites with unique local spatial configurations due to its remarkable ability to characterize the three-dimensional structural features of proteins. Consequently, a novel model for predicting enzyme catalytic sites has been developed, which incorporates a uniquely designed adaptive edge-gated graph attention neural network (AEGAN). This model is capable of effectively handling sequential and structural characteristics of proteins at various levels, and the extracted features enable an accurate description of the local spatial configuration of the enzyme active site by sampling the local space around candidate residues and special design of amino acid physical and chemical properties. To evaluate its performance, the model was compared with existing catalytic site prediction models using different benchmark datasets and achieved the best results on each benchmark dataset. The model exhibited a sensitivity of 0.9659, accuracy of 0.9226, and area under the precision-recall curve (AUPRC) of 0.9241 on the independent test set constructed for evaluation. Furthermore, the F1-score of this model is nearly four times higher than that of the best-performing similar model in previous studies. This research can serve as a valuable tool to help researchers understand protein sequence-structure-function relationships while facilitating the characterization of novel enzymes of unknown function.

Authors

  • Xiaowei Shen
    National Energy R&D Center for Biorefinery, Beijing University of Chemical Technology, 100029, Beijing, China.
  • Shiding Zhang
    Henan Joint International Research Laboratory of Nanocomposite Sensing Materials, Anyang Institute of Technology, Anyang 455000, Henan Province, China.
  • Jianyu Long
    National Energy R&D Center for Biorefinery, Beijing University of Chemical Technology, 100029, Beijing, China.
  • Changjing Chen
    National Energy R&D Center for Biorefinery, Beijing University of Chemical Technology, 100029, Beijing, China.
  • Meng Wang
    State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin 150001, China.
  • Ziheng Cui
    National Energy R&D Center for Biorefinery, Beijing University of Chemical Technology, 100029, Beijing, China.
  • Biqiang Chen
    National Energy R&D Center for Biorefinery, Beijing University of Chemical Technology, 100029, Beijing, China.
  • Tianwei Tan
    National Energy R&D Center for Biorefinery, Beijing University of Chemical Technology, 100029, Beijing, China.