TopEC: prediction of Enzyme Commission classes by 3D graph neural networks and localized 3D protein descriptor.

Journal: Nature communications
PMID:

Abstract

Tools available for inferring enzyme function from general sequence, fold, or evolutionary information are generally successful. However, they can lead to misclassification if a deviation in local structural features influences the function. Here, we present TopEC, a 3D graph neural network based on a localized 3D descriptor to learn chemical reactions of enzymes from enzyme structures and predict Enzyme Commission (EC) classes. Using message-passing frameworks, we include distance and angle information to significantly improve the predictive performance for EC classification (F-score: 0.72) compared to regular 2D graph neural networks. We trained networks without fold bias that can classify enzyme structures for a vast functional space (>800 ECs). Our model is robust to uncertainties in binding site locations and similar functions in distinct binding sites. We observe that TopEC networks learn from an interplay between biochemical features and local shape-dependent features. TopEC is available as a repository on GitHub: https://github.com/IBG4-CBCLab/TopEC and https://doi.org/10.25838/d5p-66 .

Authors

  • Karel van der Weg
    Institute of Bio- and Geosciences (IBG-4: Bioinformatics), Forschungszentrum Jülich GmbH, 52425, Jülich, Germany.
  • Erinc Merdivan
    Helmholtz AI, Helmholtz Munich, Neuherberg, Germany.
  • Marie Piraud
    Department of Informatics, Technische Universität München, Munich, Germany.
  • Holger Gohlke
    Institute for Pharmaceutical and Medicinal Chemistry, Heinrich Heine University Düsseldorf 40225 Düsseldorf, Germany; Institute of Bio- and Geosciences (IBG4: Bioinformatics), Forschungszentrum Jülich GmbH, 52425 Jülich, Germany. Electronic address: gohlke@hhu.de.