Deep Learning-Driven Insights into Enzyme-Substrate Interaction Discovery.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Enzymes are ubiquitous catalysts with enormous application potential in biomedicine, green chemistry, and biotechnology. However, accurately predicting whether a molecule serves as a substrate for a specific enzyme, especially for novel entities, remains a significant challenge. Compared with traditional experimental methods, computational approaches are much more resource-efficient and time-saving, but they often compromise on accuracy. To address this, we introduce the molecule-enzyme interaction (MEI) model, a novel machine learning framework designed to predict the probability that a given molecule is a substrate for a specified enzyme with high accuracy. Utilizing a comprehensive data set that encapsulates extensive information on enzymatic reactions and enzyme sequences, the MEI model seamlessly combines atomic environmental data with amino acid sequence features through an advanced attention mechanism within a hierarchical neural network. Empirical evaluations have confirmed that the MEI model outperforms the current state-of-the-art model by at least 6.7% in prediction accuracy and 8.5% in AUROC, underscoring its enhanced predictive capabilities. Additionally, the MEI model demonstrates remarkable generalization across data sets of varying qualities and sizes. This adaptability is further evidenced by its successful application in diverse areas, such as predicting interactions within the CYP450 enzyme family and achieving an outstanding accuracy of 90.5% in predicting the enzymatic breakdown of complex plastics within environmental applications. These examples illustrate the model's ability to effectively transfer knowledge from coarsely annotated enzyme databases to smaller, high-precision data sets, robustly modeling both sparse and high-quality databases. We believe that this versatility firmly establishes the MEI model as a foundational tool in enzyme research with immense potential to extend beyond its original scope.

Authors

  • Wenjia Qian
    College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Xiaorui Wang
    Structural Biophysics Group, School of Optometry and Vision Sciences, Cardiff University, Cardiff, Wales, UK.
  • Yuansheng Huang
    College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China.
  • Yu Kang
    College of Pharmaceutical Sciences, Zhejiang University , Hangzhou, Zhejiang 310058, P. R. China.
  • Peichen Pan
    Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou 310058, Zhejiang, China. Electronic address: panpeichen@zju.edu.cn.
  • Chang-Yu Hsieh
    Tencent Quantum Laboratory, Tencent, Shenzhen 518057 Guangdong, P. R. China.
  • Tingjun Hou
    College of Pharmaceutical Sciences, Zhejiang University , Hangzhou, Zhejiang 310058, China.