EvoZymePro-Cat: A Protein-Ligand-Aware Deep Learning Framework for Predicting Mutation Effects in Enzyme Function.

Journal: ACS synthetic biology
Published Date:

Abstract

Enzymes are biological catalysts that speed up chemical reactions in an eco-friendly way. Precise enzyme design is hindered by vast sequence space and intricate sequence-structure-function interdependencies. To address these challenges, we developed EvoZymePro-Cat (EZPro-Cat), a deep learning platform for enzyme mutant screening. Conventional methods for predicting absolute mutant activities suffer from systematic errors and limited generalizability. Our pairwise comparison framework directly models relative activity superiority between variants, eliminating dependence on absolute value predictions. The framework integrates full sequence and local structure semantics of protein and ligand information using bilinear attention mechanisms. Protein sequences are encoded using the ESM1b transformer model. Ligands are represented through MolT5 embeddings and MACCS molecular fingerprints. The adaptability of protein residues to their microenvironments is captured by integrating structural features and site-specific evolutionary characteristics. Bilinear attention mechanisms capture long-range intermolecular interactions during catalysis by bidirectional projection and weighted fusion of protein-ligand features. Compared to existing methods, our model exhibits superior performance in identifying improved enzyme mutants through comparative prediction of mutation effects on activity, such as Km and kcat. For deep mutation scanning data sets, a few-shot learning strategy combined with the EZPro-Cat framework boosts prediction precision (AUC 0.908). By using integrated multimodal representations, EZPro-Cat offers a mechanistic and practical solution for functional profiling of intraprotein variants, driving paradigm shifts in highly efficient enzyme discovery and directed evolution.

Authors

  • Ran Xu
    Department of Allied Health Sciences, University of Connecticut, 358 Mansfield Rd, Storrs, CT 06269, USA.
  • Xinkang Li
    Centre in Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, 999078, Macao.
  • Jianan Sui
    Centre in Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macao 999078, China.
  • Lei Wu
    Advanced Photonics Center, Southeast University, Nanjing, 210096, China.
  • Chen Ling
    Toyota Research Institute of North America, 1555 Woodridge Avenue, Ann Arbor, Michigan United States, 48105.
  • Liangzhen Zheng
    Tencent AI Lab, Shenzhen, China.
  • Jingjing Guo
    The School of Management, Hefei University of Technology, Hefei, China.