ESM-Ezy: a deep learning strategy for the mining of novel multicopper oxidases with superior properties.

Journal: Nature communications
PMID:

Abstract

The UniProt database is a valuable resource for biocatalyst discovery, yet predicting enzymatic functions remains challenging, especially for low-similarity sequences. Identifying superior enzymes with enhanced catalytic properties is even harder. To overcome these challenges, we develop ESM-Ezy, an enzyme mining strategy leveraging the ESM-1b protein language model and similarity calculations in semantic space. Using ESM-Ezy, we identify novel multicopper oxidases (MCOs) with superior catalytic properties, achieving a 44% success rate in outperforming query enzymes (QEs) in at least one property, including catalytic efficiency, heat and organic solvent tolerance, and pH stability. Notably, 51% of the MCOs excel in environmental remediation applications, and some exhibited unique structural motifs and unique active centers enhancing their functions. Beyond MCOs, 40% of L-asparaginases identified show higher specific activity and catalytic efficiency than QEs. ESM-Ezy thus provides a promising approach for discovering high-performance biocatalysts with low sequence similarity, accelerating enzyme discovery for industrial applications.

Authors

  • Hui Qian
    The Laboratory of Artificial Intelligence and Bigdata in Ophthalmology, Affiliated Eye Hospital, Nanjing Medical University, Nanjing, China.
  • Yuxuan Wang
    Department of Maxillofacial and Otorhinolaryngology Oncology, Tianjin's Clinical Research Center for Cancer, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China.
  • Xibin Zhou
    School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China.
  • Tao Gu
    Health Sciences and Innovation, Surrey Memorial Hospital, Fraser Health Authority, Surrey, BC, Canada.
  • Hui Wang
    Department of Vascular Surgery, Xuanwu Hospital, Capital Medical University, Beijing, China.
  • Hao Lyu
    Department of Chemical Engineering, Stanford University, Stanford, CA 94305, USA.
  • Zhikai Li
    School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China.
  • Xiuxu Li
    School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China.
  • Huan Zhou
    The School of Big Data and Artificial Intelligence, Anhui Xinhua University, Hefei, China.
  • Chengchen Guo
    School of Engineering, Westlake University, Hangzhou, 310014, Zhejiang, China.
  • Fajie Yuan
  • Yajie Wang
    Center for Biomedical Imaging Research, Department of Biomedical Engineering, School of Medicine, Tsinghua University, Beijing, China.