Improved enzyme functional annotation prediction using contrastive learning with structural inference.

Journal: Communications biology
PMID:

Abstract

Recent years have witnessed the remarkable progress of deep learning within the realm of scientific disciplines, yielding a wealth of promising outcomes. A prominent challenge within this domain has been the task of predicting enzyme function, a complex problem that has seen the development of numerous computational methods, particularly those rooted in deep learning techniques. However, the majority of these methods have primarily focused on either amino acid sequence data or protein structure data, neglecting the potential synergy of combining both modalities. To address this gap, we propose a Contrastive Learning framework for Enzyme functional ANnotation prediction combined with protein amino acid sequences and Contact maps (CLEAN-Contact). We rigorously evaluate the performance of our CLEAN-Contact framework against the state-of-the-art enzyme function prediction models using multiple benchmark datasets. Using CLEAN-Contact, we predict previously unknown enzyme functions within the proteome of Prochlorococcus marinus MED4. Our findings convincingly demonstrate the substantial superiority of our CLEAN-Contact framework, marking a significant step forward in enzyme function prediction accuracy.

Authors

  • Yuxin Yang
    Key Laboratory of Biomedical Engineering of Hainan Province, School of Biomedical Engineering, Hainan University, Haikou, China.
  • Abby Jerger
    Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, 1100 Dexter Ave N, Seattle, WA, 98109, USA.
  • Song Feng
    Network Information Center, Xiangya Hospital, Central South University, Xiangya Road, Changsha, 410008, China. fs205@sina.com.
  • Zixu Wang
    Laboratory of Veterinary Anatomy, College of Animal Medicine, China Agricultural University, Haidian, Beijing, 100193, People's Republic of China.
  • Christina Brasfield
    Pacific Northwest National Laboratory, 902 Battelle Blvd, Richland, WA, 99354, USA.
  • Margaret S Cheung
    Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, 1100 Dexter Ave N, Seattle, WA, 98109, USA.
  • Jeremy Zucker
    Earth and Biological Sciences Division, Pacific Northwest National Laboratories, Richland, WA, United States.
  • Qiang Guan
    Department of Computer Science, Kent State University, Kent, OH 44242, USA. Electronic address: qguan@kent.edu.