Harnessing protein language model for structure-based discovery of highly efficient and robust PET hydrolases.

Journal: Nature communications
Published Date:

Abstract

Plastic waste, particularly polyethylene terephthalate (PET), presents significant environmental challenges, driving extensive research into enzymatic biodegradation. However, existing PET hydrolases (PETases) are limited by narrow sequence diversity and suboptimal performance. This study introduces VenusMine, a protein discovery pipeline that integrates protein language models (PLMs) with a representation tree to identify PETases based on structural similarity using sequence information. Using the crystal structure of IsPETase as a template, VenusMine identifies and clusters target proteins. Candidates are further screened using PLM-based assessments of solubility and thermostability, leading to the selection of 34 proteins for biochemical validation. Results reveal that 14 candidates exhibit PET degradation activity across 30-60 °C. Notably, a PET hydrolase from Kibdelosporangium banguiense (KbPETase) demonstrates a melting temperature (T) 32 °C higher than IsPETase and exhibits the highest PET degradation activity within 30 - 65 °C among wild-type PETases. KbPETase also surpasses FastPETase and LCC in catalytic efficiency. X-ray crystallography and molecular dynamics simulations show that KbPETase possesses a conserved catalytic domain and enhanced intramolecular interactions, underpinning its improved functionality and thermostability. This work demonstrates a novel deep learning approach for discovering natural PETases with enhanced properties.

Authors

  • Banghao Wu
    Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China.
  • Bozitao Zhong
    Mila-Québec AI Institute, Montréal, QC, Canada.
  • Lirong Zheng
  • Runye Huang
    School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China.
  • Shifeng Jiang
    School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China.
  • Mingchen Li
    Division of Computational Health Sciences, Department of Surgery, University of Minnesota, Minneapolis, MN, USA.
  • Liang Hong
    Department of Computer Science and Engineering, The Chinese University of Hong Kong, Sha Tin, Hong Kong SAR 999077, China.
  • Pan Tan
    Shanghai Artificial Intelligence Laboratory, Shanghai, China.