Harnessing protein language model for structure-based discovery of highly efficient and robust PET hydrolases.
Journal:
Nature communications
Published Date:
Jul 5, 2025
Abstract
Plastic waste, particularly polyethylene terephthalate (PET), presents significant environmental challenges, driving extensive research into enzymatic biodegradation. However, existing PET hydrolases (PETases) are limited by narrow sequence diversity and suboptimal performance. This study introduces VenusMine, a protein discovery pipeline that integrates protein language models (PLMs) with a representation tree to identify PETases based on structural similarity using sequence information. Using the crystal structure of IsPETase as a template, VenusMine identifies and clusters target proteins. Candidates are further screened using PLM-based assessments of solubility and thermostability, leading to the selection of 34 proteins for biochemical validation. Results reveal that 14 candidates exhibit PET degradation activity across 30-60 °C. Notably, a PET hydrolase from Kibdelosporangium banguiense (KbPETase) demonstrates a melting temperature (T) 32 °C higher than IsPETase and exhibits the highest PET degradation activity within 30 - 65 °C among wild-type PETases. KbPETase also surpasses FastPETase and LCC in catalytic efficiency. X-ray crystallography and molecular dynamics simulations show that KbPETase possesses a conserved catalytic domain and enhanced intramolecular interactions, underpinning its improved functionality and thermostability. This work demonstrates a novel deep learning approach for discovering natural PETases with enhanced properties.