Banghao Wu, Bozitao Zhong, Lirong Zheng, Runye Huang, Shifeng Jiang, Mingchen Li, Liang Hong, Pan Tan
{"title":"Harnessing protein language model for structure-based discovery of highly efficient and robust PET hydrolases","authors":"Banghao Wu, Bozitao Zhong, Lirong Zheng, Runye Huang, Shifeng Jiang, Mingchen Li, Liang Hong, Pan Tan","doi":"10.1038/s41467-025-61599-z","DOIUrl":null,"url":null,"abstract":"<p>Plastic waste, particularly polyethylene terephthalate (PET), presents significant environmental challenges, driving extensive research into enzymatic biodegradation. However, existing PET hydrolases (PETases) are limited by narrow sequence diversity and suboptimal performance. This study introduces VenusMine, a protein discovery pipeline that integrates protein language models (PLMs) with a representation tree to identify PETases based on structural similarity using sequence information. Using the crystal structure of <i>Is</i>PETase as a template, VenusMine identifies and clusters target proteins. Candidates are further screened using PLM-based assessments of solubility and thermostability, leading to the selection of 34 proteins for biochemical validation. Results reveal that 14 candidates exhibit PET degradation activity across 30–60 °C. Notably, a PET hydrolase from <i>Kibdelosporangium banguiense</i> (<i>Kb</i>PETase) demonstrates a melting temperature (T<sub>m</sub>) 32 °C higher than <i>Is</i>PETase and exhibits the highest PET degradation activity within 30 – 65 °C among wild-type PETases. <i>Kb</i>PETase also surpasses FastPETase and LCC in catalytic efficiency. X-ray crystallography and molecular dynamics simulations show that <i>Kb</i>PETase possesses a conserved catalytic domain and enhanced intramolecular interactions, underpinning its improved functionality and thermostability. This work demonstrates a novel deep learning approach for discovering natural PETases with enhanced properties.</p>","PeriodicalId":19066,"journal":{"name":"Nature Communications","volume":"12 1","pages":""},"PeriodicalIF":15.7000,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Communications","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41467-025-61599-z","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Plastic waste, particularly polyethylene terephthalate (PET), presents significant environmental challenges, driving extensive research into enzymatic biodegradation. However, existing PET hydrolases (PETases) are limited by narrow sequence diversity and suboptimal performance. This study introduces VenusMine, a protein discovery pipeline that integrates protein language models (PLMs) with a representation tree to identify PETases based on structural similarity using sequence information. Using the crystal structure of IsPETase as a template, VenusMine identifies and clusters target proteins. Candidates are further screened using PLM-based assessments of solubility and thermostability, leading to the selection of 34 proteins for biochemical validation. Results reveal that 14 candidates exhibit PET degradation activity across 30–60 °C. Notably, a PET hydrolase from Kibdelosporangium banguiense (KbPETase) demonstrates a melting temperature (Tm) 32 °C higher than IsPETase and exhibits the highest PET degradation activity within 30 – 65 °C among wild-type PETases. KbPETase also surpasses FastPETase and LCC in catalytic efficiency. X-ray crystallography and molecular dynamics simulations show that KbPETase possesses a conserved catalytic domain and enhanced intramolecular interactions, underpinning its improved functionality and thermostability. This work demonstrates a novel deep learning approach for discovering natural PETases with enhanced properties.
期刊介绍:
Nature Communications, an open-access journal, publishes high-quality research spanning all areas of the natural sciences. Papers featured in the journal showcase significant advances relevant to specialists in each respective field. With a 2-year impact factor of 16.6 (2022) and a median time of 8 days from submission to the first editorial decision, Nature Communications is committed to rapid dissemination of research findings. As a multidisciplinary journal, it welcomes contributions from biological, health, physical, chemical, Earth, social, mathematical, applied, and engineering sciences, aiming to highlight important breakthroughs within each domain.