{"title":"PRO-LDM: A Conditional Latent Diffusion Model for Protein Sequence Design and Functional Optimization.","authors":"Sitao Zhang, Zixuan Jiang, Rundong Huang, Wenting Huang, Siyuan Peng, Shaoxun Mo, Letao Zhu, Peiheng Li, Ziyi Zhang, Emily Pan, Xi Chen, Yunfei Long, Qi Liang, Jin Tang, Renjing Xu, Rui Qing","doi":"10.1002/advs.202502723","DOIUrl":null,"url":null,"abstract":"<p><p>The diffusion model has grasped enormous attention in the computer vision field and emerged as a promising algorithm in protein design for precise structure and sequence generation. Here PRO-LDM is introduced: a modular multi-tasking framework combining design fidelity and computational efficiency, by integrating the diffusion model in latent space. The model learns biological representations at local and global levels, to design natural-like species with enhanced diversity, or optimize protein properties and functions. Its modular nature also enables the integration with alternative pre-trained encoders for enhanced generalization capability. Outlier design can be implemented by adjusting the classifier-free guidance that enables PRO-LDM to sample vastly different regions in the latent space. The approach is demonstrated in generating a novel green-fluorescence-protein variant with notably enhanced fluorescence in multiple working scenarios along with increased solubility and stability. The model provides a versatile tool to effectively extract physicochemical and evolutionary information in sequences for designing new proteins with optimized performances.</p>","PeriodicalId":117,"journal":{"name":"Advanced Science","volume":" ","pages":"e02723"},"PeriodicalIF":14.3000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Science","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1002/advs.202502723","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The diffusion model has grasped enormous attention in the computer vision field and emerged as a promising algorithm in protein design for precise structure and sequence generation. Here PRO-LDM is introduced: a modular multi-tasking framework combining design fidelity and computational efficiency, by integrating the diffusion model in latent space. The model learns biological representations at local and global levels, to design natural-like species with enhanced diversity, or optimize protein properties and functions. Its modular nature also enables the integration with alternative pre-trained encoders for enhanced generalization capability. Outlier design can be implemented by adjusting the classifier-free guidance that enables PRO-LDM to sample vastly different regions in the latent space. The approach is demonstrated in generating a novel green-fluorescence-protein variant with notably enhanced fluorescence in multiple working scenarios along with increased solubility and stability. The model provides a versatile tool to effectively extract physicochemical and evolutionary information in sequences for designing new proteins with optimized performances.
期刊介绍:
Advanced Science is a prestigious open access journal that focuses on interdisciplinary research in materials science, physics, chemistry, medical and life sciences, and engineering. The journal aims to promote cutting-edge research by employing a rigorous and impartial review process. It is committed to presenting research articles with the highest quality production standards, ensuring maximum accessibility of top scientific findings. With its vibrant and innovative publication platform, Advanced Science seeks to revolutionize the dissemination and organization of scientific knowledge.