基于广义扩散模型的蛋白a样肽生成

IF 3.1 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Tianqian Zhou, Shibo Zhang, Huijia Song, Qiang He, Chun Fang, Xiaozhu Lin
{"title":"基于广义扩散模型的蛋白a样肽生成","authors":"Tianqian Zhou,&nbsp;Shibo Zhang,&nbsp;Huijia Song,&nbsp;Qiang He,&nbsp;Chun Fang,&nbsp;Xiaozhu Lin","doi":"10.1007/s10822-025-00653-w","DOIUrl":null,"url":null,"abstract":"<div><p>With the rapid advancement of biotechnology, protein generation and design based on generative models have demonstrated extensive applications in drug development, vaccine research, and biocatalysis. This research proposes a protein generation method based on the generalized diffusion model, which breaks through the traditional diffusion model’s reliance on Gaussian noise, enables more flexible protein sequence generation, and preliminarily verifies its advantages. Specifically, protein sequences were first encoded using one-hot encoding and input into the diffusion model to generate novel sequences. Subsequently, the tertiary structures of the generated proteins were predicted using AlphaFold, followed by structural alignment and backbone distance calculation via PyMOL to select the optimal sequences. The predicted derivative protein sequence A_005 was screened from the generated sequences and subjected to an affinity assay with Protein A parental. Experimental results revealed that A_005 exhibited remarkably high affinity, as well as a satisfactory dissociation rate and association rate. The findings demonstrate that the protein generation method based on the generalized diffusion model can effectively design protein sequences with high structural and functional similarity to target sequences. While prior studies have shown that both DDPM and generalized diffusion models achieve high generation quality, the generalized diffusion model outperforms in terms of task adaptability. Our research not only opens new technological pathways for protein design but also lays a solid foundation for future applications in biomedicine, providing significant theoretical and experimental evidence for subsequent drug development.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"39 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Protein A-like peptide generation based on generalized diffusion model\",\"authors\":\"Tianqian Zhou,&nbsp;Shibo Zhang,&nbsp;Huijia Song,&nbsp;Qiang He,&nbsp;Chun Fang,&nbsp;Xiaozhu Lin\",\"doi\":\"10.1007/s10822-025-00653-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>With the rapid advancement of biotechnology, protein generation and design based on generative models have demonstrated extensive applications in drug development, vaccine research, and biocatalysis. This research proposes a protein generation method based on the generalized diffusion model, which breaks through the traditional diffusion model’s reliance on Gaussian noise, enables more flexible protein sequence generation, and preliminarily verifies its advantages. Specifically, protein sequences were first encoded using one-hot encoding and input into the diffusion model to generate novel sequences. Subsequently, the tertiary structures of the generated proteins were predicted using AlphaFold, followed by structural alignment and backbone distance calculation via PyMOL to select the optimal sequences. The predicted derivative protein sequence A_005 was screened from the generated sequences and subjected to an affinity assay with Protein A parental. Experimental results revealed that A_005 exhibited remarkably high affinity, as well as a satisfactory dissociation rate and association rate. The findings demonstrate that the protein generation method based on the generalized diffusion model can effectively design protein sequences with high structural and functional similarity to target sequences. While prior studies have shown that both DDPM and generalized diffusion models achieve high generation quality, the generalized diffusion model outperforms in terms of task adaptability. Our research not only opens new technological pathways for protein design but also lays a solid foundation for future applications in biomedicine, providing significant theoretical and experimental evidence for subsequent drug development.</p></div>\",\"PeriodicalId\":621,\"journal\":{\"name\":\"Journal of Computer-Aided Molecular Design\",\"volume\":\"39 1\",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computer-Aided Molecular Design\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10822-025-00653-w\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computer-Aided Molecular Design","FirstCategoryId":"99","ListUrlMain":"https://link.springer.com/article/10.1007/s10822-025-00653-w","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

随着生物技术的迅速发展,基于生成模型的蛋白质生成和设计已在药物开发、疫苗研究和生物催化方面得到广泛应用。本研究提出了一种基于广义扩散模型的蛋白质生成方法,突破了传统扩散模型对高斯噪声的依赖,使蛋白质序列生成更加灵活,并初步验证了其优势。具体而言,首先使用one-hot编码对蛋白质序列进行编码,并将其输入扩散模型以生成新序列。随后,利用AlphaFold预测生成蛋白的三级结构,并通过PyMOL计算结构比对和主链距离,选择最优序列。从生成的序列中筛选出预测的衍生蛋白序列A_005,并与蛋白A亲本进行亲和实验。实验结果表明,A_005具有非常高的亲和力,并且具有令人满意的解离率和缔合率。研究结果表明,基于广义扩散模型的蛋白质生成方法可以有效地设计出与目标序列结构和功能高度相似的蛋白质序列。虽然已有研究表明,DDPM和广义扩散模型都能达到较高的生成质量,但广义扩散模型在任务适应性方面表现更好。我们的研究不仅为蛋白质设计开辟了新的技术途径,也为未来在生物医学上的应用奠定了坚实的基础,为后续的药物开发提供了重要的理论和实验依据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Protein A-like peptide generation based on generalized diffusion model

Protein A-like peptide generation based on generalized diffusion model

Protein A-like peptide generation based on generalized diffusion model

With the rapid advancement of biotechnology, protein generation and design based on generative models have demonstrated extensive applications in drug development, vaccine research, and biocatalysis. This research proposes a protein generation method based on the generalized diffusion model, which breaks through the traditional diffusion model’s reliance on Gaussian noise, enables more flexible protein sequence generation, and preliminarily verifies its advantages. Specifically, protein sequences were first encoded using one-hot encoding and input into the diffusion model to generate novel sequences. Subsequently, the tertiary structures of the generated proteins were predicted using AlphaFold, followed by structural alignment and backbone distance calculation via PyMOL to select the optimal sequences. The predicted derivative protein sequence A_005 was screened from the generated sequences and subjected to an affinity assay with Protein A parental. Experimental results revealed that A_005 exhibited remarkably high affinity, as well as a satisfactory dissociation rate and association rate. The findings demonstrate that the protein generation method based on the generalized diffusion model can effectively design protein sequences with high structural and functional similarity to target sequences. While prior studies have shown that both DDPM and generalized diffusion models achieve high generation quality, the generalized diffusion model outperforms in terms of task adaptability. Our research not only opens new technological pathways for protein design but also lays a solid foundation for future applications in biomedicine, providing significant theoretical and experimental evidence for subsequent drug development.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Computer-Aided Molecular Design
Journal of Computer-Aided Molecular Design 生物-计算机:跨学科应用
CiteScore
8.00
自引率
8.60%
发文量
56
审稿时长
3 months
期刊介绍: The Journal of Computer-Aided Molecular Design provides a form for disseminating information on both the theory and the application of computer-based methods in the analysis and design of molecules. The scope of the journal encompasses papers which report new and original research and applications in the following areas: - theoretical chemistry; - computational chemistry; - computer and molecular graphics; - molecular modeling; - protein engineering; - drug design; - expert systems; - general structure-property relationships; - molecular dynamics; - chemical database development and usage.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信