Tianqian Zhou, Shibo Zhang, Huijia Song, Qiang He, Chun Fang, Xiaozhu Lin
{"title":"基于广义扩散模型的蛋白a样肽生成","authors":"Tianqian Zhou, Shibo Zhang, Huijia Song, Qiang He, Chun Fang, Xiaozhu Lin","doi":"10.1007/s10822-025-00653-w","DOIUrl":null,"url":null,"abstract":"<div><p>With the rapid advancement of biotechnology, protein generation and design based on generative models have demonstrated extensive applications in drug development, vaccine research, and biocatalysis. This research proposes a protein generation method based on the generalized diffusion model, which breaks through the traditional diffusion model’s reliance on Gaussian noise, enables more flexible protein sequence generation, and preliminarily verifies its advantages. Specifically, protein sequences were first encoded using one-hot encoding and input into the diffusion model to generate novel sequences. Subsequently, the tertiary structures of the generated proteins were predicted using AlphaFold, followed by structural alignment and backbone distance calculation via PyMOL to select the optimal sequences. The predicted derivative protein sequence A_005 was screened from the generated sequences and subjected to an affinity assay with Protein A parental. Experimental results revealed that A_005 exhibited remarkably high affinity, as well as a satisfactory dissociation rate and association rate. The findings demonstrate that the protein generation method based on the generalized diffusion model can effectively design protein sequences with high structural and functional similarity to target sequences. While prior studies have shown that both DDPM and generalized diffusion models achieve high generation quality, the generalized diffusion model outperforms in terms of task adaptability. Our research not only opens new technological pathways for protein design but also lays a solid foundation for future applications in biomedicine, providing significant theoretical and experimental evidence for subsequent drug development.</p></div>","PeriodicalId":621,"journal":{"name":"Journal of Computer-Aided Molecular Design","volume":"39 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Protein A-like peptide generation based on generalized diffusion model\",\"authors\":\"Tianqian Zhou, Shibo Zhang, Huijia Song, Qiang He, Chun Fang, Xiaozhu Lin\",\"doi\":\"10.1007/s10822-025-00653-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>With the rapid advancement of biotechnology, protein generation and design based on generative models have demonstrated extensive applications in drug development, vaccine research, and biocatalysis. This research proposes a protein generation method based on the generalized diffusion model, which breaks through the traditional diffusion model’s reliance on Gaussian noise, enables more flexible protein sequence generation, and preliminarily verifies its advantages. Specifically, protein sequences were first encoded using one-hot encoding and input into the diffusion model to generate novel sequences. Subsequently, the tertiary structures of the generated proteins were predicted using AlphaFold, followed by structural alignment and backbone distance calculation via PyMOL to select the optimal sequences. The predicted derivative protein sequence A_005 was screened from the generated sequences and subjected to an affinity assay with Protein A parental. Experimental results revealed that A_005 exhibited remarkably high affinity, as well as a satisfactory dissociation rate and association rate. The findings demonstrate that the protein generation method based on the generalized diffusion model can effectively design protein sequences with high structural and functional similarity to target sequences. While prior studies have shown that both DDPM and generalized diffusion models achieve high generation quality, the generalized diffusion model outperforms in terms of task adaptability. Our research not only opens new technological pathways for protein design but also lays a solid foundation for future applications in biomedicine, providing significant theoretical and experimental evidence for subsequent drug development.</p></div>\",\"PeriodicalId\":621,\"journal\":{\"name\":\"Journal of Computer-Aided Molecular Design\",\"volume\":\"39 1\",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computer-Aided Molecular Design\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10822-025-00653-w\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computer-Aided Molecular Design","FirstCategoryId":"99","ListUrlMain":"https://link.springer.com/article/10.1007/s10822-025-00653-w","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
Protein A-like peptide generation based on generalized diffusion model
With the rapid advancement of biotechnology, protein generation and design based on generative models have demonstrated extensive applications in drug development, vaccine research, and biocatalysis. This research proposes a protein generation method based on the generalized diffusion model, which breaks through the traditional diffusion model’s reliance on Gaussian noise, enables more flexible protein sequence generation, and preliminarily verifies its advantages. Specifically, protein sequences were first encoded using one-hot encoding and input into the diffusion model to generate novel sequences. Subsequently, the tertiary structures of the generated proteins were predicted using AlphaFold, followed by structural alignment and backbone distance calculation via PyMOL to select the optimal sequences. The predicted derivative protein sequence A_005 was screened from the generated sequences and subjected to an affinity assay with Protein A parental. Experimental results revealed that A_005 exhibited remarkably high affinity, as well as a satisfactory dissociation rate and association rate. The findings demonstrate that the protein generation method based on the generalized diffusion model can effectively design protein sequences with high structural and functional similarity to target sequences. While prior studies have shown that both DDPM and generalized diffusion models achieve high generation quality, the generalized diffusion model outperforms in terms of task adaptability. Our research not only opens new technological pathways for protein design but also lays a solid foundation for future applications in biomedicine, providing significant theoretical and experimental evidence for subsequent drug development.
期刊介绍:
The Journal of Computer-Aided Molecular Design provides a form for disseminating information on both the theory and the application of computer-based methods in the analysis and design of molecules. The scope of the journal encompasses papers which report new and original research and applications in the following areas:
- theoretical chemistry;
- computational chemistry;
- computer and molecular graphics;
- molecular modeling;
- protein engineering;
- drug design;
- expert systems;
- general structure-property relationships;
- molecular dynamics;
- chemical database development and usage.