基于多项式扩散模型的大肠杆菌合成启动子设计

IF 4.6 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
iScience Pub Date : 2024-10-18 eCollection Date: 2024-11-15 DOI:10.1016/j.isci.2024.111207
Qixiu Du, May Nee Poon, Xiaocheng Zeng, Pengcheng Zhang, Zheng Wei, Haochen Wang, Ye Wang, Lei Wei, Xiaowo Wang
{"title":"基于多项式扩散模型的大肠杆菌合成启动子设计","authors":"Qixiu Du, May Nee Poon, Xiaocheng Zeng, Pengcheng Zhang, Zheng Wei, Haochen Wang, Ye Wang, Lei Wei, Xiaowo Wang","doi":"10.1016/j.isci.2024.111207","DOIUrl":null,"url":null,"abstract":"<p><p>Generative design of promoters has enhanced the efficiency of <i>de novo</i> creation of functional sequences. Though several deep generative models have been employed in biological sequence generation, including variational autoencoder (VAE) or Wasserstein generative adversarial network (WGAN), these models might struggle with mode collapse and low sample diversity. In this study, we introduce the multinomial diffusion model (MDM) for promoter sequence design and propose a structured set of criteria for effectively comparing the performance of generative models. <i>In silico</i> experiments demonstrate that MDM outperforms existing generative AI approaches. MDM demonstrates superior performance in various computational evaluations, remains robust during the training process, and exhibits a strong ability in capturing weak signals. In addition, we experimentally validated that the majority of our model designed promoters have expression activities <i>in vivo</i>, indicating the practicality and potential of MDM for bioengineering.</p>","PeriodicalId":342,"journal":{"name":"iScience","volume":"27 11","pages":"111207"},"PeriodicalIF":4.6000,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11550136/pdf/","citationCount":"0","resultStr":"{\"title\":\"Synthetic promoter design in <i>Escherichia coli</i> based on multinomial diffusion model.\",\"authors\":\"Qixiu Du, May Nee Poon, Xiaocheng Zeng, Pengcheng Zhang, Zheng Wei, Haochen Wang, Ye Wang, Lei Wei, Xiaowo Wang\",\"doi\":\"10.1016/j.isci.2024.111207\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Generative design of promoters has enhanced the efficiency of <i>de novo</i> creation of functional sequences. Though several deep generative models have been employed in biological sequence generation, including variational autoencoder (VAE) or Wasserstein generative adversarial network (WGAN), these models might struggle with mode collapse and low sample diversity. In this study, we introduce the multinomial diffusion model (MDM) for promoter sequence design and propose a structured set of criteria for effectively comparing the performance of generative models. <i>In silico</i> experiments demonstrate that MDM outperforms existing generative AI approaches. MDM demonstrates superior performance in various computational evaluations, remains robust during the training process, and exhibits a strong ability in capturing weak signals. In addition, we experimentally validated that the majority of our model designed promoters have expression activities <i>in vivo</i>, indicating the practicality and potential of MDM for bioengineering.</p>\",\"PeriodicalId\":342,\"journal\":{\"name\":\"iScience\",\"volume\":\"27 11\",\"pages\":\"111207\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11550136/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"iScience\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1016/j.isci.2024.111207\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/11/15 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"iScience","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1016/j.isci.2024.111207","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/15 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

启动子的生成设计提高了从头创建功能序列的效率。虽然在生物序列生成中已经采用了一些深度生成模型,包括变异自动编码器(VAE)或瓦瑟斯坦生成对抗网络(WGAN),但这些模型可能会在模式崩溃和样本多样性低的问题上挣扎。在本研究中,我们介绍了用于启动子序列设计的多叉扩散模型(MDM),并提出了一套有效比较生成模型性能的结构化标准。硅学实验证明,MDM优于现有的生成式人工智能方法。MDM 在各种计算评估中表现出卓越的性能,在训练过程中保持稳健,并在捕捉弱信号方面表现出很强的能力。此外,我们还通过实验验证了大部分模型设计的启动子都具有体内表达活性,这表明 MDM 在生物工程领域具有实用性和潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Synthetic promoter design in Escherichia coli based on multinomial diffusion model.

Generative design of promoters has enhanced the efficiency of de novo creation of functional sequences. Though several deep generative models have been employed in biological sequence generation, including variational autoencoder (VAE) or Wasserstein generative adversarial network (WGAN), these models might struggle with mode collapse and low sample diversity. In this study, we introduce the multinomial diffusion model (MDM) for promoter sequence design and propose a structured set of criteria for effectively comparing the performance of generative models. In silico experiments demonstrate that MDM outperforms existing generative AI approaches. MDM demonstrates superior performance in various computational evaluations, remains robust during the training process, and exhibits a strong ability in capturing weak signals. In addition, we experimentally validated that the majority of our model designed promoters have expression activities in vivo, indicating the practicality and potential of MDM for bioengineering.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
iScience
iScience Multidisciplinary-Multidisciplinary
CiteScore
7.20
自引率
1.70%
发文量
1972
审稿时长
6 weeks
期刊介绍: Science has many big remaining questions. To address them, we will need to work collaboratively and across disciplines. The goal of iScience is to help fuel that type of interdisciplinary thinking. iScience is a new open-access journal from Cell Press that provides a platform for original research in the life, physical, and earth sciences. The primary criterion for publication in iScience is a significant contribution to a relevant field combined with robust results and underlying methodology. The advances appearing in iScience include both fundamental and applied investigations across this interdisciplinary range of topic areas. To support transparency in scientific investigation, we are happy to consider replication studies and papers that describe negative results. We know you want your work to be published quickly and to be widely visible within your community and beyond. With the strong international reputation of Cell Press behind it, publication in iScience will help your work garner the attention and recognition it merits. Like all Cell Press journals, iScience prioritizes rapid publication. Our editorial team pays special attention to high-quality author service and to efficient, clear-cut decisions based on the information available within the manuscript. iScience taps into the expertise across Cell Press journals and selected partners to inform our editorial decisions and help publish your science in a timely and seamless way.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信