Intelligent Design of Escherichia coli Terminators by Coupling Prediction and Generation Models

IF 3.9 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Jie Li, Lin-Feng Wu, Kai Liu and Bin-Guang Ma*, 
{"title":"Intelligent Design of Escherichia coli Terminators by Coupling Prediction and Generation Models","authors":"Jie Li,&nbsp;Lin-Feng Wu,&nbsp;Kai Liu and Bin-Guang Ma*,&nbsp;","doi":"10.1021/acssynbio.5c00429","DOIUrl":null,"url":null,"abstract":"<p >Terminators are specific nucleotide sequences located at the 3′ end of a gene and contain transcription termination information. As a fundamental genetic regulatory element, terminators play a crucial role in the design of gene circuits. Accurately characterizing terminator strength is essential for improving the precision of gene circuit designs. Experimental characterization of terminator strength is time-consuming and labor-intensive; therefore, there is a need to develop computational tools capable of accurately predicting terminator strength. Current prediction methods do not fully consider sequence or thermodynamic information related to terminators, lacking robust models for accurate prediction. Meanwhile, deep generative models have demonstrated tremendous potential in the design of biological sequences and are expected to be applied to terminator sequence design. This study focuses on intelligent design of <i>Escherichia coli</i> terminators and primarily conducts the following research: (1) to construct an intrinsic terminator strength prediction model for <i>E. coli</i>, this study extracts sequence features and thermodynamic features from <i>E. coli</i> intrinsic terminators. Machine learning models based on the selected features achieved a prediction performance of <i>R</i><sup>2</sup> = 0.72. (2) This study employs a generative adversarial network (GAN) to learn from intrinsic terminator sequence training data and generate terminator sequences. Evaluation reveals that the generated terminators exhibit similar data distributions to intrinsic terminators, demonstrating the reliability of GAN-generated terminator sequences. (3) This study uses the constructed terminator strength prediction model to screen for strong terminators from the generated set. Experimental verification shows that among the 18 selected terminators, 72% exhibit termination efficiencies greater than 90%, confirming the reliability of the intelligent design approach for <i>E. coli</i> terminators. In sum, this study constructs a terminator strength prediction model and a terminator generation model for <i>E. coli</i>, providing model support for terminator design in gene circuits. This enhances the modularity of biological component design and promotes the development of synthetic biology.</p>","PeriodicalId":26,"journal":{"name":"ACS Synthetic Biology","volume":"14 9","pages":"3744–3752"},"PeriodicalIF":3.9000,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Synthetic Biology","FirstCategoryId":"99","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acssynbio.5c00429","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Terminators are specific nucleotide sequences located at the 3′ end of a gene and contain transcription termination information. As a fundamental genetic regulatory element, terminators play a crucial role in the design of gene circuits. Accurately characterizing terminator strength is essential for improving the precision of gene circuit designs. Experimental characterization of terminator strength is time-consuming and labor-intensive; therefore, there is a need to develop computational tools capable of accurately predicting terminator strength. Current prediction methods do not fully consider sequence or thermodynamic information related to terminators, lacking robust models for accurate prediction. Meanwhile, deep generative models have demonstrated tremendous potential in the design of biological sequences and are expected to be applied to terminator sequence design. This study focuses on intelligent design of Escherichia coli terminators and primarily conducts the following research: (1) to construct an intrinsic terminator strength prediction model for E. coli, this study extracts sequence features and thermodynamic features from E. coli intrinsic terminators. Machine learning models based on the selected features achieved a prediction performance of R2 = 0.72. (2) This study employs a generative adversarial network (GAN) to learn from intrinsic terminator sequence training data and generate terminator sequences. Evaluation reveals that the generated terminators exhibit similar data distributions to intrinsic terminators, demonstrating the reliability of GAN-generated terminator sequences. (3) This study uses the constructed terminator strength prediction model to screen for strong terminators from the generated set. Experimental verification shows that among the 18 selected terminators, 72% exhibit termination efficiencies greater than 90%, confirming the reliability of the intelligent design approach for E. coli terminators. In sum, this study constructs a terminator strength prediction model and a terminator generation model for E. coli, providing model support for terminator design in gene circuits. This enhances the modularity of biological component design and promotes the development of synthetic biology.

Abstract Image

基于耦合预测与生成模型的大肠杆菌终止器智能设计
终止子是位于基因3'端的特定核苷酸序列,包含转录终止信息。终止子作为一种基本的基因调控元件,在基因回路的设计中起着至关重要的作用。准确表征终止子强度对提高基因电路设计精度至关重要。终端强度的实验表征耗时耗力;因此,需要开发能够准确预测终结点强度的计算工具。目前的预测方法没有充分考虑与终止子相关的序列或热力学信息,缺乏准确预测的鲁棒模型。同时,深度生成模型在生物序列设计中显示出巨大的潜力,有望应用于终止序列设计。本研究以大肠杆菌终止子的智能设计为重点,主要进行以下研究:(1)构建大肠杆菌的内征终止子强度预测模型,提取大肠杆菌内征终止子的序列特征和热力学特征。基于所选特征的机器学习模型的预测性能为R2 = 0.72。(2)本研究采用生成式对抗网络(GAN)从固有终止序列训练数据中学习并生成终止序列。评估表明,生成的终止子具有与固有终止子相似的数据分布,证明了gan生成的终止子序列的可靠性。(3)本研究利用构建的终端强度预测模型从生成集中筛选强终端。实验验证表明,在选择的18个终止体中,72%的终止效率大于90%,证实了大肠杆菌终止体智能设计方法的可靠性。综上所述,本研究构建了大肠杆菌终止子强度预测模型和终止子生成模型,为基因回路中的终止子设计提供了模型支持。这增强了生物部件设计的模块化,促进了合成生物学的发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.00
自引率
10.60%
发文量
380
审稿时长
6-12 weeks
期刊介绍: The journal is particularly interested in studies on the design and synthesis of new genetic circuits and gene products; computational methods in the design of systems; and integrative applied approaches to understanding disease and metabolism. Topics may include, but are not limited to: Design and optimization of genetic systems Genetic circuit design and their principles for their organization into programs Computational methods to aid the design of genetic systems Experimental methods to quantify genetic parts, circuits, and metabolic fluxes Genetic parts libraries: their creation, analysis, and ontological representation Protein engineering including computational design Metabolic engineering and cellular manufacturing, including biomass conversion Natural product access, engineering, and production Creative and innovative applications of cellular programming Medical applications, tissue engineering, and the programming of therapeutic cells Minimal cell design and construction Genomics and genome replacement strategies Viral engineering Automated and robotic assembly platforms for synthetic biology DNA synthesis methodologies Metagenomics and synthetic metagenomic analysis Bioinformatics applied to gene discovery, chemoinformatics, and pathway construction Gene optimization Methods for genome-scale measurements of transcription and metabolomics Systems biology and methods to integrate multiple data sources in vitro and cell-free synthetic biology and molecular programming Nucleic acid engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信