Automated Design of Oligopools and Rapid Analysis of Massively Parallel Barcoded Measurements.

IF 3.7 | Zone 2, Biology | JCR Q1, BIOCHEMICAL RESEARCH METHODS
ACS Synthetic Biology Pub Date : 2024-12-20 Epub Date: 2024-12-06 DOI:10.1021/acssynbio.4c00661
Ayaan Hossain, Daniel P Cetnar, Travis L LaFleur, James R McLellan, Howard M Salis
ACS Synthetic Biology, pages 4218-4232. Publication type: Journal Article. DOI link: https://doi.org/10.1021/acssynbio.4c00661 PDF (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11669329/pdf/

Abstract

Oligopool synthesis and next-generation sequencing enable the construction and characterization of large libraries of designed genetic parts and systems. As library sizes grow, it becomes computationally challenging to optimally design large numbers of primer binding sites, barcode sequences, and overlap regions to obtain efficient assemblies and precise measurements. We present the Oligopool Calculator, an end-to-end suite of algorithms and data structures that rapidly designs many thousands of oligonucleotides within an oligopool and rapidly analyzes many billions of barcoded sequencing reads. We introduce several novel concepts that greatly increase the design and analysis throughput, including orthogonally symmetric barcode design, adaptive decision trees for primer design, a Scry barcode classifier, and efficient read packing. We demonstrate the Oligopool Calculator's capabilities across computational benchmarks and real-data projects, including the design of over four million highly unique and compact barcodes in 1.2 h, the design of universal primer binding sites for one million 200-mer oligos in 15 min, and the analysis of about 500 million deep sequencing reads per hour, all on an 8-core desktop computer. Overall, the Oligopool Calculator accelerates the creative use of massively parallel experiments by eliminating the computational complexity of their design and analysis.
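
To make the barcode design and read classification problem in the abstract concrete, here is a minimal Python sketch. It is not the Oligopool Calculator's orthogonally symmetric barcode design or its Scry classifier, whose details are not given in the abstract. It substitutes a plain greedy random search that enforces a minimum pairwise Hamming distance, plus a brute-force lookup. The function names (`design_barcodes`, `classify`) and all parameters are hypothetical and chosen only for illustration.

```python
import random

ALPHABET = "ACGT"


def hamming(a, b):
    """Number of mismatched positions between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b))


def design_barcodes(count, length, min_distance, seed=0, max_tries=100_000):
    """Greedy random search (an illustrative stand-in, not the paper's method).

    Accepts a random candidate only if it differs from every accepted
    barcode in at least `min_distance` positions.
    """
    rng = random.Random(seed)
    barcodes = []
    for _ in range(max_tries):
        if len(barcodes) == count:
            break
        candidate = "".join(rng.choice(ALPHABET) for _ in range(length))
        if all(hamming(candidate, b) >= min_distance for b in barcodes):
            barcodes.append(candidate)
    if len(barcodes) < count:
        raise RuntimeError("could not place all barcodes; relax the constraints")
    return barcodes


def classify(read_barcode, barcodes, max_errors):
    """Brute-force lookup: return the unique barcode within `max_errors`
    substitutions of the read, or None if there is no unique match."""
    hits = [b for b in barcodes if hamming(read_barcode, b) <= max_errors]
    return hits[0] if len(hits) == 1 else None


if __name__ == "__main__":
    pool = design_barcodes(count=20, length=10, min_distance=3, seed=1)
    # Introduce one substitution into the first barcode and recover it.
    noisy = ("A" if pool[0][0] != "A" else "C") + pool[0][1:]
    print(classify(noisy, pool, max_errors=1) == pool[0])
```

With a minimum pairwise distance d, any read carrying at most (d - 1) // 2 substitutions lies closer to its true barcode than to any other, which is why the lookup above is unambiguous for `max_errors=1` when `min_distance=3`. Both functions scale quadratically or worse, so they only illustrate the constraint being solved, not the throughput reported in the abstract.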

Source journal: ACS Synthetic Biology
CiteScore: 8.00
Self-citation rate: 10.60%
Articles published: 380
Review time: 6-12 weeks
About the journal: The journal is particularly interested in studies on the design and synthesis of new genetic circuits and gene products; computational methods in the design of systems; and integrative applied approaches to understanding disease and metabolism. Topics may include, but are not limited to: design and optimization of genetic systems; genetic circuit design and the principles for organizing circuits into programs; computational methods to aid the design of genetic systems; experimental methods to quantify genetic parts, circuits, and metabolic fluxes; genetic parts libraries, including their creation, analysis, and ontological representation; protein engineering, including computational design; metabolic engineering and cellular manufacturing, including biomass conversion; natural product access, engineering, and production; creative and innovative applications of cellular programming; medical applications, tissue engineering, and the programming of therapeutic cells; minimal cell design and construction; genomics and genome replacement strategies; viral engineering; automated and robotic assembly platforms for synthetic biology; DNA synthesis methodologies; metagenomics and synthetic metagenomic analysis; bioinformatics applied to gene discovery, chemoinformatics, and pathway construction; gene optimization; methods for genome-scale measurements of transcription and metabolomics; systems biology and methods to integrate multiple data sources; in vitro and cell-free synthetic biology and molecular programming; and nucleic acid engineering.