Weiming Lin, Haotian Yu, Weihao Li, Yemin Han, Manman Lv, Han Gao, Mengqing Cheng, Yan Huang, Kun Bi, Zuhong Lu, Quanjun Liu
{"title":"Scaling the High-Yield Potential of Large-Scale DNA Data Storage with Cap-Free DNA Synthesis.","authors":"Weiming Lin, Haotian Yu, Weihao Li, Yemin Han, Manman Lv, Han Gao, Mengqing Cheng, Yan Huang, Kun Bi, Zuhong Lu, Quanjun Liu","doi":"10.1021/acssynbio.5c00175","DOIUrl":null,"url":null,"abstract":"<p><p>DNA has emerged as a promising storage medium for addressing exponentially growing data storage demands, owing to its exceptional information density and chemical stability. However, current DNA synthesis techniques face significant limitations in achieving high-throughput data storage due to constrained synthesis yields. This study presents a novel high-yield cap-free synthesis (HYCFS) strategy that overcomes the limitations of conventional solid-phase phosphoramidite chemistry in both synthesis length and production yield. We established a theoretical product prediction model based on cap-free synthesis characteristics and systematically evaluated the strategy through standard DNA storage workflows. Under column-based synthesis conditions with high coupling efficiency (>99%), this approach demonstrated a 3-fold enhancement in effective sequence yield compared to traditional methods. Theoretical modeling predicts superior performance in array-based synthesis systems for large-scale data storage applications. HYCFS shows potential to enhance DNA-based data storage capacity by 2 orders of magnitude while reducing storage costs, thereby advancing the development of large-scale DNA data storage technologies.</p>","PeriodicalId":26,"journal":{"name":"ACS Synthetic Biology","volume":" ","pages":"2764-2773"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Synthetic Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1021/acssynbio.5c00175","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
DNA has emerged as a promising storage medium for addressing exponentially growing data storage demands, owing to its exceptional information density and chemical stability. However, current DNA synthesis techniques face significant limitations in achieving high-throughput data storage due to constrained synthesis yields. This study presents a novel high-yield cap-free synthesis (HYCFS) strategy that overcomes the limitations of conventional solid-phase phosphoramidite chemistry in both synthesis length and production yield. We established a theoretical product prediction model based on cap-free synthesis characteristics and systematically evaluated the strategy through standard DNA storage workflows. Under column-based synthesis conditions with high coupling efficiency (>99%), this approach demonstrated a 3-fold enhancement in effective sequence yield compared to traditional methods. Theoretical modeling predicts superior performance in array-based synthesis systems for large-scale data storage applications. HYCFS shows potential to enhance DNA-based data storage capacity by 2 orders of magnitude while reducing storage costs, thereby advancing the development of large-scale DNA data storage technologies.
期刊介绍:
The journal is particularly interested in studies on the design and synthesis of new genetic circuits and gene products; computational methods in the design of systems; and integrative applied approaches to understanding disease and metabolism.
Topics may include, but are not limited to:
Design and optimization of genetic systems
Genetic circuit design and their principles for their organization into programs
Computational methods to aid the design of genetic systems
Experimental methods to quantify genetic parts, circuits, and metabolic fluxes
Genetic parts libraries: their creation, analysis, and ontological representation
Protein engineering including computational design
Metabolic engineering and cellular manufacturing, including biomass conversion
Natural product access, engineering, and production
Creative and innovative applications of cellular programming
Medical applications, tissue engineering, and the programming of therapeutic cells
Minimal cell design and construction
Genomics and genome replacement strategies
Viral engineering
Automated and robotic assembly platforms for synthetic biology
DNA synthesis methodologies
Metagenomics and synthetic metagenomic analysis
Bioinformatics applied to gene discovery, chemoinformatics, and pathway construction
Gene optimization
Methods for genome-scale measurements of transcription and metabolomics
Systems biology and methods to integrate multiple data sources
in vitro and cell-free synthetic biology and molecular programming
Nucleic acid engineering.