Zhi Weng, Jiangxue Li, Yi Wu, Xuehao Xiu, Fei Wang, Xiaolei Zuo, Ping Song, Chunhai Fan
{"title":"Massively parallel homogeneous amplification of chip-scale DNA for DNA information storage (MPHAC-DIS)","authors":"Zhi Weng, Jiangxue Li, Yi Wu, Xuehao Xiu, Fei Wang, Xiaolei Zuo, Ping Song, Chunhai Fan","doi":"10.1038/s41467-025-55986-9","DOIUrl":null,"url":null,"abstract":"<p>Chip scale DNA synthesis offers a high-throughput and cost-effective method for large-scale DNA-based information storage. Nevertheless, unbiased information retrieval from low-copy-number sequences remains a barricade that largely arises from the indispensable DNA amplification. Here, we devise a simulation-guided quantitative primer-template hybridization strategy to realize massively parallel homogeneous amplification of chip-scale DNA for DNA information storage (MPHAC-DIS). Using a fixed-energy primer design, we demonstrate the unbiasedness of MPHAC for amplifying 100,000-plex sequences. Simulations reveal that MPHAC achieves a fold-80 value of 1.0 compared to 3.2 with conventional fixed-length primers, lowering costs by up to four orders of magnitude through reduced over-sequencing. The MPHAC-DIS system using 35,406 encoded oligonucleotide allows simultaneous access of multimedia files including text, images, and videos with high decoding accuracy at very low sequencing depths. Specifically, even a ~ 1 × sequencing depth, with the combination of machine learning, results in an acceptable decoding accuracy of ~80%. The programmable and predictable MPHAC-DIS method thus opens new door for DNA-based large-scale data storage with potential industrial applications.</p>","PeriodicalId":19066,"journal":{"name":"Nature Communications","volume":"2 1","pages":""},"PeriodicalIF":15.7000,"publicationDate":"2025-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Communications","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41467-025-55986-9","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Chip scale DNA synthesis offers a high-throughput and cost-effective method for large-scale DNA-based information storage. Nevertheless, unbiased information retrieval from low-copy-number sequences remains a barricade that largely arises from the indispensable DNA amplification. Here, we devise a simulation-guided quantitative primer-template hybridization strategy to realize massively parallel homogeneous amplification of chip-scale DNA for DNA information storage (MPHAC-DIS). Using a fixed-energy primer design, we demonstrate the unbiasedness of MPHAC for amplifying 100,000-plex sequences. Simulations reveal that MPHAC achieves a fold-80 value of 1.0 compared to 3.2 with conventional fixed-length primers, lowering costs by up to four orders of magnitude through reduced over-sequencing. The MPHAC-DIS system using 35,406 encoded oligonucleotide allows simultaneous access of multimedia files including text, images, and videos with high decoding accuracy at very low sequencing depths. Specifically, even a ~ 1 × sequencing depth, with the combination of machine learning, results in an acceptable decoding accuracy of ~80%. The programmable and predictable MPHAC-DIS method thus opens new door for DNA-based large-scale data storage with potential industrial applications.
期刊介绍:
Nature Communications, an open-access journal, publishes high-quality research spanning all areas of the natural sciences. Papers featured in the journal showcase significant advances relevant to specialists in each respective field. With a 2-year impact factor of 16.6 (2022) and a median time of 8 days from submission to the first editorial decision, Nature Communications is committed to rapid dissemination of research findings. As a multidisciplinary journal, it welcomes contributions from biological, health, physical, chemical, Earth, social, mathematical, applied, and engineering sciences, aiming to highlight important breakthroughs within each domain.