Xiaoluo Huang, Junting Cui, Wei Qiang, Jianwen Ye, Yu Wang, Xinying Xie, Yuanzhen Li, Junbiao Dai
{"title":"Storage-D: A user-friendly platform that enables practical and personalized DNA data storage","authors":"Xiaoluo Huang, Junting Cui, Wei Qiang, Jianwen Ye, Yu Wang, Xinying Xie, Yuanzhen Li, Junbiao Dai","doi":"10.1002/imt2.168","DOIUrl":null,"url":null,"abstract":"<p>Deoxyribonucleic acid (DNA) has been suggested as a very promising medium for data storage in recent years. Although numerous studies have advocated for DNA data storage, its practical application remains obscure and there is a lack of a user-oriented platform. Here, we developed a DNA data storage platform, named Storage-D, which allows users to convert their data into DNA sequences of any length and vice versa by selecting algorithms, error-correction, random-access, and codec pin strategies in terms of their own choice. It incorporates a newly designed “Wukong” algorithm, which provides over 20 trillion codec pins for data privacy use. This algorithm can also control GC content to the selected standard, as well as adjust the homopolymer run length to a defined level, while maintaining a high coding potential of ~1.98 bis/nt, allowing it to outperform previous algorithms. By connecting to a commercial DNA synthesis and sequencing platform with “Storage-D,” we successfully stored “Diagnosis and treatment protocol for COVID-19 patients” into 200 nt oligo pools in vitro, and 500 bp genes in vivo which replicated in both normal and extreme bacteria. Together, this platform allows for practical and personalized DNA data storage, potentially with a wide range of applications.</p>","PeriodicalId":73342,"journal":{"name":"iMeta","volume":"3 2","pages":""},"PeriodicalIF":23.7000,"publicationDate":"2024-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/imt2.168","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"iMeta","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/imt2.168","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Deoxyribonucleic acid (DNA) has been suggested as a very promising medium for data storage in recent years. Although numerous studies have advocated for DNA data storage, its practical application remains obscure and there is a lack of a user-oriented platform. Here, we developed a DNA data storage platform, named Storage-D, which allows users to convert their data into DNA sequences of any length and vice versa by selecting algorithms, error-correction, random-access, and codec pin strategies in terms of their own choice. It incorporates a newly designed “Wukong” algorithm, which provides over 20 trillion codec pins for data privacy use. This algorithm can also control GC content to the selected standard, as well as adjust the homopolymer run length to a defined level, while maintaining a high coding potential of ~1.98 bis/nt, allowing it to outperform previous algorithms. By connecting to a commercial DNA synthesis and sequencing platform with “Storage-D,” we successfully stored “Diagnosis and treatment protocol for COVID-19 patients” into 200 nt oligo pools in vitro, and 500 bp genes in vivo which replicated in both normal and extreme bacteria. Together, this platform allows for practical and personalized DNA data storage, potentially with a wide range of applications.
近年来,脱氧核糖核酸(DNA)被认为是一种非常有前途的数据存储介质。虽然许多研究都提倡 DNA 数据存储,但其实际应用仍然模糊不清,而且缺乏面向用户的平台。在这里,我们开发了一个 DNA 数据存储平台,名为 Storage-D,它允许用户通过选择算法、纠错、随机存取和编解码器针策略,将数据转换成任意长度的 DNA 序列,反之亦然。它采用了新设计的 "悟空 "算法,可提供超过 20 万亿个编解码引脚,用于数据隐私保护。该算法还能将 GC 含量控制在所选标准内,并将同源多聚物的运行长度调整到规定水平,同时保持 ~1.98 bis/nt 的高编码潜力,使其优于以往的算法。通过与商业 DNA 合成和测序平台 "Storage-D "连接,我们成功地将 "COVID-19 患者的诊断和治疗方案 "在体外存储到 200 nt 的寡聚物池中,在体内存储到 500 bp 的基因中,这些基因在正常细菌和极端细菌中都能复制。该平台可实现实用的个性化 DNA 数据存储,具有广泛的应用前景。