Storage-D: A user-friendly platform that enables practical and personalized DNA data storage

Impact factor: 23.7 (JCR Q1, Microbiology)
iMeta. Publication date: 2024-01-21. DOI: 10.1002/imt2.168
Xiaoluo Huang, Junting Cui, Wei Qiang, Jianwen Ye, Yu Wang, Xinying Xie, Yuanzhen Li, Junbiao Dai
iMeta, Volume 3, Issue 2. Full text: https://onlinelibrary.wiley.com/doi/10.1002/imt2.168
Citation count: 0

Abstract

Deoxyribonucleic acid (DNA) has emerged in recent years as a promising medium for data storage. Although numerous studies have advocated DNA data storage, its practical application remains unclear, and a user-oriented platform is lacking. Here, we developed a DNA data storage platform, named Storage-D, which allows users to convert their data into DNA sequences of any length, and vice versa, by selecting the algorithm, error-correction, random-access, and codec pin strategies of their choice. It incorporates a newly designed “Wukong” algorithm, which provides over 20 trillion codec pins for data privacy. This algorithm can also hold GC content to a selected standard and limit homopolymer run length to a defined level while maintaining a high coding potential of ~1.98 bits/nt, allowing it to outperform previous algorithms. By connecting Storage-D to a commercial DNA synthesis and sequencing platform, we successfully stored the “Diagnosis and treatment protocol for COVID-19 patients” in 200 nt oligo pools in vitro and in 500 bp genes in vivo, which replicated in both normal and extreme bacteria. Together, this platform allows for practical and personalized DNA data storage, potentially with a wide range of applications.
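
To make the two sequence constraints named in the abstract concrete, the following is a minimal Python sketch, not the Wukong algorithm itself, whose internals the abstract does not describe. It assumes a fixed, hypothetical mapping of 2 bits per nucleotide (the theoretical ceiling that the reported ~1.98 bits/nt approaches) and then measures GC content and the longest homopolymer run of the result. The names `BITS_TO_BASE`, `encode`, `decode`, `gc_content`, `max_homopolymer_run`, and `passes_constraints`, and the default thresholds, are illustrative assumptions rather than part of Storage-D.

```python
from itertools import groupby

# Hypothetical fixed mapping of 2 bits per base (an assumption for illustration;
# this is NOT the Wukong algorithm and involves no codec pin).
BITS_TO_BASE = {"00": "A", "01": "C", "10": "G", "11": "T"}
BASE_TO_BITS = {base: bits for bits, base in BITS_TO_BASE.items()}


def encode(data: bytes) -> str:
    """Map each byte to four nucleotides (2 bits per nucleotide)."""
    bits = "".join(f"{byte:08b}" for byte in data)
    return "".join(BITS_TO_BASE[bits[i:i + 2]] for i in range(0, len(bits), 2))


def decode(seq: str) -> bytes:
    """Invert encode(): four nucleotides back to one byte."""
    bits = "".join(BASE_TO_BITS[base] for base in seq)
    return bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (seq must be non-empty)."""
    return sum(base in "GC" for base in seq) / len(seq)


def max_homopolymer_run(seq: str) -> int:
    """Length of the longest run of one repeated base (seq must be non-empty)."""
    return max(len(list(group)) for _, group in groupby(seq))


def passes_constraints(seq: str, gc_min: float = 0.4, gc_max: float = 0.6,
                       max_run: int = 3) -> bool:
    """Check a sequence against illustrative GC and homopolymer limits."""
    return gc_min <= gc_content(seq) <= gc_max and max_homopolymer_run(seq) <= max_run


if __name__ == "__main__":
    seq = encode(b"Hi")
    print(seq, gc_content(seq), max_homopolymer_run(seq), passes_constraints(seq))
    print(decode(seq))
```

A naive mapping like this gives no control over the output: `encode(b"Hi")` yields a sequence with 75% GC, which fails the illustrative 40-60% window. A constraint-aware encoder has to spend a small amount of capacity to steer sequences inside such limits, which is consistent with a reported density slightly below 2 bits/nt.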

