Exploring potential biosafety implications in DNA information storage

IF 3.5 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Shangzhe Li , Yue Shi , Jing Yang , Haizhou Liu , Lijia Jia , Di Liu
{"title":"Exploring potential biosafety implications in DNA information storage","authors":"Shangzhe Li ,&nbsp;Yue Shi ,&nbsp;Jing Yang ,&nbsp;Haizhou Liu ,&nbsp;Lijia Jia ,&nbsp;Di Liu","doi":"10.1016/j.bsheal.2025.03.006","DOIUrl":null,"url":null,"abstract":"<div><div>Deoxyribonucleic acid (DNA) information storage has emerged as a promising solution to address the challenges of traditional silicon-based data storage systems. However, the biosafety implications of artificially synthesized DNA sequences in this technology remain understudied. This research evaluates the biosafety risks associated with five representative DNA storage encoding methods [Church, Goldman, DNA Fountain, Grass, and movable-type (MT) encoding] by analyzing their sequence similarities to natural biological DNA. Through Kraken2 taxonomic classification and Basic Local Alignment Search Tool for nucleotides (BLASTn) alignment analysis, we found that while most artificially designed DNA sequences showed significant differences from known biological sequences, specific encoding methods produced sequences similar to natural genomes. The MT encoding method showed the highest annotation rate (4.59 %) in Kraken2 analysis, while Goldman and Fountain methods demonstrated significant local sequence alignments in BLASTn analysis. Sequence length positively correlated with annotation rates, suggesting longer sequences pose potentially higher biosafety risks. Furthermore, aligned sequences often exhibited characteristics of tandem repeats, particularly in non-coding regions. These findings highlight the importance of incorporating biosafety considerations in DNA storage encoding method development and suggest that randomization strategies may help mitigate potential risks. Our study provides valuable insights into the safe advancement of DNA storage technology and emphasizes the need for comprehensive biosafety evaluation in synthetic biology applications.</div></div>","PeriodicalId":36178,"journal":{"name":"Biosafety and Health","volume":"7 2","pages":"Pages 132-139"},"PeriodicalIF":3.5000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biosafety and Health","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590053625000424","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

Abstract

Deoxyribonucleic acid (DNA) information storage has emerged as a promising solution to address the challenges of traditional silicon-based data storage systems. However, the biosafety implications of artificially synthesized DNA sequences in this technology remain understudied. This research evaluates the biosafety risks associated with five representative DNA storage encoding methods [Church, Goldman, DNA Fountain, Grass, and movable-type (MT) encoding] by analyzing their sequence similarities to natural biological DNA. Through Kraken2 taxonomic classification and Basic Local Alignment Search Tool for nucleotides (BLASTn) alignment analysis, we found that while most artificially designed DNA sequences showed significant differences from known biological sequences, specific encoding methods produced sequences similar to natural genomes. The MT encoding method showed the highest annotation rate (4.59 %) in Kraken2 analysis, while Goldman and Fountain methods demonstrated significant local sequence alignments in BLASTn analysis. Sequence length positively correlated with annotation rates, suggesting longer sequences pose potentially higher biosafety risks. Furthermore, aligned sequences often exhibited characteristics of tandem repeats, particularly in non-coding regions. These findings highlight the importance of incorporating biosafety considerations in DNA storage encoding method development and suggest that randomization strategies may help mitigate potential risks. Our study provides valuable insights into the safe advancement of DNA storage technology and emphasizes the need for comprehensive biosafety evaluation in synthetic biology applications.
探讨DNA信息存储的潜在生物安全意义
脱氧核糖核酸(DNA)信息存储已成为解决传统硅基数据存储系统挑战的一种有前途的解决方案。然而,人工合成DNA序列对生物安全的影响仍有待进一步研究。本研究通过分析5种具有代表性的DNA存储编码方法(Church、Goldman、DNA Fountain、Grass和movable-type (MT)编码)与天然生物DNA的序列相似性,评估了它们的生物安全风险。通过Kraken2分类和BLASTn比对分析,我们发现大多数人工设计的DNA序列与已知的生物序列存在显著差异,但特定的编码方法产生的序列与天然基因组相似。MT编码方法在Kraken2分析中标注率最高(4.59%),而Goldman和Fountain编码方法在BLASTn分析中显示出显著的局部序列比对。序列长度与注释率正相关,表明较长的序列具有较高的潜在生物安全风险。此外,排列序列通常表现出串联重复的特征,特别是在非编码区。这些发现强调了在DNA存储编码方法开发中纳入生物安全考虑的重要性,并表明随机化策略可能有助于减轻潜在风险。本研究为DNA存储技术的安全发展提供了有价值的见解,并强调了在合成生物学应用中进行综合生物安全性评价的必要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Biosafety and Health
Biosafety and Health Medicine-Infectious Diseases
CiteScore
7.60
自引率
0.00%
发文量
116
审稿时长
66 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信