MISIP: a data standard for the reuse and reproducibility of any stable isotope probing-derived nucleic acid sequence and experiment.

IF 11.8 2区 生物学 Q1 MULTIDISCIPLINARY SCIENCES
Abigayle Simpson, Elisha M Wood-Charlson, Montana Smith, Benjamin J Koch, Kathleen Beilsmith, Jeffrey A Kimbrel, Matthew Kellom, Christopher I Hunter, Ramona L Walls, Lynn M Schriml, Roland C Wilhelm
{"title":"MISIP: a data standard for the reuse and reproducibility of any stable isotope probing-derived nucleic acid sequence and experiment.","authors":"Abigayle Simpson, Elisha M Wood-Charlson, Montana Smith, Benjamin J Koch, Kathleen Beilsmith, Jeffrey A Kimbrel, Matthew Kellom, Christopher I Hunter, Ramona L Walls, Lynn M Schriml, Roland C Wilhelm","doi":"10.1093/gigascience/giae071","DOIUrl":null,"url":null,"abstract":"<p><p>DNA/RNA-stable isotope probing (SIP) is a powerful tool to link in situ microbial activity to sequencing data. Every SIP dataset captures distinct information about microbial community metabolism, process rates, and population dynamics, offering valuable insights for a wide range of research questions. Data reuse maximizes the information derived from the labor and resource-intensive SIP approaches. Yet, a review of publicly available SIP sequencing metadata showed that critical information necessary for reproducibility and reuse was often missing. Here, we outline the Minimum Information for any Stable Isotope Probing Sequence (MISIP) according to the Minimum Information for any (x) Sequence (MIxS) framework and include examples of MISIP reporting for common SIP experiments. Our objectives are to expand the capacity of MIxS to accommodate SIP-specific metadata and guide SIP users in metadata collection when planning and reporting an experiment. The MISIP standard requires 5 metadata fields-isotope, isotopolog, isotopolog label, labeling approach, and gradient position-and recommends several fields that represent best practices in acquiring and reporting SIP sequencing data (e.g., gradient density and nucleic acid amount). The standard is intended to be used in concert with other MIxS checklists to comprehensively describe the origin of sequence data, such as for marker genes (MISIP-MIMARKS) or metagenomes (MISIP-MIMS), in combination with metadata required by an environmental extension (e.g., soil). The adoption of the proposed data standard will improve the reuse of any sequence derived from a SIP experiment and, by extension, deepen understanding of in situ biogeochemical processes and microbial ecology.</p>","PeriodicalId":12581,"journal":{"name":"GigaScience","volume":null,"pages":null},"PeriodicalIF":11.8000,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11471955/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"GigaScience","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/gigascience/giae071","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

DNA/RNA-stable isotope probing (SIP) is a powerful tool to link in situ microbial activity to sequencing data. Every SIP dataset captures distinct information about microbial community metabolism, process rates, and population dynamics, offering valuable insights for a wide range of research questions. Data reuse maximizes the information derived from the labor and resource-intensive SIP approaches. Yet, a review of publicly available SIP sequencing metadata showed that critical information necessary for reproducibility and reuse was often missing. Here, we outline the Minimum Information for any Stable Isotope Probing Sequence (MISIP) according to the Minimum Information for any (x) Sequence (MIxS) framework and include examples of MISIP reporting for common SIP experiments. Our objectives are to expand the capacity of MIxS to accommodate SIP-specific metadata and guide SIP users in metadata collection when planning and reporting an experiment. The MISIP standard requires 5 metadata fields-isotope, isotopolog, isotopolog label, labeling approach, and gradient position-and recommends several fields that represent best practices in acquiring and reporting SIP sequencing data (e.g., gradient density and nucleic acid amount). The standard is intended to be used in concert with other MIxS checklists to comprehensively describe the origin of sequence data, such as for marker genes (MISIP-MIMARKS) or metagenomes (MISIP-MIMS), in combination with metadata required by an environmental extension (e.g., soil). The adoption of the proposed data standard will improve the reuse of any sequence derived from a SIP experiment and, by extension, deepen understanding of in situ biogeochemical processes and microbial ecology.

MISIP:任何稳定同位素探针衍生核酸序列和实验的再利用和可重复性数据标准。
DNA/RNA 稳定同位素探测(SIP)是将原位微生物活动与测序数据联系起来的强大工具。每个 SIP 数据集都能捕捉到有关微生物群落新陈代谢、过程速率和种群动态的独特信息,为广泛的研究问题提供有价值的见解。数据再利用可以最大限度地利用从劳动和资源密集型 SIP 方法中获得的信息。然而,对公开可用的 SIP 测序元数据的审查表明,往往缺少重现和重用所需的关键信息。在此,我们根据任何(x)序列的最低信息量(MIxS)框架,概述了任何稳定同位素探针序列(MISIP)的最低信息量,并列举了常见 SIP 实验的 MISIP 报告实例。我们的目标是扩大 MIxS 的容量,以容纳 SIP 特定的元数据,并指导 SIP 用户在规划和报告实验时收集元数据。MISIP 标准要求 5 个元数据字段--同位素、同位素、同位素标签、标记方法和梯度位置,并推荐了几个代表获取和报告 SIP 测序数据最佳实践的字段(如梯度密度和核酸量)。该标准旨在与其他 MIxS 核对表配合使用,以全面描述序列数据的来源,如标记基因(MISIP-MIMARKS)或元基因组(MISIP-MIMS),并结合环境扩展(如土壤)所需的元数据。采用拟议的数据标准将提高从 SIP 实验中获得的任何序列的再利用率,进而加深对原位生物地球化学过程和微生物生态学的了解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
GigaScience
GigaScience MULTIDISCIPLINARY SCIENCES-
CiteScore
15.50
自引率
1.10%
发文量
119
审稿时长
1 weeks
期刊介绍: GigaScience seeks to transform data dissemination and utilization in the life and biomedical sciences. As an online open-access open-data journal, it specializes in publishing "big-data" studies encompassing various fields. Its scope includes not only "omic" type data and the fields of high-throughput biology currently serviced by large public repositories, but also the growing range of more difficult-to-access data, such as imaging, neuroscience, ecology, cohort data, systems biology and other new types of large-scale shareable data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信