以实验室为中心、基于工作流程的环境 DNA 研究数据管理系统

Research ideas and outcomes Pub Date : 2024-03-28 DOI:10.3897/rio.10.e120483

Alex Borisenko, Robert G. Young, Robert Hanner

{"title":"以实验室为中心、基于工作流程的环境 DNA 研究数据管理系统","authors":"Alex Borisenko, Robert G. Young, Robert Hanner","doi":"10.3897/rio.10.e120483","DOIUrl":null,"url":null,"abstract":"The adoption of environmental DNA approaches as a standard tool for biodiversity monitoring leads to the increase in the number of eDNA-based species occurrence records; however, considerable disparity remains in the nature and quality of associated information, much of it unpublished and/or poorly parametrised. A robust system for tracking biological materials from their point of origin through laboratory analyses is required to connect inferred taxon occurrences with analytical history and provenance data. The bulk of eDNA research is currently driven by small-scale operations where the tasks of digitisation, organisation and cross-referencing field records with laboratory analytical data and biomaterial sample location, are often performed manually and disconnected.\n We present an integrative, full-stack data management solution that provides a structured ontological concept, a minimalist data schema for eDNA research and a software application prototype designed to facilitate real-time digitisation, parsing, annotation and archival of eDNA data. The system tracks the provenance and analytical history of biological samples through a structured hierarchy of events, linked with associated digital file attachment archives, such as images and raw sequence files, and with inferred taxonomic occurrence records. The data entry process is compartmentalised and incorporated into the corresponding stages of standard operations used in fieldwork, biological collection management and laboratory analysis. Resulting data records can be integrated into various output formats required for large-scale analytics, publication and/or submission to global data aggregators. The prototype is implemented on the Microsoft 365 platform as a relational database (Access) linked to cloud-based data tables (SharePoint) and a set of associated data conversion spreadsheets (Excel). The system is designed primarily around the data management needs of small research labs; however, it is scalable to larger institutions and inter-institutional academic networks.","PeriodicalId":92718,"journal":{"name":"Research ideas and outcomes","volume":"138 8","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A lab-centric, workflow-based data management system for environmental DNA research\",\"authors\":\"Alex Borisenko, Robert G. Young, Robert Hanner\",\"doi\":\"10.3897/rio.10.e120483\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The adoption of environmental DNA approaches as a standard tool for biodiversity monitoring leads to the increase in the number of eDNA-based species occurrence records; however, considerable disparity remains in the nature and quality of associated information, much of it unpublished and/or poorly parametrised. A robust system for tracking biological materials from their point of origin through laboratory analyses is required to connect inferred taxon occurrences with analytical history and provenance data. The bulk of eDNA research is currently driven by small-scale operations where the tasks of digitisation, organisation and cross-referencing field records with laboratory analytical data and biomaterial sample location, are often performed manually and disconnected.\\n We present an integrative, full-stack data management solution that provides a structured ontological concept, a minimalist data schema for eDNA research and a software application prototype designed to facilitate real-time digitisation, parsing, annotation and archival of eDNA data. The system tracks the provenance and analytical history of biological samples through a structured hierarchy of events, linked with associated digital file attachment archives, such as images and raw sequence files, and with inferred taxonomic occurrence records. The data entry process is compartmentalised and incorporated into the corresponding stages of standard operations used in fieldwork, biological collection management and laboratory analysis. Resulting data records can be integrated into various output formats required for large-scale analytics, publication and/or submission to global data aggregators. The prototype is implemented on the Microsoft 365 platform as a relational database (Access) linked to cloud-based data tables (SharePoint) and a set of associated data conversion spreadsheets (Excel). The system is designed primarily around the data management needs of small research labs; however, it is scalable to larger institutions and inter-institutional academic networks.\",\"PeriodicalId\":92718,\"journal\":{\"name\":\"Research ideas and outcomes\",\"volume\":\"138 8\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Research ideas and outcomes\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3897/rio.10.e120483\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research ideas and outcomes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3897/rio.10.e120483","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

采用环境 DNA 方法作为生物多样性监测的标准工具，导致基于 eDNA 的物种出现记录数量增加；然而，相关信息的性质和质量仍存在相当大的差异，其中大部分未公开发表和/或参数化不足。需要一个强大的系统来追踪生物材料从原产地到实验室分析的整个过程，以便将推断的分类群出现与分析历史和来源数据联系起来。目前，大部分 eDNA 研究都是由小规模操作驱动的，在小规模操作中，数字化、组织以及将野外记录与实验室分析数据和生物材料样本位置进行交叉比对等任务通常都是手动完成且互不关联。我们提出了一种集成式全栈数据管理解决方案，为 eDNA 研究提供了一个结构化的本体概念、一个简约的数据模式和一个软件应用程序原型，旨在促进 eDNA 数据的实时数字化、解析、注释和存档。该系统通过结构化的事件层次跟踪生物样本的来源和分析历史，并与相关的数字文件附件档案（如图像和原始序列文件）以及推断的分类出现记录相连接。数据录入过程分门别类，并纳入野外工作、生物采集管理和实验室分析中使用的标准操作的相应阶段。结果数据记录可整合为大规模分析、出版和/或提交给全球数据聚合器所需的各种输出格式。原型系统是在微软 365 平台上实施的，它是一个关系数据库（Access），与基于云的数据表（SharePoint）和一套相关的数据转换电子表格（Excel）相连接。该系统主要围绕小型研究实验室的数据管理需求而设计，但也可扩展到大型机构和机构间学术网络。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A lab-centric, workflow-based data management system for environmental DNA research

The adoption of environmental DNA approaches as a standard tool for biodiversity monitoring leads to the increase in the number of eDNA-based species occurrence records; however, considerable disparity remains in the nature and quality of associated information, much of it unpublished and/or poorly parametrised. A robust system for tracking biological materials from their point of origin through laboratory analyses is required to connect inferred taxon occurrences with analytical history and provenance data. The bulk of eDNA research is currently driven by small-scale operations where the tasks of digitisation, organisation and cross-referencing field records with laboratory analytical data and biomaterial sample location, are often performed manually and disconnected. We present an integrative, full-stack data management solution that provides a structured ontological concept, a minimalist data schema for eDNA research and a software application prototype designed to facilitate real-time digitisation, parsing, annotation and archival of eDNA data. The system tracks the provenance and analytical history of biological samples through a structured hierarchy of events, linked with associated digital file attachment archives, such as images and raw sequence files, and with inferred taxonomic occurrence records. The data entry process is compartmentalised and incorporated into the corresponding stages of standard operations used in fieldwork, biological collection management and laboratory analysis. Resulting data records can be integrated into various output formats required for large-scale analytics, publication and/or submission to global data aggregators. The prototype is implemented on the Microsoft 365 platform as a relational database (Access) linked to cloud-based data tables (SharePoint) and a set of associated data conversion spreadsheets (Excel). The system is designed primarily around the data management needs of small research labs; however, it is scalable to larger institutions and inter-institutional academic networks.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Research ideas and outcomes

自引率

0.00%

发文量

审稿时长

2 weeks