OntoBiodiv: Reconnecting Biodiversity Data with Specimens

2022 5th International Conference on Networking, Information Systems and Security: Envisage Intelligent Systems in 5g//6G-based Interconnected Digital Worlds (NISS) Pub Date : 2022-03-30 DOI:10.1109/NISS55057.2022.10085505

D. R. Saleh, Y. Kartika, Zaenal Akbar, A. Krisnadhi, L. Manik


Abstract

Biodiversity data can be produced from preserved specimens, where multiple pieces of information (e.g., taxonomic identification) are extracted from biological samples or materials. A second, observation-based approach collects data digitally without actual biological samples or materials. The observation-based approach has produced far more data than the specimen-based one. However, with recent technological developments, the tangible samples or materials preserved by the specimen-based approach have become gold mines because they open more opportunities for scientific discovery. For example, a new method for genomic investigation can be applied to specimens collected a decade ago, but such an investigation is possible only when the specimens have been preserved. It is therefore necessary to shift the focus of biodiversity data collection toward specimens. Unfortunately, most current biodiversity data standards cover specimens only minimally. This work proposes a schema that extends an existing biodiversity data standard (i.e., Darwin Core) with specimens at its core. The extension covers a variety of specimen data properties, including a generalization of the multiple kinds of information that can be extracted from specimens. A comparison of coverage ratio and matching scores against the existing standard favors the proposed schema: the evaluation results show that it achieves up to 80% higher coverage and the highest exact-match scores for specimen-based biodiversity data. This work initiates our effort to reconnect biodiversity data to specimens.
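The abstract does not say how the coverage ratio is computed. As a rough illustration only, one plausible reading is the fraction of a dataset's fields that have an exact, same-named term in a schema. The sketch below assumes that definition. The function name `coverage_ratio`, the extension terms `tissueType` and `storageTemperature`, and the sample dataset columns are all hypothetical and are not taken from the paper; only the five base term names are real Darwin Core terms.

```python
def coverage_ratio(dataset_fields, schema_terms):
    """Fraction of dataset fields with an exact (case-insensitive) name match in the schema.

    Assumed definition for illustration; the paper may define coverage differently.
    """
    fields = {f.strip().lower() for f in dataset_fields}
    terms = {t.strip().lower() for t in schema_terms}
    if not fields:
        return 0.0
    return len(fields & terms) / len(fields)


# A handful of real Darwin Core term names.
darwin_core = {"catalogNumber", "scientificName", "recordedBy", "eventDate", "preparations"}

# Hypothetical specimen-oriented extension terms, invented for this example.
extended = darwin_core | {"tissueType", "storageTemperature"}

# Hypothetical column names from a specimen-based dataset.
specimen_fields = [
    "catalogNumber",
    "scientificName",
    "preparations",
    "tissueType",
    "storageTemperature",
]

base_coverage = coverage_ratio(specimen_fields, darwin_core)
extended_coverage = coverage_ratio(specimen_fields, extended)
print(f"base: {base_coverage:.2f}, extended: {extended_coverage:.2f}")
```

Under this assumed definition, a schema with more specimen-specific terms scores higher on specimen-heavy datasets, which is the kind of comparison the abstract describes.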
