OntoBiodiv: Reconnecting Biodiversity Data with Specimens

D. R. Saleh, Y. Kartika, Zaenal Akbar, A. Krisnadhi, L. Manik
{"title":"OntoBiodiv: Reconnecting Biodiversity Data with Specimens","authors":"D. R. Saleh, Y. Kartika, Zaenal Akbar, A. Krisnadhi, L. Manik","doi":"10.1109/NISS55057.2022.10085505","DOIUrl":null,"url":null,"abstract":"Biodiversity data can be produced from preserved specimens where multiple pieces of information (e.g., taxonomic identification) will be extracted from biological samples or materials. Another approach, observation-based, collects data digitally without actual biological samples or materials. The latter approach has produced much more data compared to the first one. However, with recent technological developments, the tangible samples or materials preserved by the first approach have become gold mines because they opened more opportunities for scientific discovery. For example, a new method for genomic investigation can be performed on specimens collected a decade ago. However, this new investigation will only be possible with preserved specimens. Therefore, it is necessary to shift the focus of biodiversity data collection to the specimens-oriented. Unfortunately, most of the current biodiversity data standards cover specimens minimally. This work proposes a schema to extend an existing biodiversity data standard (i.e., Darwin Core) where specimens are the core. The extension covers a variety of data properties of specimens, including the generalization of multiple kinds of information that can be obtained by extracting from specimens. Comparing the coverage ratio and matching scores with the existing one reveals the superiority of the proposed schema. The evaluation results show that the proposed schema covers up to 80% higher and has the utmost exact match scores for specimen-based biodiversity data. This work initiates our effort to reconnect biodiversity data to specimens.","PeriodicalId":138637,"journal":{"name":"2022 5th International Conference on Networking, Information Systems and Security: Envisage Intelligent Systems in 5g//6G-based Interconnected Digital Worlds (NISS)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Networking, Information Systems and Security: Envisage Intelligent Systems in 5g//6G-based Interconnected Digital Worlds (NISS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NISS55057.2022.10085505","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Biodiversity data can be produced from preserved specimens where multiple pieces of information (e.g., taxonomic identification) will be extracted from biological samples or materials. Another approach, observation-based, collects data digitally without actual biological samples or materials. The latter approach has produced much more data compared to the first one. However, with recent technological developments, the tangible samples or materials preserved by the first approach have become gold mines because they opened more opportunities for scientific discovery. For example, a new method for genomic investigation can be performed on specimens collected a decade ago. However, this new investigation will only be possible with preserved specimens. Therefore, it is necessary to shift the focus of biodiversity data collection to the specimens-oriented. Unfortunately, most of the current biodiversity data standards cover specimens minimally. This work proposes a schema to extend an existing biodiversity data standard (i.e., Darwin Core) where specimens are the core. The extension covers a variety of data properties of specimens, including the generalization of multiple kinds of information that can be obtained by extracting from specimens. Comparing the coverage ratio and matching scores with the existing one reveals the superiority of the proposed schema. The evaluation results show that the proposed schema covers up to 80% higher and has the utmost exact match scores for specimen-based biodiversity data. This work initiates our effort to reconnect biodiversity data to specimens.
OntoBiodiv:用标本重新连接生物多样性数据
生物多样性数据可以从保存的标本中产生,其中将从生物样品或材料中提取多条信息(例如,分类鉴定)。另一种方法是基于观察的,它以数字方式收集数据,而不需要实际的生物样本或材料。后一种方法比第一种方法产生了更多的数据。然而,随着最近技术的发展,通过第一种方法保存的有形样品或材料已经成为金矿,因为它们为科学发现提供了更多的机会。例如,可以对十年前收集的标本进行基因组调查的新方法。然而,这项新的研究只能在保存完好的标本上进行。因此,有必要将生物多样性数据收集的重点转向以标本为导向。不幸的是,目前大多数生物多样性数据标准对标本的覆盖程度很低。这项工作提出了一个模式来扩展现有的生物多样性数据标准(即达尔文核心),其中标本是核心。该扩展涵盖了标本的多种数据属性,包括从标本中提取可获得的多种信息的泛化。通过与现有模式的覆盖率和匹配分数的比较,揭示了所提模式的优越性。评价结果表明,该模式对基于标本的生物多样性数据具有最高的精确匹配分数,覆盖范围提高了80%。这项工作开启了我们将生物多样性数据与标本重新联系起来的努力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信