Carrie Andrew, Sharif Islam, Claus Weiland, Dag Endresen
{"title":"用于组织和传播复杂研究项目和数字双胞胎的生物多样性数据标准:指南","authors":"Carrie Andrew, Sharif Islam, Claus Weiland, Dag Endresen","doi":"arxiv-2405.19857","DOIUrl":null,"url":null,"abstract":"Biodiversity data are substantially increasing, spurred by technological\nadvances and community (citizen) science initiatives. To integrate data is,\nlikewise, becoming more commonplace. Open science promotes open sharing and\ndata usage. Data standardization is an instrument for the organization and\nintegration of biodiversity data, which is required for complex research\nprojects and digital twins. However, just like with an actual instrument, there\nis a learning curve to understanding the data standards field. Here we provide\na guide, for data providers and data users, on the logistics of compiling and\nutilizing biodiversity data. We emphasize data standards, because they are\nintegral to data integration. Three primary avenues for compiling biodiversity\ndata are compared, explaining the importance of research infrastructures for\ncoordinated long-term data aggregation. We exemplify the Biodiversity Digital\nTwin (BioDT) as a case study. Four approaches to data standardization are\npresented in terms of the balance between practical constraints and the\nadvancement of the data standards field. We aim for this paper to guide and\nraise awareness of the existing issues related to data standardization, and\nespecially how data standards are key to data interoperability, i.e., machine\naccessibility. The future is promising for computational biodiversity\nadvancements, such as with the BioDT project, but it rests upon the shoulders\nof machine actionability and readability, and that requires data standards for\ncomputational communication.","PeriodicalId":501219,"journal":{"name":"arXiv - QuanBio - Other Quantitative Biology","volume":"88 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Biodiversity data standards for the organization and dissemination of complex research projects and digital twins: a guide\",\"authors\":\"Carrie Andrew, Sharif Islam, Claus Weiland, Dag Endresen\",\"doi\":\"arxiv-2405.19857\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Biodiversity data are substantially increasing, spurred by technological\\nadvances and community (citizen) science initiatives. To integrate data is,\\nlikewise, becoming more commonplace. Open science promotes open sharing and\\ndata usage. Data standardization is an instrument for the organization and\\nintegration of biodiversity data, which is required for complex research\\nprojects and digital twins. However, just like with an actual instrument, there\\nis a learning curve to understanding the data standards field. Here we provide\\na guide, for data providers and data users, on the logistics of compiling and\\nutilizing biodiversity data. We emphasize data standards, because they are\\nintegral to data integration. Three primary avenues for compiling biodiversity\\ndata are compared, explaining the importance of research infrastructures for\\ncoordinated long-term data aggregation. We exemplify the Biodiversity Digital\\nTwin (BioDT) as a case study. Four approaches to data standardization are\\npresented in terms of the balance between practical constraints and the\\nadvancement of the data standards field. We aim for this paper to guide and\\nraise awareness of the existing issues related to data standardization, and\\nespecially how data standards are key to data interoperability, i.e., machine\\naccessibility. The future is promising for computational biodiversity\\nadvancements, such as with the BioDT project, but it rests upon the shoulders\\nof machine actionability and readability, and that requires data standards for\\ncomputational communication.\",\"PeriodicalId\":501219,\"journal\":{\"name\":\"arXiv - QuanBio - Other Quantitative Biology\",\"volume\":\"88 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Other Quantitative Biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2405.19857\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Other Quantitative Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.19857","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Biodiversity data standards for the organization and dissemination of complex research projects and digital twins: a guide
Biodiversity data are substantially increasing, spurred by technological
advances and community (citizen) science initiatives. To integrate data is,
likewise, becoming more commonplace. Open science promotes open sharing and
data usage. Data standardization is an instrument for the organization and
integration of biodiversity data, which is required for complex research
projects and digital twins. However, just like with an actual instrument, there
is a learning curve to understanding the data standards field. Here we provide
a guide, for data providers and data users, on the logistics of compiling and
utilizing biodiversity data. We emphasize data standards, because they are
integral to data integration. Three primary avenues for compiling biodiversity
data are compared, explaining the importance of research infrastructures for
coordinated long-term data aggregation. We exemplify the Biodiversity Digital
Twin (BioDT) as a case study. Four approaches to data standardization are
presented in terms of the balance between practical constraints and the
advancement of the data standards field. We aim for this paper to guide and
raise awareness of the existing issues related to data standardization, and
especially how data standards are key to data interoperability, i.e., machine
accessibility. The future is promising for computational biodiversity
advancements, such as with the BioDT project, but it rests upon the shoulders
of machine actionability and readability, and that requires data standards for
computational communication.