{"title":"Progress with Repository-based Annotation Infrastructure for Biodiversity Applications","authors":"Peter Cornwell","doi":"10.3897/biss.7.112707","DOIUrl":"https://doi.org/10.3897/biss.7.112707","url":null,"abstract":"Rapid development since the 1980s of technologies for analysing texts, has led not only to widespread employment of text 'mining', but also to now-pervasive large language model artificial intelligence (AI) applications. However, building new, concise, data resources from historic, as well as contemporary scientific literature, which can be employed efficiently at scale by automation and which have long-term value for the research community, has proved more elusive. Efforts at codifying analyses, such as the Text Encoding Initiative (TEI), date from the early 1990s and were initially driven by the social sciences and humanities (SSH) and linguistics communities, and extended with multiple XML-based tagging schemes, including in biodiversity (Miller et al. 2012). In 2010, the Bio-Ontologies Special Interest Group (of the International Society for Computational Biology) presented its Annotation Ontology (AO), incorporating JavaScript Object Notation and broadening previous XML-based approaches (Ciccarese et al. 2011). From 2011, the Open Annotation Data Model (OADM) (Sanderson et al. 2013) focused on cross-domain standards with utility for Web 3.0, leading to the W3C Web Annotation Data Model (WADM) Recommendation in February 2017*1 and the potential for unifying the multiplicity of already-in-use tagging approaches. This continual evolution has made the preservation of investment using annotation methods, and in particular of the connections between annotations and their context in source literature, particularly challenging. Infrastructure that entered service during the intervening years does not yet support WADM, and has only recently started to address the parallel emergence of page imagery-based standards such as the International Image Interoperability Framework (IIIF). Notably, IIIF instruments such as Mirador-2, which has been employed widely for manual creation and editing of annotations in SSH, continue to employ the now-deprecated OADM. Although multiple efforts now address combining IIIF and TEI text coordinate systems, they are currently fundamentally incompatible. However, emerging repository technologies enable preservation of annotation investment to be accomplished comprehensively for the first time. Native IIIF support enables interactive previewing of annotations within repository graphical user interfaces and dynamic serialisation technologies provide compatibility with existing XML-based infrastructures. Repository access controls can permit experts to trace annotation sources in original texts even if the literature is not publicly accessible, e.g., due to copyright restriction. This is of paramount importance, not only because surrounding context can be crucial to qualify formal terms that have been annotated, such as collecting country. Also, contemporary automated text mining—essential for operation at the scale of known biodiversity literature—is not 100% accurate and manual checking of uncertainties is currently essentia","PeriodicalId":9011,"journal":{"name":"Biodiversity Information Science and Standards","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134911890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Richardson Ciguene, Aurélien Miralles, Francis Clément
{"title":"Application of Fuzzy Measures to Move Towards Cyber-Taxonomy","authors":"Richardson Ciguene, Aurélien Miralles, Francis Clément","doi":"10.3897/biss.7.112677","DOIUrl":"https://doi.org/10.3897/biss.7.112677","url":null,"abstract":"The species inventory of global biodiversity is constantly revised and refined by taxonomic research, through the addition of newly discovered species and the reclassification of known species. This almost three-century-old project provides essential knowledge for humankind. In particular, knowledge of biodiversity establishes a foundation for developing appropriate conservation strategies. An accurate global inventory of species relies on the study of millions of specimens housed all around the world in natural history collections. For the last two decades, biological taxonomy has generated an increasing amount of data every year, and notably through the digitization of collection specimens, has gradually been transformed into a big data science. In recognition of this trend, the French National Museum of Natural History has embarked on a major research and engineering challenge within its information system: the adoption of cyber-taxonomic practices that require easy access to data on specimens housed in natural history collections all over the world. To this end, an important step is to automatically complete and reconcile the heterogeneous classification data usually associated with specimens managed in different collection databases. We describe here a new fuzzy approach to reconciling the classifications in multiple databases, enabling more accurate taxonomic retrieval of specimen data across databases.","PeriodicalId":9011,"journal":{"name":"Biodiversity Information Science and Standards","volume":"2021 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134912505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Migration of the Catalogue of Afrotropical Bees into TaxonWorks","authors":"Dmitry Dmitriev, Connal Eardley, Willem Coetzer","doi":"10.3897/biss.7.112702","DOIUrl":"https://doi.org/10.3897/biss.7.112702","url":null,"abstract":"The Catalogue of Afrotropical Bees provides a comprehensive checklist of species of bees known from Sub-Saharan Africa and the western Indian Ocean islands, excluding the honey bee ( Apis mellifera Linnaeus) (Eardley and Urban 2010). The checklist has a detailed bibliography of the group, distribution records, and biological associations (visited flowers, host plants, plants used as nests, as well as parasitoids). The database, which was originally built in Microsoft Access, and later managed using Specify Software, was recently migrated to TaxonWorks. TaxonWorks is an integrated, web-based platform designed specifically for the needs of practicing taxonomists and biodiversity scientists, and maintained by the SpeciesFile Group. TaxonWorks has a variety of tools that were designed to help import, manage, validate, and package data for future exports (e.g., in the Darwin Core-Archives (DwC-A; GBIF 2021) or Catalogue of Life's COL-DP formats). Although TaxonWorks has batch upload functionality (e.g., in Darwin Core-Archive, BibTeX format), the complexity of the original dataset (Fig. 1) required special handling, and a custom migration was built to transfer the data from the original format. TaxonWorks could now be used to produce a paper-style catalogue or share the data via the TaxonWorks public interface, TaxonPages.","PeriodicalId":9011,"journal":{"name":"Biodiversity Information Science and Standards","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134912817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"I Know Something You Don’t Know: The annotation saga continues…","authors":"James Macklin, David Shorthouse, Falko Glöckler","doi":"10.3897/biss.7.112715","DOIUrl":"https://doi.org/10.3897/biss.7.112715","url":null,"abstract":"Over the past 20 years, the biodiversity informatics community has pursued components of the digital annotation landscape with varying degrees of success. We will provide an historical overview of the theory, the advancements made through a few key projects, and will identify some of the ongoing challenges and opportunities. The fundamental principles remain unchanged since annotations were first proposed. Someone (or something): (1) has an enhancement to make elsewhere from the source where original data or information are generated or transcribed; (2) wishes to broadcast these statements to the originator and to others who may benefit; and (3) expects persistence, discoverability, and attribution for their contributions alongside the source. The Filtered Push project (Morris et al. 2013) considered several use cases and pioneered development of services based on the technology of the day. The exchange of data between parties in a universally consistent way necessitated the development of a novel draft standard for data annotations via an extension of the World Wide Web Consortium’s Web Annotation Working Group standard (Sanderson et al. 2013) to be sufficiently informative for a data curator to confidently make a decision. Figure 2 from Morris et al. (2013), reproduced here as Fig. 1, outlines the composition of an annotation data package for a taxonomic identification. The package contains the data object(s) associated with an occurrence, an expression of the motivation(s) for updating, some evidence for an assertion, and a stated expectation for how the receiving entity should take action. The Filtered Push and Annosys (Tschöpe et al. 2013) projects also considered implementation strategies involving collection management systems (e.g., Symbiota) and portals (e.g., European Distributed Institute of Taxonomy, EDIT). However, there remain technological barriers for these systems to operate at scale, the least of which is the absence of globally unique, persistent, resolvable identifiers for shared objects and concepts. Major aggregation infrastructures like the Global Biodiversity Information Facility (GBIF) and the Distributed System of Scientific Collections (DiSSCo) rely on data enhancement to improve the quality of their resources and have annotation services in their work plans. More recently, the Digital Extended Specimen (DES) concept (Hardisty et al. 2022) will rely on annotation services as key components of the proposed infrastructure. Recent work on annotation services more generally has considered various new forms of packaging and delivery such as Frictionless Data (Fowler et al. 2018), Journal Article Tag Suite XML (Agosti et al. 2022), or nanopublications (Kuhn et al. 2018). There is risk in fragmentation of this landscape and disenfranchisement of both biological collections and the wider research community if we fail to align the purpose, content, and structure of these packages or if these fail to remain aligned with FAIR prin","PeriodicalId":9011,"journal":{"name":"Biodiversity Information Science and Standards","volume":"214 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134913130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Amplifying the Power of eDNA by Making it FAIR","authors":"Miwa Takahashi, Oliver Berry","doi":"10.3897/biss.7.112553","DOIUrl":"https://doi.org/10.3897/biss.7.112553","url":null,"abstract":"Environmental DNA (eDNA) is a fast-growing biomonitoring approach to detect species and map their distributions, with the number of eDNA publications exponentially increasing in the past decade. While millions of DNA sequences are often generated and assigned to taxa in each publication, these records are stored in numerous locations (e.g., supplementary materials at journals’ servers, open data publishing platforms such as Dryad) and in various formats, which makes it difficult to find, access, re-use and integrate datasets. Making eDNA data FAIR (findable, accessible, interoperable, re-usable) has vast potential to improve how the biological environment is measured and how change is detected and understood. For instance, it would allow biomonitoring and species distribution modelling studies across extended space and time scales, which is logistically difficult or impossible for individual projects. It would also shed light on “dark” (unassigned) DNA sequences by facilitating their storage and re-analyses with updated ever-growing DNA reference databases. Several challenges are associated with making eDNA FAIR, including how to standardise data formats and bioinformatics workflows, and simplifying the process of post-publication data archiving so that it is acceptable for eDNA practitioners to adopt. Over the next three years, we plan to work closely with biodiversity data platforms such as the Global Biodiversity Information Facility (GBIF) and Atlas of Living Austrlia (ALA), eDNA science journals, and eDNA practitioners, to solve these challenges and enable eDNA to achieve its revolutionary potential as a unified source of information that supports environmental management.","PeriodicalId":9011,"journal":{"name":"Biodiversity Information Science and Standards","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135689431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Next Steps Towards Better Living Atlas Deployments and Maintenance","authors":"David Martin, Vicente Ruiz Jurado","doi":"10.3897/biss.7.112560","DOIUrl":"https://doi.org/10.3897/biss.7.112560","url":null,"abstract":"The Living Atlases project, facilitated by the Global Biodiversity Information Facility (GBIF) and the Atlas of Living Australia (ALA), has successfully operated for more than a decade, establishing collaborations with over 30 countries who have implemented ALA components within their respective environments. Over this period, technological advancements and the prevalence of cloud platforms have transformed the landscape of infrastructure management. In this presentation, we will explore innovative approaches to streamline the installation process of ALA, capitalizing on the benefits offered by cloud platforms and cutting-edge technologies. Furthermore, ALA has maintained a strong collaborative partnership with GBIF over the past four years, focusing on data ingestion pipelines and, more recently, engaging in shared user interface (UI) development. These improvements aim to enhance the maintainability of ALA modules, enabling organizations to leverage the advantages provided by cloud-based solutions and novel technologies.","PeriodicalId":9011,"journal":{"name":"Biodiversity Information Science and Standards","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135878450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Olaf Bánki, Markus Döring, Thomas Jeppesen, Donald Hobern
{"title":"Demonstration of Taxonomic Name Data Services through ChecklistBank","authors":"Olaf Bánki, Markus Döring, Thomas Jeppesen, Donald Hobern","doi":"10.3897/biss.7.112544","DOIUrl":"https://doi.org/10.3897/biss.7.112544","url":null,"abstract":"ChecklistBank is a publishing platform and open data repository focused on taxonomic and nomenclatural datasets. It was launched at the end of 2020, and is a joint development by Catalogue of Life (COL) and the Global Biodiversity Information Facility (GBIF). Close to 50K datasets, mostly originating from published literature mediated through Plazi's TreatmentBank, Pensoft Publishers and the European Journal of Taxonomy, are openly accessible through ChecklistBank. Data sets also include sources with (Molecular) Operational Taxonomic Units, such as from UNITE / PlutoF, National Center for Biotechnology Information Taxonomy / European Nuclotide Archive, and the International Barcode of Life / BoLD. Next to various taxonomic datasets (also from regional and national levels, e.g., shared through GBIF) and nomenclatural datasets (e.g., Zoobank, International Plant Names Index), ChecklistBank also links out to the various original initiatives websites (e.g., World Register of Marine Species, Integrated Taxonomic Information System, COL China, Species Files). ChecklistBank also holds all the tooling that is needed to assemble the COL Checklist, the authoritative global species list of all described organisms. The COL Checklist 2023 (Bánki et al. 2023), containing more than 2.1 million accepted species, is assembled from 164 global taxonomic data sources mediated through ChecklistBank. The COL Checklist contains name usage identifiers, and each checklist version and its underpinning data sources are issued with digital object identifiers. After the launching of ChecklistBank, the EU funded Biodiversity Community Integrated Knowledge Library (BiCIKL) project contributed to additional improvements to ChecklistBank. These added functionalities include, amongst others, a name usage search, name match, and a taxonomic data comparison. The tooling used to assemble the COL Checklist has been generalised through a ChecklistBank 'project functionality' supporting the assembly of a species list. During the demonstration, several of the functionalities, developed in the context of the EU BiCIKL project, will be highlighted.","PeriodicalId":9011,"journal":{"name":"Biodiversity Information Science and Standards","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135831397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pierre Bonnet, Antoine Affouard, Jean-Christophe Lombardo, Mathias Chouet, Hugo Gresse, Vanessa Hequet, Remi Palard, Maxime Fromholtz, Vincent Espitalier, Hervé Goëau, Benjamin Deneu, Christophe Botella, Joaquim Estopinan, César Leblanc, Maximilien Servajean, François Munoz, Alexis Joly
{"title":"Synergizing Digital, Biological, and Participatory Sciences for Global Plant Species Identification: Enabling access to a worldwide identification service","authors":"Pierre Bonnet, Antoine Affouard, Jean-Christophe Lombardo, Mathias Chouet, Hugo Gresse, Vanessa Hequet, Remi Palard, Maxime Fromholtz, Vincent Espitalier, Hervé Goëau, Benjamin Deneu, Christophe Botella, Joaquim Estopinan, César Leblanc, Maximilien Servajean, François Munoz, Alexis Joly","doi":"10.3897/biss.7.112545","DOIUrl":"https://doi.org/10.3897/biss.7.112545","url":null,"abstract":"Human activities have a growing impact on global biodiversity. While our understanding of biodiversity worldwide is not yet comprehensive, it is crucial to explore effective means of characterizing it in order to mitigate these impacts. The advancements in data storage, exchange capabilities, and the increasing availability of extensive taxonomic, ecological, and environmental databases offer possibilities for implementing new approaches that can address knowledge gaps regarding species and habitats. This enhanced knowledge will, in turn, facilitate improved management practices and enable better local governance of territories. Meeting these requirements necessitates the development of innovative tools and methods to respond to these needs. Citizen science platforms have emerged as valuable resources for generating large amounts of biodiversity data, thanks to their visibility and attractiveness to individuals involved in territorial management and education. These platforms present new opportunities to train deep learning models for automated species recognition, leveraging the substantial volumes of multimedia data they accumulate. However, effectively managing, curating, and disseminating the data and services generated by these platforms remains a significant challenge that hinders the achievement of their objectives. In line with this, the GUARDEN and MAMBO European projects aim to utilize the Pl@ntNet participatory science platform (Affouard et al. 2021) to develop and implement novel computational services to enable the widespread creation of floristic inventories. In the pursuit of this project, various standards and reference datasets have been employed, such as the POWO (Plants of the World Online) world checklist and the WGSRPD (World Geographical Scheme for Recording Plant Distributions) standard, to establish a foundation for creating a global service that aids in plant identification through visual analysis. This service relies on a NoSQL (Not Only Structured Query Language) data management system ArangoDB (Arango Database), utilizes state-of-the-art automated visual classification models (vision transformers), and operates on a distributed IT (Information Technology) infrastructure that leverages the capabilities of collaborative stakeholders interested in supporting this initiative. Global-scale automated workflows have been established specifically for the collection, analysis, and dissemination of illustrated occurrences of plant species. These workflows now enable the development of new IT tools that facilitate the description and monitoring of species and habitat conservation statuses. A comprehensive presentation highlighting the significant advancements achieved will be provided to share the lessons learned during its development and ensure the widespread adoption of this service within the scientific community.","PeriodicalId":9011,"journal":{"name":"Biodiversity Information Science and Standards","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135878792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Maui Hudson, Jane Anderson, Riley Taitingfong, Andrew Martinez, Stephanie Carroll
{"title":"Recognising Indigenous Provenance in Biodiversity Records","authors":"Maui Hudson, Jane Anderson, Riley Taitingfong, Andrew Martinez, Stephanie Carroll","doi":"10.3897/biss.7.112610","DOIUrl":"https://doi.org/10.3897/biss.7.112610","url":null,"abstract":"The advent of data-driven technologies and the increasing demand for data have brought about unique challenges for Indigenous data governance. The CARE principles emphasize Collective Benefit, Authority, Responsibility, and Ethics as essential pillars for ensuring that Indigenous data rights are upheld, Indigenous knowledge is protected, and Indigenous Peoples are active participants in data governance processes (Carroll et al. 2020, Carroll et al. 2021). Identifying tangible activities and providing guidance to centre Indigenous perspectives provide a comprehensive approach to address the complexities of Indigenous data governance in a rapidly evolving data landscape (Gupta et al. 2023, Jennings et al. 2023, Sterner and Elliott 2023). Biodiversity research has increasingly recognized the intertwined relationship between biological diversity and cultural practices, leading to discussions about how research can strengthen the evidence base, build trust, enhance legitimacy for decision making (Alexander et al. 2021) and explore requirements for Indigenous metadata (Jennings et al. 2023). An Indigenous Metadata Bundle Communique, produced following an Indigenous Metadata Symposium, recommended the initial categories as: Governance, Provenance, Lands & Waters, Protocols, and Local Contexts Notices & Labels. Traditional Knowledge (TK) and Biocultural (BC) Labels have emerged as essential tools for recognising and maintaining Indigenous provenance, protocols and permissions in records for both natural ecosystems and cultural heritage (Anderson et al. 2020, Liggins et al. 2021) emphasizing the importance of Indigenous Peoples and local knowledge systems in research and digital management. Biocultural labels acknowledge the intricate links between biodiversity and cultural diversity, emphasizing the role of indigenous communities in preserving biodiversity through their traditional practices (Hudson et al. 2021). By recognizing the intrinsic value of these relationships, TK and BC labels not only contribute to a more holistic understanding of biodiversity but also promote ethical considerations and mutual respect between researchers and local communities, fostering collaborative partnerships for research and conservation initiatives (McCartney et al. 2023). Addressing the CARE Principles for Indigenous Data Governance in biodiversity research introduces several challenges and opportunities. Ethical concerns regarding recognition of Indigenous rights and interests in data (Hudson et al. 2023), intellectual property rights, cultural appropriation, and equitable benefit sharing, must be navigated sensitively (Carroll et al. 2022b, Golan et al. 2022). Moreover, fostering effective communication between researchers and communities is paramount for ensuring the accuracy and authenticity of Indigenous metadata and protocols for appropriate use (Carroll et al. 2022a). However, these challenges are offset by the potential for enriching scientific knowledg","PeriodicalId":9011,"journal":{"name":"Biodiversity Information Science and Standards","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135831122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Implementing the CARE Principles for Indigenous Data Governance in Biodiversity Data Management","authors":"Riley Taitingfong, Stephanie Carroll","doi":"10.3897/biss.7.112615","DOIUrl":"https://doi.org/10.3897/biss.7.112615","url":null,"abstract":"Indigenous data governance is a critical aspect of upholding Indigenous rights and fostering equitable partnerships in biodiversity research and data management. An estimated 80% of the planet’s biodiversity exists on Indigenous lands (Sobrevila 2008), and the majority of Indigenous data derived from specimens taken from Indigenous lands are held by non-Indigenous entities and institutions. The CARE Principles (Collective benefit, Authority to control, Responsibility, and Ethics) are designed to guide the inclusion of Indigenous peoples in data governance, and increase their access to and benefit from data (Carroll et al. 2020). This talk will share emerging tools and resources that can be leveraged to implement the CARE Principles within repositories and institutions that hold Indigenous data. This talk highlights two primary tools to promote Indigenous data governance in repositories: a phased framework to guide third-party holders of Indigenous data through foundational learning and concrete steps to apply the CARE principles in their respective settings, and the CARE criteria, an assessment tool by which researchers and institutions can evaluate the maturity of CARE implementation, identify areas for improvement, and allow other entities such as funders and publishers to evaluate CARE compliance. a phased framework to guide third-party holders of Indigenous data through foundational learning and concrete steps to apply the CARE principles in their respective settings, and the CARE criteria, an assessment tool by which researchers and institutions can evaluate the maturity of CARE implementation, identify areas for improvement, and allow other entities such as funders and publishers to evaluate CARE compliance.","PeriodicalId":9011,"journal":{"name":"Biodiversity Information Science and Standards","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135830419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}