Elizabeth H. Wenk, William K. Cornwell, Anne Fuchs, Fonti Kar, Anna M. Monro, Hervé Sauquet, Ruby E. Stephens, Daniel S. Falster
{"title":"APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census","authors":"Elizabeth H. Wenk, William K. Cornwell, Anne Fuchs, Fonti Kar, Anna M. Monro, Hervé Sauquet, Ruby E. Stephens, Daniel S. Falster","doi":"10.1071/bt24014","DOIUrl":null,"url":null,"abstract":"<p>Here we present ‘APCalign’, an R package and accompanying browser-sourced application to align and update scientific names for Australian vascular plants to the most likely currently accepted name in the Australian Plant Census (APC) or a name in the Australian Plant Names Index (APNI). Scientific names are the label assigned to unique taxon concepts by the scientific community, but this common terminology is most useful if a taxon concept is consistently referred to by the same name. These links can be broken because of either spelling mistakes or taxonomic changes. Automated tools are required to resolve taxon lists, aligning and updating long lists of possibly erroneous scientific names to the most likely currently accepted names. It is essential that tools specific to the APC/APNI be developed, because these lists specify an endorsed national-level nomenclature used in government legislation and include the uniquely Australian concept of phrase names, absent in global taxonomic datasets. To align input names to names within the APC or APNI, ‘APCalign’ works progressively through a sequence of checks that combine different permutations of the input name, exact versus fuzzy matches, matches that consider the entire name input versus a subset of words, and character strings that indicate a name can be resolved only to a genus or family. The aligned names are then, when possible, updated to a currently accepted taxon concept within the APC. This package should facilitate all research outputs that require diverse scientific name lists to be merged or outdated lists to be updated.</p>","PeriodicalId":8607,"journal":{"name":"Australian Journal of Botany","volume":null,"pages":null},"PeriodicalIF":0.9000,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Australian Journal of Botany","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1071/bt24014","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Here we present ‘APCalign’, an R package and accompanying browser-sourced application to align and update scientific names for Australian vascular plants to the most likely currently accepted name in the Australian Plant Census (APC) or a name in the Australian Plant Names Index (APNI). Scientific names are the label assigned to unique taxon concepts by the scientific community, but this common terminology is most useful if a taxon concept is consistently referred to by the same name. These links can be broken because of either spelling mistakes or taxonomic changes. Automated tools are required to resolve taxon lists, aligning and updating long lists of possibly erroneous scientific names to the most likely currently accepted names. It is essential that tools specific to the APC/APNI be developed, because these lists specify an endorsed national-level nomenclature used in government legislation and include the uniquely Australian concept of phrase names, absent in global taxonomic datasets. To align input names to names within the APC or APNI, ‘APCalign’ works progressively through a sequence of checks that combine different permutations of the input name, exact versus fuzzy matches, matches that consider the entire name input versus a subset of words, and character strings that indicate a name can be resolved only to a genus or family. The aligned names are then, when possible, updated to a currently accepted taxon concept within the APC. This package should facilitate all research outputs that require diverse scientific name lists to be merged or outdated lists to be updated.
期刊介绍:
Australian Journal of Botany is an international journal for publication of original research in plant science. We seek papers of broad interest with relevance to Southern Hemisphere ecosystems. Our scope encompasses all approaches to understanding plant biology.
Australian Journal of Botany is published with the endorsement of the Commonwealth Scientific and Industrial Research Organisation (CSIRO) and the Australian Academy of Science.