APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census

IF 0.9 4区 生物学 Q4 PLANT SCIENCES
Elizabeth H. Wenk, William K. Cornwell, Anne Fuchs, Fonti Kar, Anna M. Monro, Hervé Sauquet, Ruby E. Stephens, Daniel S. Falster
{"title":"APCalign: an R package workflow and app for aligning and updating flora names to the Australian Plant Census","authors":"Elizabeth H. Wenk, William K. Cornwell, Anne Fuchs, Fonti Kar, Anna M. Monro, Hervé Sauquet, Ruby E. Stephens, Daniel S. Falster","doi":"10.1071/bt24014","DOIUrl":null,"url":null,"abstract":"<p>Here we present ‘APCalign’, an R package and accompanying browser-sourced application to align and update scientific names for Australian vascular plants to the most likely currently accepted name in the Australian Plant Census (APC) or a name in the Australian Plant Names Index (APNI). Scientific names are the label assigned to unique taxon concepts by the scientific community, but this common terminology is most useful if a taxon concept is consistently referred to by the same name. These links can be broken because of either spelling mistakes or taxonomic changes. Automated tools are required to resolve taxon lists, aligning and updating long lists of possibly erroneous scientific names to the most likely currently accepted names. It is essential that tools specific to the APC/APNI be developed, because these lists specify an endorsed national-level nomenclature used in government legislation and include the uniquely Australian concept of phrase names, absent in global taxonomic datasets. To align input names to names within the APC or APNI, ‘APCalign’ works progressively through a sequence of checks that combine different permutations of the input name, exact versus fuzzy matches, matches that consider the entire name input versus a subset of words, and character strings that indicate a name can be resolved only to a genus or family. The aligned names are then, when possible, updated to a currently accepted taxon concept within the APC. This package should facilitate all research outputs that require diverse scientific name lists to be merged or outdated lists to be updated.</p>","PeriodicalId":8607,"journal":{"name":"Australian Journal of Botany","volume":null,"pages":null},"PeriodicalIF":0.9000,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Australian Journal of Botany","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1071/bt24014","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Here we present ‘APCalign’, an R package and accompanying browser-sourced application to align and update scientific names for Australian vascular plants to the most likely currently accepted name in the Australian Plant Census (APC) or a name in the Australian Plant Names Index (APNI). Scientific names are the label assigned to unique taxon concepts by the scientific community, but this common terminology is most useful if a taxon concept is consistently referred to by the same name. These links can be broken because of either spelling mistakes or taxonomic changes. Automated tools are required to resolve taxon lists, aligning and updating long lists of possibly erroneous scientific names to the most likely currently accepted names. It is essential that tools specific to the APC/APNI be developed, because these lists specify an endorsed national-level nomenclature used in government legislation and include the uniquely Australian concept of phrase names, absent in global taxonomic datasets. To align input names to names within the APC or APNI, ‘APCalign’ works progressively through a sequence of checks that combine different permutations of the input name, exact versus fuzzy matches, matches that consider the entire name input versus a subset of words, and character strings that indicate a name can be resolved only to a genus or family. The aligned names are then, when possible, updated to a currently accepted taxon concept within the APC. This package should facilitate all research outputs that require diverse scientific name lists to be merged or outdated lists to be updated.

APCalign:根据澳大利亚植物普查对植物区系名称进行对齐和更新的 R 软件包工作流程和应用程序
我们在此介绍 "APCalign",它是一个 R 软件包和配套的浏览器源应用程序,用于将澳大利亚维管植物的学名与澳大利亚植物普查(APC)中目前最有可能接受的名称或澳大利亚植物名称索引(APNI)中的名称进行对齐和更新。学名是科学界分配给独特分类群概念的标签,但如果一个分类群概念被一致地称为相同的名称,这种通用术语就最有用了。由于拼写错误或分类变化,这些链接可能会中断。需要自动化工具来解决分类群列表问题,将一长串可能有误的学名与目前最有可能被接受的名称进行对齐和更新。开发专门针对 APC/APNI 的工具至关重要,因为这些列表规定了政府立法中使用的国家级命名法,并包含澳大利亚特有的短语名称概念,而全球分类数据集中并不存在这一概念。为了将输入名称与 APC 或 APNI 中的名称进行对齐,"APCalign "会逐步进行一系列检查,包括输入名称的不同排列、精确匹配与模糊匹配、整个输入名称与单词子集的匹配,以及表示名称只能解析为属或科的字符串。然后,在可能的情况下,对齐后的名称会更新为 APC 中目前公认的分类群概念。该软件包应有助于所有需要合并不同科学名称列表或更新过时列表的研究成果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Australian Journal of Botany
Australian Journal of Botany 生物-植物科学
CiteScore
2.30
自引率
18.20%
发文量
26
审稿时长
6-12 weeks
期刊介绍: Australian Journal of Botany is an international journal for publication of original research in plant science. We seek papers of broad interest with relevance to Southern Hemisphere ecosystems. Our scope encompasses all approaches to understanding plant biology. Australian Journal of Botany is published with the endorsement of the Commonwealth Scientific and Industrial Research Organisation (CSIRO) and the Australian Academy of Science.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信