{"title":"A framework for identifying the polyploid complex in Rorippa (Brassicaceae): combining trait evolution, herbarium records, and machine learning.","authors":"Ting-Shen Han, Jun-Xian Lv, Yao-Wu Xing","doi":"10.1093/aob/mcag050","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and aims: </strong>Species identification in polyploid plants remains challenging due to morphological continuity and genomic redundancy. Such taxonomic uncertainties obscure evolutionary or ecological inference. A critical solution involves the reassessment of polyploid collections using stable diagnostic traits and integrative approaches. Here, we examined the Rorippa dubia-indica complex (Brassicaceae), a morphologically overlapping tetraploid-hexaploid lineage natively distributed in East Asia.</p><p><strong>Methods: </strong>We developed a framework that integrates experimental phenotyping, herbarium reassessment, and computational modeling for secondary species assessment of polyploid plants. The framework incorporates spatiotemporal data from 3,136 field-collected (2017-2020) and 2,015 herbarium (1893-2021) specimens. Species were circumscribed using experimental assessments of anatomical, cytological, and morphological traits, interpreted within a phylogenetically informed evolutionary context. Stable diagnostic traits were then applied to reidentify specimens for improved species distribution models. Finally, curated trait and species data were used to train machine learning classification models to reconstruct the diagnostic rationale underlying specimen identification.</p><p><strong>Key results: </strong>Seed arrangement, petal number, and genome size exhibited clear interspecific differentiation. Phylogenomic analyses based on chloroplast genomes further resolved species circumscription consistent with these traits. According to the revision of specimens and classification models defined by machine learning, we found that initial misidentification rates reached 12-50% across virtual or physical specimens, largely due to reliance on plastic traits such as leaf shape. These errors substantially distorted spatial distribution models and future climate projections.</p><p><strong>Conclusions: </strong>Our findings underscore the need for secondary specimen evaluation. The framework demonstrates the importance of integrating morphologic and phylogenetic inference with machine learning tools to resolve taxonomically difficult polyploid complexes. This approach offers direct applications for biodiversity assessment, evolutionary research, and conservation planning.</p>","PeriodicalId":8023,"journal":{"name":"Annals of botany","volume":" ","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2026-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of botany","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/aob/mcag050","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Background and aims: Species identification in polyploid plants remains challenging due to morphological continuity and genomic redundancy. Such taxonomic uncertainties obscure evolutionary or ecological inference. A critical solution involves the reassessment of polyploid collections using stable diagnostic traits and integrative approaches. Here, we examined the Rorippa dubia-indica complex (Brassicaceae), a morphologically overlapping tetraploid-hexaploid lineage natively distributed in East Asia.
Methods: We developed a framework that integrates experimental phenotyping, herbarium reassessment, and computational modeling for secondary species assessment of polyploid plants. The framework incorporates spatiotemporal data from 3,136 field-collected (2017-2020) and 2,015 herbarium (1893-2021) specimens. Species were circumscribed using experimental assessments of anatomical, cytological, and morphological traits, interpreted within a phylogenetically informed evolutionary context. Stable diagnostic traits were then applied to reidentify specimens for improved species distribution models. Finally, curated trait and species data were used to train machine learning classification models to reconstruct the diagnostic rationale underlying specimen identification.
Key results: Seed arrangement, petal number, and genome size exhibited clear interspecific differentiation. Phylogenomic analyses based on chloroplast genomes further resolved species circumscription consistent with these traits. According to the revision of specimens and classification models defined by machine learning, we found that initial misidentification rates reached 12-50% across virtual or physical specimens, largely due to reliance on plastic traits such as leaf shape. These errors substantially distorted spatial distribution models and future climate projections.
Conclusions: Our findings underscore the need for secondary specimen evaluation. The framework demonstrates the importance of integrating morphologic and phylogenetic inference with machine learning tools to resolve taxonomically difficult polyploid complexes. This approach offers direct applications for biodiversity assessment, evolutionary research, and conservation planning.
期刊介绍:
Annals of Botany is an international plant science journal publishing novel and rigorous research in all areas of plant science. It is published monthly in both electronic and printed forms with at least two extra issues each year that focus on a particular theme in plant biology. The Journal is managed by the Annals of Botany Company, a not-for-profit educational charity established to promote plant science worldwide.
The Journal publishes original research papers, invited and submitted review articles, ''Research in Context'' expanding on original work, ''Botanical Briefings'' as short overviews of important topics, and ''Viewpoints'' giving opinions. All papers in each issue are summarized briefly in Content Snapshots , there are topical news items in the Plant Cuttings section and Book Reviews . A rigorous review process ensures that readers are exposed to genuine and novel advances across a wide spectrum of botanical knowledge. All papers aim to advance knowledge and make a difference to our understanding of plant science.