Laura Villegas, Lucy Jimenez, Joëlle van der Sprong, Oleksandr Holovachov, Ann-Marie Waldvogel, Philipp H Schiffer
{"title":"Ultraconserved Elements and Machine Learning Classifiers Enable Robust Phylogenetics and Taxonomy in Model and Non-Model Nematodes.","authors":"Laura Villegas, Lucy Jimenez, Joëlle van der Sprong, Oleksandr Holovachov, Ann-Marie Waldvogel, Philipp H Schiffer","doi":"10.1111/1755-0998.70046","DOIUrl":null,"url":null,"abstract":"<p><p>Nematodes are among the most diverse animals, yet only around 28,000 of an estimated one million species have been morphologically described. Their small size, morphological simplicity, and cryptic diversity complicate phylogenetic analyses. Traditional morphological and single-locus molecular approaches often lack resolution for both recent and ancient divergences. To address these limitations, we developed the first ultraconserved elements (UCEs) probe sets for two nematode families: Panagrolaimidae, a group of non-model organisms with limited genomic resources when compared to model taxa, and Rhabditidae, which includes the model species Caenorhabditis elegans. Our probe sets targeted 1612 loci for Panagrolaimidae and 100,397 for Rhabditidae. In vitro testing recovered up to 1457 loci in Panagrolaimidae, supporting robust phylogenetic reconstruction. Results were largely consistent with previous analyses, except for one strain reclassified as Neocephalobus halophilus BSS8. Using machine learning, we determined the minimum number of loci needed for accurate genus-level classification. For Rhabditidae, XGBoost achieved high accuracy with just 46 loci. For Panagrolaimidae, 39 loci were most informative. Our UCE-based approach offers a scalable and cost-effective framework for phylogenomics, enhancing taxonomic resolution and evolutionary inference in nematodes. It is well suited for biodiversity assessments and shallow, field-based sequencing, expanding research possibilities across this ecologically important phylum.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e70046"},"PeriodicalIF":5.5000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Ecology Resources","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1111/1755-0998.70046","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Nematodes are among the most diverse animals, yet only around 28,000 of an estimated one million species have been morphologically described. Their small size, morphological simplicity, and cryptic diversity complicate phylogenetic analyses. Traditional morphological and single-locus molecular approaches often lack resolution for both recent and ancient divergences. To address these limitations, we developed the first ultraconserved elements (UCEs) probe sets for two nematode families: Panagrolaimidae, a group of non-model organisms with limited genomic resources when compared to model taxa, and Rhabditidae, which includes the model species Caenorhabditis elegans. Our probe sets targeted 1612 loci for Panagrolaimidae and 100,397 for Rhabditidae. In vitro testing recovered up to 1457 loci in Panagrolaimidae, supporting robust phylogenetic reconstruction. Results were largely consistent with previous analyses, except for one strain reclassified as Neocephalobus halophilus BSS8. Using machine learning, we determined the minimum number of loci needed for accurate genus-level classification. For Rhabditidae, XGBoost achieved high accuracy with just 46 loci. For Panagrolaimidae, 39 loci were most informative. Our UCE-based approach offers a scalable and cost-effective framework for phylogenomics, enhancing taxonomic resolution and evolutionary inference in nematodes. It is well suited for biodiversity assessments and shallow, field-based sequencing, expanding research possibilities across this ecologically important phylum.
期刊介绍:
Molecular Ecology Resources promotes the creation of comprehensive resources for the scientific community, encompassing computer programs, statistical and molecular advancements, and a diverse array of molecular tools. Serving as a conduit for disseminating these resources, the journal targets a broad audience of researchers in the fields of evolution, ecology, and conservation. Articles in Molecular Ecology Resources are crafted to support investigations tackling significant questions within these disciplines.
In addition to original resource articles, Molecular Ecology Resources features Reviews, Opinions, and Comments relevant to the field. The journal also periodically releases Special Issues focusing on resource development within specific areas.