超保守元件和机器学习分类器实现模型和非模型线虫的鲁棒系统发育和分类。

IF 5.5 1区 生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY
Laura Villegas, Lucy Jimenez, Joëlle van der Sprong, Oleksandr Holovachov, Ann-Marie Waldvogel, Philipp H Schiffer
{"title":"超保守元件和机器学习分类器实现模型和非模型线虫的鲁棒系统发育和分类。","authors":"Laura Villegas, Lucy Jimenez, Joëlle van der Sprong, Oleksandr Holovachov, Ann-Marie Waldvogel, Philipp H Schiffer","doi":"10.1111/1755-0998.70046","DOIUrl":null,"url":null,"abstract":"<p><p>Nematodes are among the most diverse animals, yet only around 28,000 of an estimated one million species have been morphologically described. Their small size, morphological simplicity, and cryptic diversity complicate phylogenetic analyses. Traditional morphological and single-locus molecular approaches often lack resolution for both recent and ancient divergences. To address these limitations, we developed the first ultraconserved elements (UCEs) probe sets for two nematode families: Panagrolaimidae, a group of non-model organisms with limited genomic resources when compared to model taxa, and Rhabditidae, which includes the model species Caenorhabditis elegans. Our probe sets targeted 1612 loci for Panagrolaimidae and 100,397 for Rhabditidae. In vitro testing recovered up to 1457 loci in Panagrolaimidae, supporting robust phylogenetic reconstruction. Results were largely consistent with previous analyses, except for one strain reclassified as Neocephalobus halophilus BSS8. Using machine learning, we determined the minimum number of loci needed for accurate genus-level classification. For Rhabditidae, XGBoost achieved high accuracy with just 46 loci. For Panagrolaimidae, 39 loci were most informative. Our UCE-based approach offers a scalable and cost-effective framework for phylogenomics, enhancing taxonomic resolution and evolutionary inference in nematodes. It is well suited for biodiversity assessments and shallow, field-based sequencing, expanding research possibilities across this ecologically important phylum.</p>","PeriodicalId":211,"journal":{"name":"Molecular Ecology Resources","volume":" ","pages":"e70046"},"PeriodicalIF":5.5000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Ultraconserved Elements and Machine Learning Classifiers Enable Robust Phylogenetics and Taxonomy in Model and Non-Model Nematodes.\",\"authors\":\"Laura Villegas, Lucy Jimenez, Joëlle van der Sprong, Oleksandr Holovachov, Ann-Marie Waldvogel, Philipp H Schiffer\",\"doi\":\"10.1111/1755-0998.70046\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Nematodes are among the most diverse animals, yet only around 28,000 of an estimated one million species have been morphologically described. Their small size, morphological simplicity, and cryptic diversity complicate phylogenetic analyses. Traditional morphological and single-locus molecular approaches often lack resolution for both recent and ancient divergences. To address these limitations, we developed the first ultraconserved elements (UCEs) probe sets for two nematode families: Panagrolaimidae, a group of non-model organisms with limited genomic resources when compared to model taxa, and Rhabditidae, which includes the model species Caenorhabditis elegans. Our probe sets targeted 1612 loci for Panagrolaimidae and 100,397 for Rhabditidae. In vitro testing recovered up to 1457 loci in Panagrolaimidae, supporting robust phylogenetic reconstruction. Results were largely consistent with previous analyses, except for one strain reclassified as Neocephalobus halophilus BSS8. Using machine learning, we determined the minimum number of loci needed for accurate genus-level classification. For Rhabditidae, XGBoost achieved high accuracy with just 46 loci. For Panagrolaimidae, 39 loci were most informative. Our UCE-based approach offers a scalable and cost-effective framework for phylogenomics, enhancing taxonomic resolution and evolutionary inference in nematodes. It is well suited for biodiversity assessments and shallow, field-based sequencing, expanding research possibilities across this ecologically important phylum.</p>\",\"PeriodicalId\":211,\"journal\":{\"name\":\"Molecular Ecology Resources\",\"volume\":\" \",\"pages\":\"e70046\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molecular Ecology Resources\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1111/1755-0998.70046\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Ecology Resources","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1111/1755-0998.70046","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

线虫是最多样化的动物之一,但在估计的100万种中,只有大约2.8万种被形态学地描述过。它们的体积小,形态简单,和隐蔽的多样性使系统发育分析复杂化。传统的形态学和单位点分子方法往往缺乏解决最近和古代的分歧。为了解决这些限制,我们为两个线虫科开发了第一个超保守元件(UCEs)探针集:Panagrolaimidae,一组与模型分类群相比基因组资源有限的非模式生物,以及Rhabditidae,其中包括模型物种秀丽隐杆线虫。我们的探针集针对Panagrolaimidae的1612个位点和Rhabditidae的100,397个位点。体外测试在Panagrolaimidae中恢复了多达1457个位点,支持强大的系统发育重建。结果与先前的分析基本一致,除了一个菌株被重新分类为新嗜盐头孢菌BSS8。使用机器学习,我们确定了准确的属级分类所需的最小位点数量。对于Rhabditidae, XGBoost仅使用46个位点就实现了很高的准确性。对于拟蝇科,39个位点信息量最大。我们基于uce的方法为系统基因组学提供了一个可扩展且具有成本效益的框架,提高了线虫的分类分辨率和进化推断。它非常适合生物多样性评估和浅层野外测序,扩大了这一重要生态门的研究可能性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Ultraconserved Elements and Machine Learning Classifiers Enable Robust Phylogenetics and Taxonomy in Model and Non-Model Nematodes.

Nematodes are among the most diverse animals, yet only around 28,000 of an estimated one million species have been morphologically described. Their small size, morphological simplicity, and cryptic diversity complicate phylogenetic analyses. Traditional morphological and single-locus molecular approaches often lack resolution for both recent and ancient divergences. To address these limitations, we developed the first ultraconserved elements (UCEs) probe sets for two nematode families: Panagrolaimidae, a group of non-model organisms with limited genomic resources when compared to model taxa, and Rhabditidae, which includes the model species Caenorhabditis elegans. Our probe sets targeted 1612 loci for Panagrolaimidae and 100,397 for Rhabditidae. In vitro testing recovered up to 1457 loci in Panagrolaimidae, supporting robust phylogenetic reconstruction. Results were largely consistent with previous analyses, except for one strain reclassified as Neocephalobus halophilus BSS8. Using machine learning, we determined the minimum number of loci needed for accurate genus-level classification. For Rhabditidae, XGBoost achieved high accuracy with just 46 loci. For Panagrolaimidae, 39 loci were most informative. Our UCE-based approach offers a scalable and cost-effective framework for phylogenomics, enhancing taxonomic resolution and evolutionary inference in nematodes. It is well suited for biodiversity assessments and shallow, field-based sequencing, expanding research possibilities across this ecologically important phylum.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Molecular Ecology Resources
Molecular Ecology Resources 生物-进化生物学
CiteScore
15.60
自引率
5.20%
发文量
170
审稿时长
3 months
期刊介绍: Molecular Ecology Resources promotes the creation of comprehensive resources for the scientific community, encompassing computer programs, statistical and molecular advancements, and a diverse array of molecular tools. Serving as a conduit for disseminating these resources, the journal targets a broad audience of researchers in the fields of evolution, ecology, and conservation. Articles in Molecular Ecology Resources are crafted to support investigations tackling significant questions within these disciplines. In addition to original resource articles, Molecular Ecology Resources features Reviews, Opinions, and Comments relevant to the field. The journal also periodically releases Special Issues focusing on resource development within specific areas.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信