Paving the way for deeper insights into nematode community composition with long-read metabarcoding: ecological and biogeographical coverage of the sequences

IF 10.3 1区 农林科学 Q1 SOIL SCIENCE
Arash Noshadi, Reza Ghaderi, Uffe N. Nielsen, Helen L. Hayden, Ji-Zheng He
{"title":"Paving the way for deeper insights into nematode community composition with long-read metabarcoding: ecological and biogeographical coverage of the sequences","authors":"Arash Noshadi, Reza Ghaderi, Uffe N. Nielsen, Helen L. Hayden, Ji-Zheng He","doi":"10.1016/j.soilbio.2025.110001","DOIUrl":null,"url":null,"abstract":"Long-read metabarcoding has excellent potential to advance nematode community ecology beyond the limitations of morphological and short-read approaches. The ecological and biogeographical background of existing sequence databases offers important insights into the potential application of high-throughput sequencing approaches in nematode community analyses. This study searched public databases for the three universal marker genes used in nematode molecular taxonomy studies, <em>i.e</em>., 18S ribosomal RNA (18S rRNA), 28S ribosomal RNA (28S rRNA), and cytochrome c oxidase subunit I (COI) genes, to retrieve full-length sequences suitable for long-read metabarcoding of soil nematode communities. The most full-length sequences were found for COI with 17534, followed by 4898 for 18S rRNA and 800 for 28S rRNA. These full-length sequences represented 185, 54, and 163 unique families; 626, 160, and 609 unique genera; and 1320, 235, and 1527 unique species for 18S rRNA, 28S rRNA, and COI markers, respectively. Nucleotide composition and diversity analyses across the three markers revealed distinct patterns affecting their utility for taxonomic studies. Geographically, the majority of the sequences were from the United States, China, Japan or Germany. Additionally, precise country-of-origin information was lacking for the majority of sequences, highlighting the limitations of the current databases and rendering robust geographic analyses difficult. Full-length sequences were assigned to an ecological framework, revealing that, for nematode trophic groups, herbivores were the most numerous group (10735 sequences), followed by animal parasites (6588 sequences), bacterivores (1785 sequences) and entomopathogenic nematodes (1513 sequences), whereas other trophic groups had fewer representative sequences. Assigning the sequences to colonizer-persister (c-p) classification revealed that all c-p groups were covered by the retrieved full-length sequences, particularly c-p 3, which had the highest number of sequences (6691). This study provides a foundational understanding of the molecular data currently available for use in long-read metabarcoding databases, facilitating ecological research on nematode communities.","PeriodicalId":21888,"journal":{"name":"Soil Biology & Biochemistry","volume":"45 1","pages":""},"PeriodicalIF":10.3000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soil Biology & Biochemistry","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1016/j.soilbio.2025.110001","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOIL SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Long-read metabarcoding has excellent potential to advance nematode community ecology beyond the limitations of morphological and short-read approaches. The ecological and biogeographical background of existing sequence databases offers important insights into the potential application of high-throughput sequencing approaches in nematode community analyses. This study searched public databases for the three universal marker genes used in nematode molecular taxonomy studies, i.e., 18S ribosomal RNA (18S rRNA), 28S ribosomal RNA (28S rRNA), and cytochrome c oxidase subunit I (COI) genes, to retrieve full-length sequences suitable for long-read metabarcoding of soil nematode communities. The most full-length sequences were found for COI with 17534, followed by 4898 for 18S rRNA and 800 for 28S rRNA. These full-length sequences represented 185, 54, and 163 unique families; 626, 160, and 609 unique genera; and 1320, 235, and 1527 unique species for 18S rRNA, 28S rRNA, and COI markers, respectively. Nucleotide composition and diversity analyses across the three markers revealed distinct patterns affecting their utility for taxonomic studies. Geographically, the majority of the sequences were from the United States, China, Japan or Germany. Additionally, precise country-of-origin information was lacking for the majority of sequences, highlighting the limitations of the current databases and rendering robust geographic analyses difficult. Full-length sequences were assigned to an ecological framework, revealing that, for nematode trophic groups, herbivores were the most numerous group (10735 sequences), followed by animal parasites (6588 sequences), bacterivores (1785 sequences) and entomopathogenic nematodes (1513 sequences), whereas other trophic groups had fewer representative sequences. Assigning the sequences to colonizer-persister (c-p) classification revealed that all c-p groups were covered by the retrieved full-length sequences, particularly c-p 3, which had the highest number of sequences (6691). This study provides a foundational understanding of the molecular data currently available for use in long-read metabarcoding databases, facilitating ecological research on nematode communities.
利用长读元条形码为深入了解线虫群落组成铺平道路:序列的生态和生物地理覆盖
长读元条形码在突破形态学和短读方法的限制,推进线虫群落生态学方面具有很好的潜力。现有序列数据库的生态和生物地理背景为高通量测序方法在线虫群落分析中的潜在应用提供了重要的见解。本研究检索了线虫分子分类学研究中常用的3个通用标记基因,即18S核糖体RNA (18S rRNA)、28S核糖体RNA (28S rRNA)和细胞色素c氧化酶亚基I (COI)基因,检索了适合土壤线虫群落长读元条形码编码的全长序列。COI全长序列最多,为17534,18S rRNA为4898,28S rRNA为800。这些全长序列分别代表185、54和163个独特的家族;626、160和609唯一属;18S rRNA、28S rRNA和COI标记的独特种数分别为1320、235和1527种。三种标记的核苷酸组成和多样性分析揭示了影响其分类研究效用的不同模式。从地理位置上看,大多数序列来自美国、中国、日本和德国。此外,大多数序列缺乏精确的原产国信息,这突出了当前数据库的局限性,并使可靠的地理分析变得困难。结果表明,在线虫的营养类群中,草食线虫最多(10735条),其次是动物寄生虫(6588条)、细菌线虫(1785条)和昆虫病原线虫(1513条),其他营养类群代表性序列较少。结果表明,所有的c-p类群都被检索到的全长序列所覆盖,特别是c- p3,其序列数最多(6691条)。本研究为长读元条形码数据库中现有的线虫分子数据提供了基础认识,为线虫群落的生态学研究提供了便利。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Soil Biology & Biochemistry
Soil Biology & Biochemistry 农林科学-土壤科学
CiteScore
16.90
自引率
9.30%
发文量
312
审稿时长
49 days
期刊介绍: Soil Biology & Biochemistry publishes original research articles of international significance focusing on biological processes in soil and their applications to soil and environmental quality. Major topics include the ecology and biochemical processes of soil organisms, their effects on the environment, and interactions with plants. The journal also welcomes state-of-the-art reviews and discussions on contemporary research in soil biology and biochemistry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信