Towards真核浮游生物定量元条形码:一种改善18S rRNA基因拷贝数偏差的方法

Jon Lapeyra Martin, Ioulia Santi, P. Pitta, U. John, N. Gypens
{"title":"Towards真核浮游生物定量元条形码:一种改善18S rRNA基因拷贝数偏差的方法","authors":"Jon Lapeyra Martin, Ioulia Santi, P. Pitta, U. John, N. Gypens","doi":"10.3897/mbmg.6.85794","DOIUrl":null,"url":null,"abstract":"Plankton metabarcoding is increasingly implemented in marine ecosystem assessments and is more cost-efficient and less time-consuming than monitoring based on microscopy (morphological). 18S rRNA gene is the most widely used marker for groups’ and species’ detection and classification within marine eukaryotic microorganisms. These datasets have commonly relied on the acquisition of organismal abundances directly from the number of DNA sequences (i.e. reads). Besides the inherent technical biases in metabarcoding, the largely varying 18S rRNA gene copy numbers (GCN) among marine protists (ranging from tens to thousands) is one of the most important biological biases for species quantification. In this work, we present a gene copy number correction factor (CF) for four marine planktonic groups: Bacillariophyta, Dinoflagellata, Ciliophora miscellaneous and flagellated cells. On the basis of the theoretical assumption that ‘1 read’ is equivalent to ‘1 GCN’, we used the GCN median values per plankton group to calculate the corrected cell number and biomass relative abundances. The species-specific absolute GCN per cell were obtained from various studies published in the literature. We contributed to the development of a species-specific 18S rRNA GCN database proposed by previous authors. To assess the efficiency of the correction factor we compared the metabarcoding, morphological and corrected relative abundances (in cell number and biomass) of 15 surface water samples collected in the Belgian Coastal Zone. Results showed that the application of the correction factor over metabarcoding results enables us to significantly improve the estimates of cell abundances for Dinoflagellata, Ciliophora and flagellated cells, but not for Bacillariophyta. This is likely to due to large biovolume plasticity in diatoms not corresponding to genome size and gene copy numbers. C-biomass relative abundance estimations directly from amplicon reads were only improved for Dinoflagellata and Ciliophora. The method is still facing biases related to the low number of species GCN assessed. Nevertheless, the increase of species in the GCN database may lead to the refinement of the proposed correction factor.","PeriodicalId":18374,"journal":{"name":"Metabarcoding and Metagenomics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Towards quantitative metabarcoding of eukaryotic plankton: an approach to improve 18S rRNA gene copy number bias\",\"authors\":\"Jon Lapeyra Martin, Ioulia Santi, P. Pitta, U. John, N. Gypens\",\"doi\":\"10.3897/mbmg.6.85794\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Plankton metabarcoding is increasingly implemented in marine ecosystem assessments and is more cost-efficient and less time-consuming than monitoring based on microscopy (morphological). 18S rRNA gene is the most widely used marker for groups’ and species’ detection and classification within marine eukaryotic microorganisms. These datasets have commonly relied on the acquisition of organismal abundances directly from the number of DNA sequences (i.e. reads). Besides the inherent technical biases in metabarcoding, the largely varying 18S rRNA gene copy numbers (GCN) among marine protists (ranging from tens to thousands) is one of the most important biological biases for species quantification. In this work, we present a gene copy number correction factor (CF) for four marine planktonic groups: Bacillariophyta, Dinoflagellata, Ciliophora miscellaneous and flagellated cells. On the basis of the theoretical assumption that ‘1 read’ is equivalent to ‘1 GCN’, we used the GCN median values per plankton group to calculate the corrected cell number and biomass relative abundances. The species-specific absolute GCN per cell were obtained from various studies published in the literature. We contributed to the development of a species-specific 18S rRNA GCN database proposed by previous authors. To assess the efficiency of the correction factor we compared the metabarcoding, morphological and corrected relative abundances (in cell number and biomass) of 15 surface water samples collected in the Belgian Coastal Zone. Results showed that the application of the correction factor over metabarcoding results enables us to significantly improve the estimates of cell abundances for Dinoflagellata, Ciliophora and flagellated cells, but not for Bacillariophyta. This is likely to due to large biovolume plasticity in diatoms not corresponding to genome size and gene copy numbers. C-biomass relative abundance estimations directly from amplicon reads were only improved for Dinoflagellata and Ciliophora. The method is still facing biases related to the low number of species GCN assessed. Nevertheless, the increase of species in the GCN database may lead to the refinement of the proposed correction factor.\",\"PeriodicalId\":18374,\"journal\":{\"name\":\"Metabarcoding and Metagenomics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Metabarcoding and Metagenomics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3897/mbmg.6.85794\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Metabarcoding and Metagenomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3897/mbmg.6.85794","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19

摘要

浮游生物代谢条形码越来越多地应用于海洋生态系统评估,与基于显微镜(形态学)的监测相比,它更具成本效益,耗时更少。18S rRNA基因是海洋真核微生物中最广泛使用的群体和物种检测和分类标记。这些数据集通常依赖于直接从DNA序列的数量(即读数)中获取生物体丰度。除了代谢编码中固有的技术偏差外,海洋原生生物中18S rRNA基因拷贝数(GCN)的巨大差异(从数万到数千不等)是物种量化最重要的生物学偏差之一。在这项工作中,我们提出了一种适用于四个海洋浮游生物群的基因拷贝数校正因子(CF):硅藻门、甲藻门、杂生和有鞭毛细胞。基于“1读数”相当于“1 GCN”的理论假设,我们使用每个浮游生物组的GCN中值来计算校正的细胞数量和生物量相对丰度。从文献中发表的各种研究中获得了每个细胞的物种特异性绝对GCN。我们为先前作者提出的物种特异性18S rRNA GCN数据库的开发做出了贡献。为了评估校正因子的效率,我们比较了在比利时海岸带采集的15个地表水样本的代谢编码、形态和校正后的相对丰度(细胞数量和生物量)。结果表明,将校正因子应用于代谢编码结果,使我们能够显著提高对甲藻、侧弯藻和有鞭毛细胞的细胞丰度的估计,但不能提高对硅藻的细胞丰度估计。这可能是由于硅藻的大生物体积可塑性与基因组大小和基因拷贝数不对应。直接从扩增子读数中估计的C生物量相对丰度仅对甲藻和香鱼有所改善。该方法仍然面临着与所评估的GCN物种数量较少有关的偏差。然而,GCN数据库中物种的增加可能会导致所提出的校正因子的细化。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Towards quantitative metabarcoding of eukaryotic plankton: an approach to improve 18S rRNA gene copy number bias
Plankton metabarcoding is increasingly implemented in marine ecosystem assessments and is more cost-efficient and less time-consuming than monitoring based on microscopy (morphological). 18S rRNA gene is the most widely used marker for groups’ and species’ detection and classification within marine eukaryotic microorganisms. These datasets have commonly relied on the acquisition of organismal abundances directly from the number of DNA sequences (i.e. reads). Besides the inherent technical biases in metabarcoding, the largely varying 18S rRNA gene copy numbers (GCN) among marine protists (ranging from tens to thousands) is one of the most important biological biases for species quantification. In this work, we present a gene copy number correction factor (CF) for four marine planktonic groups: Bacillariophyta, Dinoflagellata, Ciliophora miscellaneous and flagellated cells. On the basis of the theoretical assumption that ‘1 read’ is equivalent to ‘1 GCN’, we used the GCN median values per plankton group to calculate the corrected cell number and biomass relative abundances. The species-specific absolute GCN per cell were obtained from various studies published in the literature. We contributed to the development of a species-specific 18S rRNA GCN database proposed by previous authors. To assess the efficiency of the correction factor we compared the metabarcoding, morphological and corrected relative abundances (in cell number and biomass) of 15 surface water samples collected in the Belgian Coastal Zone. Results showed that the application of the correction factor over metabarcoding results enables us to significantly improve the estimates of cell abundances for Dinoflagellata, Ciliophora and flagellated cells, but not for Bacillariophyta. This is likely to due to large biovolume plasticity in diatoms not corresponding to genome size and gene copy numbers. C-biomass relative abundance estimations directly from amplicon reads were only improved for Dinoflagellata and Ciliophora. The method is still facing biases related to the low number of species GCN assessed. Nevertheless, the increase of species in the GCN database may lead to the refinement of the proposed correction factor.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Metabarcoding and Metagenomics
Metabarcoding and Metagenomics Agricultural and Biological Sciences-Animal Science and Zoology
CiteScore
5.40
自引率
0.00%
发文量
25
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信