Jon Lapeyra Martin, Ioulia Santi, P. Pitta, U. John, N. Gypens
{"title":"Towards真核浮游生物定量元条形码:一种改善18S rRNA基因拷贝数偏差的方法","authors":"Jon Lapeyra Martin, Ioulia Santi, P. Pitta, U. John, N. Gypens","doi":"10.3897/mbmg.6.85794","DOIUrl":null,"url":null,"abstract":"Plankton metabarcoding is increasingly implemented in marine ecosystem assessments and is more cost-efficient and less time-consuming than monitoring based on microscopy (morphological). 18S rRNA gene is the most widely used marker for groups’ and species’ detection and classification within marine eukaryotic microorganisms. These datasets have commonly relied on the acquisition of organismal abundances directly from the number of DNA sequences (i.e. reads). Besides the inherent technical biases in metabarcoding, the largely varying 18S rRNA gene copy numbers (GCN) among marine protists (ranging from tens to thousands) is one of the most important biological biases for species quantification. In this work, we present a gene copy number correction factor (CF) for four marine planktonic groups: Bacillariophyta, Dinoflagellata, Ciliophora miscellaneous and flagellated cells. On the basis of the theoretical assumption that ‘1 read’ is equivalent to ‘1 GCN’, we used the GCN median values per plankton group to calculate the corrected cell number and biomass relative abundances. The species-specific absolute GCN per cell were obtained from various studies published in the literature. We contributed to the development of a species-specific 18S rRNA GCN database proposed by previous authors. To assess the efficiency of the correction factor we compared the metabarcoding, morphological and corrected relative abundances (in cell number and biomass) of 15 surface water samples collected in the Belgian Coastal Zone. Results showed that the application of the correction factor over metabarcoding results enables us to significantly improve the estimates of cell abundances for Dinoflagellata, Ciliophora and flagellated cells, but not for Bacillariophyta. This is likely to due to large biovolume plasticity in diatoms not corresponding to genome size and gene copy numbers. C-biomass relative abundance estimations directly from amplicon reads were only improved for Dinoflagellata and Ciliophora. The method is still facing biases related to the low number of species GCN assessed. Nevertheless, the increase of species in the GCN database may lead to the refinement of the proposed correction factor.","PeriodicalId":18374,"journal":{"name":"Metabarcoding and Metagenomics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Towards quantitative metabarcoding of eukaryotic plankton: an approach to improve 18S rRNA gene copy number bias\",\"authors\":\"Jon Lapeyra Martin, Ioulia Santi, P. Pitta, U. John, N. Gypens\",\"doi\":\"10.3897/mbmg.6.85794\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Plankton metabarcoding is increasingly implemented in marine ecosystem assessments and is more cost-efficient and less time-consuming than monitoring based on microscopy (morphological). 18S rRNA gene is the most widely used marker for groups’ and species’ detection and classification within marine eukaryotic microorganisms. These datasets have commonly relied on the acquisition of organismal abundances directly from the number of DNA sequences (i.e. reads). Besides the inherent technical biases in metabarcoding, the largely varying 18S rRNA gene copy numbers (GCN) among marine protists (ranging from tens to thousands) is one of the most important biological biases for species quantification. In this work, we present a gene copy number correction factor (CF) for four marine planktonic groups: Bacillariophyta, Dinoflagellata, Ciliophora miscellaneous and flagellated cells. On the basis of the theoretical assumption that ‘1 read’ is equivalent to ‘1 GCN’, we used the GCN median values per plankton group to calculate the corrected cell number and biomass relative abundances. The species-specific absolute GCN per cell were obtained from various studies published in the literature. We contributed to the development of a species-specific 18S rRNA GCN database proposed by previous authors. To assess the efficiency of the correction factor we compared the metabarcoding, morphological and corrected relative abundances (in cell number and biomass) of 15 surface water samples collected in the Belgian Coastal Zone. Results showed that the application of the correction factor over metabarcoding results enables us to significantly improve the estimates of cell abundances for Dinoflagellata, Ciliophora and flagellated cells, but not for Bacillariophyta. This is likely to due to large biovolume plasticity in diatoms not corresponding to genome size and gene copy numbers. C-biomass relative abundance estimations directly from amplicon reads were only improved for Dinoflagellata and Ciliophora. The method is still facing biases related to the low number of species GCN assessed. Nevertheless, the increase of species in the GCN database may lead to the refinement of the proposed correction factor.\",\"PeriodicalId\":18374,\"journal\":{\"name\":\"Metabarcoding and Metagenomics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Metabarcoding and Metagenomics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3897/mbmg.6.85794\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Metabarcoding and Metagenomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3897/mbmg.6.85794","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Towards quantitative metabarcoding of eukaryotic plankton: an approach to improve 18S rRNA gene copy number bias
Plankton metabarcoding is increasingly implemented in marine ecosystem assessments and is more cost-efficient and less time-consuming than monitoring based on microscopy (morphological). 18S rRNA gene is the most widely used marker for groups’ and species’ detection and classification within marine eukaryotic microorganisms. These datasets have commonly relied on the acquisition of organismal abundances directly from the number of DNA sequences (i.e. reads). Besides the inherent technical biases in metabarcoding, the largely varying 18S rRNA gene copy numbers (GCN) among marine protists (ranging from tens to thousands) is one of the most important biological biases for species quantification. In this work, we present a gene copy number correction factor (CF) for four marine planktonic groups: Bacillariophyta, Dinoflagellata, Ciliophora miscellaneous and flagellated cells. On the basis of the theoretical assumption that ‘1 read’ is equivalent to ‘1 GCN’, we used the GCN median values per plankton group to calculate the corrected cell number and biomass relative abundances. The species-specific absolute GCN per cell were obtained from various studies published in the literature. We contributed to the development of a species-specific 18S rRNA GCN database proposed by previous authors. To assess the efficiency of the correction factor we compared the metabarcoding, morphological and corrected relative abundances (in cell number and biomass) of 15 surface water samples collected in the Belgian Coastal Zone. Results showed that the application of the correction factor over metabarcoding results enables us to significantly improve the estimates of cell abundances for Dinoflagellata, Ciliophora and flagellated cells, but not for Bacillariophyta. This is likely to due to large biovolume plasticity in diatoms not corresponding to genome size and gene copy numbers. C-biomass relative abundance estimations directly from amplicon reads were only improved for Dinoflagellata and Ciliophora. The method is still facing biases related to the low number of species GCN assessed. Nevertheless, the increase of species in the GCN database may lead to the refinement of the proposed correction factor.