{"title":"测序读数的合并与连接:从扩增子数据全面分析微生物组的生物信息学工作流程。","authors":"Meganathan P Ramakodi","doi":"10.1093/femsle/fnae009","DOIUrl":null,"url":null,"abstract":"<p><p>A comprehensive profiling of microbial diversity is essential to understand the ecosystem functions. Universal primer sets such as the 515Y/926R could amplify a part of 16S and 18S rRNA and infer the diversity of prokaryotes and eukaryotes. However, the analyses of mixed sequencing data pose a bioinformatics challenge; the 16S and 18S rRNA sequences need to be separated first and analysed individually/independently due to variations in the amplicon length. This study describes an alternative strategy, a merging and concatenation workflow, to analyse the mixed amplicon data without separating the 16S and 18S rRNA sequences. The workflow was tested with 24 mock community (MC) samples, and the analyses resolved the composition of prokaryotes and eukaryotes adequately. In addition, there was a strong correlation (cor = 0.950; P-value = 4.754e-10) between the observed and expected abundances in the MC samples, which suggests that the computational approach could infer the microbial proportions accurately. Further, 18 samples collected from the Sundarbans mangrove region were analysed as a case study. The analyses identified Proteobacteria, Bacteroidota, Actinobacteriota, Cyanobacteria, and Crenarchaeota as dominant bacterial phyla and eukaryotic divisions such as Metazoa, Gyrista, Cryptophyta, Chlorophyta, and Dinoflagellata were found to be dominant in the samples. Thus, the results support the applicability of the method in environmental microbiome research. The merging and concatenation workflow presented here requires considerably less computational resources and uses widely/commonly used bioinformatics packages, saving researchers analyses time (for equivalent sample numbers, compared to the conventional approach) required to infer the diversity of major microbial domains from mixed amplicon data at comparable accuracy.</p>","PeriodicalId":12214,"journal":{"name":"Fems Microbiology Letters","volume":" ","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Merging and concatenation of sequencing reads: a bioinformatics workflow for the comprehensive profiling of microbiome from amplicon data.\",\"authors\":\"Meganathan P Ramakodi\",\"doi\":\"10.1093/femsle/fnae009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>A comprehensive profiling of microbial diversity is essential to understand the ecosystem functions. Universal primer sets such as the 515Y/926R could amplify a part of 16S and 18S rRNA and infer the diversity of prokaryotes and eukaryotes. However, the analyses of mixed sequencing data pose a bioinformatics challenge; the 16S and 18S rRNA sequences need to be separated first and analysed individually/independently due to variations in the amplicon length. This study describes an alternative strategy, a merging and concatenation workflow, to analyse the mixed amplicon data without separating the 16S and 18S rRNA sequences. The workflow was tested with 24 mock community (MC) samples, and the analyses resolved the composition of prokaryotes and eukaryotes adequately. In addition, there was a strong correlation (cor = 0.950; P-value = 4.754e-10) between the observed and expected abundances in the MC samples, which suggests that the computational approach could infer the microbial proportions accurately. Further, 18 samples collected from the Sundarbans mangrove region were analysed as a case study. The analyses identified Proteobacteria, Bacteroidota, Actinobacteriota, Cyanobacteria, and Crenarchaeota as dominant bacterial phyla and eukaryotic divisions such as Metazoa, Gyrista, Cryptophyta, Chlorophyta, and Dinoflagellata were found to be dominant in the samples. Thus, the results support the applicability of the method in environmental microbiome research. The merging and concatenation workflow presented here requires considerably less computational resources and uses widely/commonly used bioinformatics packages, saving researchers analyses time (for equivalent sample numbers, compared to the conventional approach) required to infer the diversity of major microbial domains from mixed amplicon data at comparable accuracy.</p>\",\"PeriodicalId\":12214,\"journal\":{\"name\":\"Fems Microbiology Letters\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-01-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fems Microbiology Letters\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/femsle/fnae009\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fems Microbiology Letters","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/femsle/fnae009","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
Merging and concatenation of sequencing reads: a bioinformatics workflow for the comprehensive profiling of microbiome from amplicon data.
A comprehensive profiling of microbial diversity is essential to understand the ecosystem functions. Universal primer sets such as the 515Y/926R could amplify a part of 16S and 18S rRNA and infer the diversity of prokaryotes and eukaryotes. However, the analyses of mixed sequencing data pose a bioinformatics challenge; the 16S and 18S rRNA sequences need to be separated first and analysed individually/independently due to variations in the amplicon length. This study describes an alternative strategy, a merging and concatenation workflow, to analyse the mixed amplicon data without separating the 16S and 18S rRNA sequences. The workflow was tested with 24 mock community (MC) samples, and the analyses resolved the composition of prokaryotes and eukaryotes adequately. In addition, there was a strong correlation (cor = 0.950; P-value = 4.754e-10) between the observed and expected abundances in the MC samples, which suggests that the computational approach could infer the microbial proportions accurately. Further, 18 samples collected from the Sundarbans mangrove region were analysed as a case study. The analyses identified Proteobacteria, Bacteroidota, Actinobacteriota, Cyanobacteria, and Crenarchaeota as dominant bacterial phyla and eukaryotic divisions such as Metazoa, Gyrista, Cryptophyta, Chlorophyta, and Dinoflagellata were found to be dominant in the samples. Thus, the results support the applicability of the method in environmental microbiome research. The merging and concatenation workflow presented here requires considerably less computational resources and uses widely/commonly used bioinformatics packages, saving researchers analyses time (for equivalent sample numbers, compared to the conventional approach) required to infer the diversity of major microbial domains from mixed amplicon data at comparable accuracy.
期刊介绍:
FEMS Microbiology Letters gives priority to concise papers that merit rapid publication by virtue of their originality, general interest and contribution to new developments in microbiology. All aspects of microbiology, including virology, are covered.
2019 Impact Factor: 1.987, Journal Citation Reports (Source Clarivate, 2020)
Ranking: 98/135 (Microbiology)
The journal is divided into eight Sections:
Physiology and Biochemistry (including genetics, molecular biology and ‘omic’ studies)
Food Microbiology (from food production and biotechnology to spoilage and food borne pathogens)
Biotechnology and Synthetic Biology
Pathogens and Pathogenicity (including medical, veterinary, plant and insect pathogens – particularly those relating to food security – with the exception of viruses)
Environmental Microbiology (including ecophysiology, ecogenomics and meta-omic studies)
Virology (viruses infecting any organism, including Bacteria and Archaea)
Taxonomy and Systematics (for publication of novel taxa, taxonomic reclassifications and reviews of a taxonomic nature)
Professional Development (including education, training, CPD, research assessment frameworks, research and publication metrics, best-practice, careers and history of microbiology)
If you are unsure which Section is most appropriate for your manuscript, for example in the case of transdisciplinary studies, we recommend that you contact the Editor-In-Chief by email prior to submission. Our scope includes any type of microorganism - all members of the Bacteria and the Archaea and microbial members of the Eukarya (yeasts, filamentous fungi, microbial algae, protozoa, oomycetes, myxomycetes, etc.) as well as all viruses.