Comparative analysis of HiSeq3000 and BGISEQ-500 sequencing platform over whole genome sequencing metagenomics data

Genomics & Informatics Pub Date : 2023-12-31 DOI:10.5808/gi.23072

Animesh Kumar, E. Robertsen, N. Willassen, Juan Fu, Erik Hjerde

{"title":"Comparative analysis of HiSeq3000 and BGISEQ-500 sequencing platform over whole genome sequencing metagenomics data","authors":"Animesh Kumar, E. Robertsen, N. Willassen, Juan Fu, Erik Hjerde","doi":"10.5808/gi.23072","DOIUrl":null,"url":null,"abstract":"Recent advances in sequencing technologies and platforms have enabled to generate metagenomics sequences using different sequencing platforms. In this study, we analyzed and compared shotgun metagenomic sequences generated by HiSeq3000 and BGISEQ-500 platforms from 12 sediment samples collected across the Norwegian coast. Metagenomics DNA sequences were normalized to an equal number of bases for both platforms and further evaluated by using different taxonomic classifiers, reference databases, and assemblers. Normalized BGISEQ-500 sequences retained more reads and base counts after preprocessing, while a slightly higher fraction of HiSeq3000 sequences were taxonomically classified. Kaiju classified a higher percentage of reads relative to Kraken2 for both platforms, and comparison of reference database for taxonomic classification showed that MAR database outperformed RefSeq. Assembly using MEGAHIT produced longer assemblies and higher total contigs count in majority of HiSeq3000 samples than using metaSPAdes, but the assembly statistics notably improved with unprocessed or normalized reads. Our results indicate that both platforms perform comparably in terms of the percentage of taxonomically classified reads and assembled contig statistics for metagenomics samples. This study provides valuable insights for researchers in selecting an appropriate sequencing platform and bioinformatics pipeline for their metagenomics studies.","PeriodicalId":197222,"journal":{"name":"Genomics & Informatics","volume":"108 14","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics & Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5808/gi.23072","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Recent advances in sequencing technologies and platforms have enabled to generate metagenomics sequences using different sequencing platforms. In this study, we analyzed and compared shotgun metagenomic sequences generated by HiSeq3000 and BGISEQ-500 platforms from 12 sediment samples collected across the Norwegian coast. Metagenomics DNA sequences were normalized to an equal number of bases for both platforms and further evaluated by using different taxonomic classifiers, reference databases, and assemblers. Normalized BGISEQ-500 sequences retained more reads and base counts after preprocessing, while a slightly higher fraction of HiSeq3000 sequences were taxonomically classified. Kaiju classified a higher percentage of reads relative to Kraken2 for both platforms, and comparison of reference database for taxonomic classification showed that MAR database outperformed RefSeq. Assembly using MEGAHIT produced longer assemblies and higher total contigs count in majority of HiSeq3000 samples than using metaSPAdes, but the assembly statistics notably improved with unprocessed or normalized reads. Our results indicate that both platforms perform comparably in terms of the percentage of taxonomically classified reads and assembled contig statistics for metagenomics samples. This study provides valuable insights for researchers in selecting an appropriate sequencing platform and bioinformatics pipeline for their metagenomics studies.

查看原文本刊更多论文

HiSeq3000 和 BGISEQ-500 测序平台对全基因组测序元基因组学数据的比较分析

测序技术和平台的最新进展使得利用不同测序平台生成元基因组序列成为可能。在这项研究中，我们分析并比较了从挪威海岸采集的12个沉积物样本中，由HiSeq3000和BGISEQ-500平台生成的枪式元基因组序列。两种平台的元基因组 DNA 序列均归一化为相同的碱基数，并通过使用不同的分类分类器、参考数据库和组合器进行进一步评估。归一化的 BGISEQ-500 序列在预处理后保留了更多的读数和碱基数，而 HiSeq3000 序列的分类比例略高。与 Kraken2 相比，Kaiju 对两种平台的读数进行分类的比例更高，对分类参考数据库进行比较后发现，MAR 数据库的分类结果优于 RefSeq。与使用 metaSPAdes 相比，在大多数 HiSeq3000 样本中，使用 MEGAHIT 进行的组装产生了更长的组装和更高的总片段数，但使用未经处理或归一化的读数时，组装统计量明显提高。我们的研究结果表明，这两种平台在元基因组学样本的分类读数百分比和组装等位基因统计方面表现相当。这项研究为研究人员选择合适的测序平台和生物信息学流水线进行元基因组学研究提供了有价值的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Genomics & Informatics

自引率

0.00%

发文量