Using SPAdes De Novo Assembler

Andrey Prjibelski, Dmitry Antipov, Dmitry Meleshko, Alla Lapidus, Anton Korobeynikov
Current Protocols in Bioinformatics, volume 70, issue 1. Published 2020-06-19. DOI: 10.1002/cpbi.102
Journal category: Biochemistry, Genetics and Molecular Biology (JCR Q1)
Citations: 804

Abstract

SPAdes—St. Petersburg genome Assembler—was originally developed for de novo assembly of genome sequencing data produced for cultivated microbial isolates and for single-cell genomic DNA sequencing. With time, the functionality of SPAdes was extended to enable assembly of IonTorrent data, as well as hybrid assembly from short and long reads (PacBio and Oxford Nanopore). In this article we present protocols for five different assembly pipelines that comprise the SPAdes package and that are used for assembly of metagenomes and transcriptomes as well as assembly of putative plasmids and biosynthetic gene clusters from whole-genome sequencing and metagenomic datasets. In addition, we present guidelines for understanding results with use cases for each pipeline, and several additional support protocols that help in using SPAdes properly. © 2020 Wiley Periodicals LLC.

Basic Protocol 1: Assembling isolate bacterial datasets

Basic Protocol 2: Assembling metagenomic datasets

Basic Protocol 3: Assembling sets of putative plasmids

Basic Protocol 4: Assembling transcriptomes

Basic Protocol 5: Assembling putative biosynthetic gene clusters

Support Protocol 1: Installing SPAdes

Support Protocol 2: Providing input via command line

Support Protocol 3: Providing input data via YAML format

Support Protocol 4: Restarting previous run

Support Protocol 5: Determining strand-specificity of RNA-seq data
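
As a rough illustration of how the five basic protocols map onto command-line invocations, the sketch below builds argument lists for `spades.py`. The mode flags (`--isolate`, `--meta`, `--plasmid`, `--rna`, `--bio`) and the `-1`, `-2`, `-o`, `-t`, and `--continue` options follow the SPAdes manual as we understand it for the 3.14 release; they are not quoted from the abstract above, so check them against `spades.py --help` for your installed version. The helper names and file names are hypothetical.

```python
from typing import List, Optional

# Assumed mapping from protocol to SPAdes mode flag (per the SPAdes manual,
# not the abstract itself); verify with `spades.py --help`.
PIPELINE_FLAGS = {
    "isolate": "--isolate",      # Basic Protocol 1
    "metagenome": "--meta",      # Basic Protocol 2
    "plasmid": "--plasmid",      # Basic Protocol 3
    "transcriptome": "--rna",    # Basic Protocol 4
    "bgc": "--bio",              # Basic Protocol 5
}


def build_spades_command(
    pipeline: str,
    left_reads: str,
    right_reads: str,
    out_dir: str,
    threads: Optional[int] = None,
) -> List[str]:
    """Return an argument list for one paired-end library (hypothetical helper)."""
    if pipeline not in PIPELINE_FLAGS:
        raise ValueError(f"unknown pipeline: {pipeline!r}")
    cmd = [
        "spades.py",
        PIPELINE_FLAGS[pipeline],
        "-1", left_reads,    # forward reads
        "-2", right_reads,   # reverse reads
        "-o", out_dir,       # output directory
    ]
    if threads is not None:
        cmd += ["-t", str(threads)]
    return cmd


def build_continue_command(out_dir: str) -> List[str]:
    """Return an argument list that resumes an interrupted run (Support Protocol 4)."""
    return ["spades.py", "--continue", "-o", out_dir]


if __name__ == "__main__":
    # Print the commands instead of running them, so the sketch works
    # without SPAdes installed.
    print(" ".join(build_spades_command("metagenome", "r1.fq.gz", "r2.fq.gz", "meta_out", threads=8)))
    print(" ".join(build_continue_command("meta_out")))
```

To actually launch an assembly, the returned list could be passed to `subprocess.run(cmd, check=True)`. Building a list rather than a shell string avoids quoting problems with file paths.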

Source journal: Current Protocols in Bioinformatics (Biochemistry, Genetics and Molecular Biology: Biochemistry)
Self-citation rate: 0.00%
Articles published: 0
Journal description: With Current Protocols in Bioinformatics, it's easier than ever for the life scientist to become "fluent" in bioinformatics and master the exciting new frontiers opened up by DNA sequencing. Updated every three months in all formats, CPBI is constantly evolving to keep pace with the very latest discoveries and developments.