Andrey Prjibelski, Dmitry Antipov, Dmitry Meleshko, Alla Lapidus, Anton Korobeynikov
下载PDF
{"title":"使用黑桃从头组装","authors":"Andrey Prjibelski, Dmitry Antipov, Dmitry Meleshko, Alla Lapidus, Anton Korobeynikov","doi":"10.1002/cpbi.102","DOIUrl":null,"url":null,"abstract":"<p>SPAdes—St. Petersburg genome Assembler—was originally developed for de novo assembly of genome sequencing data produced for cultivated microbial isolates and for single-cell genomic DNA sequencing. With time, the functionality of SPAdes was extended to enable assembly of IonTorrent data, as well as hybrid assembly from short and long reads (PacBio and Oxford Nanopore). In this article we present protocols for five different assembly pipelines that comprise the SPAdes package and that are used for assembly of metagenomes and transcriptomes as well as assembly of putative plasmids and biosynthetic gene clusters from whole-genome sequencing and metagenomic datasets. In addition, we present guidelines for understanding results with use cases for each pipeline, and several additional support protocols that help in using SPAdes properly. © 2020 Wiley Periodicals LLC.</p><p><b>Basic Protocol 1</b>: Assembling isolate bacterial datasets</p><p><b>Basic Protocol 2</b>: Assembling metagenomic datasets</p><p><b>Basic Protocol 3</b>: Assembling sets of putative plasmids</p><p><b>Basic Protocol 4</b>: Assembling transcriptomes</p><p><b>Basic Protocol 5</b>: Assembling putative biosynthetic gene clusters</p><p><b>Support Protocol 1</b>: Installing SPAdes</p><p><b>Support Protocol 2</b>: Providing input via command line</p><p><b>Support Protocol 3</b>: Providing input data via YAML format</p><p><b>Support Protocol 4</b>: Restarting previous run</p><p><b>Support Protocol 5</b>: Determining strand-specificity of RNA-seq data</p>","PeriodicalId":10958,"journal":{"name":"Current protocols in bioinformatics","volume":"70 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1002/cpbi.102","citationCount":"804","resultStr":"{\"title\":\"Using SPAdes De Novo Assembler\",\"authors\":\"Andrey Prjibelski, Dmitry Antipov, Dmitry Meleshko, Alla Lapidus, Anton Korobeynikov\",\"doi\":\"10.1002/cpbi.102\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>SPAdes—St. Petersburg genome Assembler—was originally developed for de novo assembly of genome sequencing data produced for cultivated microbial isolates and for single-cell genomic DNA sequencing. With time, the functionality of SPAdes was extended to enable assembly of IonTorrent data, as well as hybrid assembly from short and long reads (PacBio and Oxford Nanopore). In this article we present protocols for five different assembly pipelines that comprise the SPAdes package and that are used for assembly of metagenomes and transcriptomes as well as assembly of putative plasmids and biosynthetic gene clusters from whole-genome sequencing and metagenomic datasets. In addition, we present guidelines for understanding results with use cases for each pipeline, and several additional support protocols that help in using SPAdes properly. © 2020 Wiley Periodicals LLC.</p><p><b>Basic Protocol 1</b>: Assembling isolate bacterial datasets</p><p><b>Basic Protocol 2</b>: Assembling metagenomic datasets</p><p><b>Basic Protocol 3</b>: Assembling sets of putative plasmids</p><p><b>Basic Protocol 4</b>: Assembling transcriptomes</p><p><b>Basic Protocol 5</b>: Assembling putative biosynthetic gene clusters</p><p><b>Support Protocol 1</b>: Installing SPAdes</p><p><b>Support Protocol 2</b>: Providing input via command line</p><p><b>Support Protocol 3</b>: Providing input data via YAML format</p><p><b>Support Protocol 4</b>: Restarting previous run</p><p><b>Support Protocol 5</b>: Determining strand-specificity of RNA-seq data</p>\",\"PeriodicalId\":10958,\"journal\":{\"name\":\"Current protocols in bioinformatics\",\"volume\":\"70 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1002/cpbi.102\",\"citationCount\":\"804\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Current protocols in bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpbi.102\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Biochemistry, Genetics and Molecular Biology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current protocols in bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpbi.102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 804
引用
批量引用
Using SPAdes De Novo Assembler
SPAdes—St. Petersburg genome Assembler—was originally developed for de novo assembly of genome sequencing data produced for cultivated microbial isolates and for single-cell genomic DNA sequencing. With time, the functionality of SPAdes was extended to enable assembly of IonTorrent data, as well as hybrid assembly from short and long reads (PacBio and Oxford Nanopore). In this article we present protocols for five different assembly pipelines that comprise the SPAdes package and that are used for assembly of metagenomes and transcriptomes as well as assembly of putative plasmids and biosynthetic gene clusters from whole-genome sequencing and metagenomic datasets. In addition, we present guidelines for understanding results with use cases for each pipeline, and several additional support protocols that help in using SPAdes properly. © 2020 Wiley Periodicals LLC.
Basic Protocol 1 : Assembling isolate bacterial datasets
Basic Protocol 2 : Assembling metagenomic datasets
Basic Protocol 3 : Assembling sets of putative plasmids
Basic Protocol 4 : Assembling transcriptomes
Basic Protocol 5 : Assembling putative biosynthetic gene clusters
Support Protocol 1 : Installing SPAdes
Support Protocol 2 : Providing input via command line
Support Protocol 3 : Providing input data via YAML format
Support Protocol 4 : Restarting previous run
Support Protocol 5 : Determining strand-specificity of RNA-seq data