Chris Jackson, Todd McLay, Alexander N. Schmidt-Lebuhn
hybpiper-nf and paragone-nf: Containerization and additional options for target capture assembly and paralog resolution
Applications in Plant Sciences. Published 2023-07-17. DOI: 10.1002/aps3.11532
Citations: 2
Abstract
Premise
The HybPiper pipeline has become one of the most widely used tools for the assembly of target capture data for phylogenomic analysis. After the production of locus sequences and before phylogenetic analysis, the identification of paralogs is a critical step for ensuring the accurate inference of evolutionary relationships. Algorithmic approaches using gene tree topologies for the inference of ortholog groups are computationally efficient and broadly applicable to non-model organisms, especially in the absence of a known species tree.
Methods and Results
We containerized and expanded the functionality of both HybPiper and a pipeline for the inference of ortholog groups, providing novel options for the treatment of target capture sequence data and allowing the outputs of the former to be used directly as inputs for the latter. The Singularity container presented here includes all dependencies, and the corresponding pipelines (hybpiper-nf and paragone-nf, respectively) are implemented as two Nextflow scripts, which simplifies deployment and greatly reduces the number of commands required to run them.
Conclusions
The hybpiper-nf and paragone-nf pipelines are easily installed and provide a user-friendly experience and robust results to the phylogenetic community. They are used by the Australian Angiosperm Tree of Life project. The pipelines are available at https://github.com/chrisjackson-pellicle/hybpiper-nf and https://github.com/chrisjackson-pellicle/paragone-nf.
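The Premise notes that paralogs must be identified from gene trees before phylogenetic analysis. As a rough illustration of the simplest form of that check, the sketch below flags taxa that are represented by more than one sequence in a gene tree. It is not the algorithm implemented in paragone-nf. The function names, the example tree, and the `taxon.copy` leaf-naming convention (which would break for taxon names that themselves contain the separator) are all assumptions made for this example.

```python
from __future__ import annotations

import re
from collections import Counter


def leaf_labels(newick: str) -> list[str]:
    """Return leaf labels from a Newick string.

    Assumes unquoted labels: a leaf label directly follows '(' or ','
    and ends at ':', ',', ')' or ';'. Internal node labels follow ')'
    and are therefore not matched.
    """
    return re.findall(r"[(,]\s*([^():,;\s]+)", newick)


def taxa_with_multiple_copies(newick: str, sep: str = ".") -> dict[str, int]:
    """Map each taxon with more than one leaf to its number of leaves.

    Leaf names are assumed (hypothetically) to look like 'taxon<sep>copy',
    so everything before the first separator is treated as the taxon name.
    """
    counts = Counter(label.split(sep)[0] for label in leaf_labels(newick))
    return {taxon: n for taxon, n in counts.items() if n > 1}


# Made-up gene tree: sp1 is represented by two sequences.
example_tree = "((sp1.main:0.1,sp2.main:0.1):0.2,(sp1.0:0.3,sp3.main:0.1):0.2);"

flagged = taxa_with_multiple_copies(example_tree)
print(flagged)  # {'sp1': 2}
```

A locus for which this returns an empty dictionary contains at most one sequence per taxon. A non-empty result only indicates that the locus needs paralog resolution. Deciding which copies are orthologous requires the tree topology, which this sketch ignores.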