Poplar: a phylogenomics pipeline.

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Bioinformatics advances Pub Date : 2025-05-06 eCollection Date: 2025-01-01 DOI:10.1093/bioadv/vbaf104

Elizabeth Koning, Arjun Subedi, Raga Krishnakumar

{"title":"Poplar: a phylogenomics pipeline.","authors":"Elizabeth Koning, Arjun Subedi, Raga Krishnakumar","doi":"10.1093/bioadv/vbaf104","DOIUrl":null,"url":null,"abstract":"Motivation: Generating phylogenomic trees from the genomic data is essential in understanding biological systems. Each step of this complex process has received extensive attention and has been significantly streamlined over the years. Given the public availability of data, obtaining genomes for a wide selection of species is straightforward. However, analyzing that data to generate a phylogenomic tree is a multistep process with legitimate scientific and technical challenges, often requiring a significant input from a domain-area scientist.Results: We present Poplar, a new, streamlined computational pipeline, to address the computational logistical issues that arise when constructing the phylogenomic trees. It provides a framework that runs state-of-the-art software for essential steps in the phylogenomic pipeline, beginning from a genome with or without an annotation, and resulting in a species tree. Running Poplar requires no external databases. In the execution, it enables parallelism for execution for clusters and cloud computing. The trees generated by Poplar match closely with state-of-the-art published trees. The usage and performance of Poplar is far simpler and quicker than manually running a phylogenomic pipeline.Availability and implementation: Freely available on GitHub at https://github.com/sandialabs/poplar. Implemented using Python and supported on Linux.","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf104"},"PeriodicalIF":2.4000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12159734/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Motivation: Generating phylogenomic trees from the genomic data is essential in understanding biological systems. Each step of this complex process has received extensive attention and has been significantly streamlined over the years. Given the public availability of data, obtaining genomes for a wide selection of species is straightforward. However, analyzing that data to generate a phylogenomic tree is a multistep process with legitimate scientific and technical challenges, often requiring a significant input from a domain-area scientist.

Results: We present Poplar, a new, streamlined computational pipeline, to address the computational logistical issues that arise when constructing the phylogenomic trees. It provides a framework that runs state-of-the-art software for essential steps in the phylogenomic pipeline, beginning from a genome with or without an annotation, and resulting in a species tree. Running Poplar requires no external databases. In the execution, it enables parallelism for execution for clusters and cloud computing. The trees generated by Poplar match closely with state-of-the-art published trees. The usage and performance of Poplar is far simpler and quicker than manually running a phylogenomic pipeline.

Availability and implementation: Freely available on GitHub at https://github.com/sandialabs/poplar. Implemented using Python and supported on Linux.

查看原文本刊更多论文

杨树：一个系统基因组学管道。

动机：从基因组数据中生成系统基因组树对于理解生物系统至关重要。这一复杂进程的每一步都受到广泛关注，多年来已大大精简。鉴于数据的公开可用性，获得广泛选择物种的基因组是直截了当的。然而，分析这些数据以生成系统基因组树是一个多步骤的过程，具有合理的科学和技术挑战，通常需要领域科学家的大量投入。结果：我们提出了Poplar，一个新的，流线型计算管道，以解决在构建系统基因组树时出现的计算逻辑问题。它提供了一个框架，运行最先进的软件，用于系统基因组管道中的基本步骤，从带有或不带有注释的基因组开始，并产生物种树。运行Poplar不需要外部数据库。在执行中，它支持集群和云计算执行的并行性。杨树生成的树与最新出版的树非常匹配。Poplar的使用和性能比手动运行系统基因组学管道要简单和快速得多。可用性和实现：在GitHub上免费获得https://github.com/sandialabs/poplar。使用Python实现，在Linux上支持。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Bioinformatics advances

CiteScore

1.60

自引率

0.00%

发文量