Poplar: a phylogenomics pipeline.

IF 2.4 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Bioinformatics advances Pub Date : 2025-05-06 eCollection Date: 2025-01-01 DOI:10.1093/bioadv/vbaf104
Elizabeth Koning, Arjun Subedi, Raga Krishnakumar
{"title":"Poplar: a phylogenomics pipeline.","authors":"Elizabeth Koning, Arjun Subedi, Raga Krishnakumar","doi":"10.1093/bioadv/vbaf104","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Generating phylogenomic trees from the genomic data is essential in understanding biological systems. Each step of this complex process has received extensive attention and has been significantly streamlined over the years. Given the public availability of data, obtaining genomes for a wide selection of species is straightforward. However, analyzing that data to generate a phylogenomic tree is a multistep process with legitimate scientific and technical challenges, often requiring a significant input from a domain-area scientist.</p><p><strong>Results: </strong>We present Poplar, a new, streamlined computational pipeline, to address the computational logistical issues that arise when constructing the phylogenomic trees. It provides a framework that runs state-of-the-art software for essential steps in the phylogenomic pipeline, beginning from a genome with or without an annotation, and resulting in a species tree. Running Poplar requires no external databases. In the execution, it enables parallelism for execution for clusters and cloud computing. The trees generated by Poplar match closely with state-of-the-art published trees. The usage and performance of Poplar is far simpler and quicker than manually running a phylogenomic pipeline.</p><p><strong>Availability and implementation: </strong>Freely available on GitHub at https://github.com/sandialabs/poplar. Implemented using Python and supported on Linux.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf104"},"PeriodicalIF":2.4000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12159734/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Generating phylogenomic trees from the genomic data is essential in understanding biological systems. Each step of this complex process has received extensive attention and has been significantly streamlined over the years. Given the public availability of data, obtaining genomes for a wide selection of species is straightforward. However, analyzing that data to generate a phylogenomic tree is a multistep process with legitimate scientific and technical challenges, often requiring a significant input from a domain-area scientist.

Results: We present Poplar, a new, streamlined computational pipeline, to address the computational logistical issues that arise when constructing the phylogenomic trees. It provides a framework that runs state-of-the-art software for essential steps in the phylogenomic pipeline, beginning from a genome with or without an annotation, and resulting in a species tree. Running Poplar requires no external databases. In the execution, it enables parallelism for execution for clusters and cloud computing. The trees generated by Poplar match closely with state-of-the-art published trees. The usage and performance of Poplar is far simpler and quicker than manually running a phylogenomic pipeline.

Availability and implementation: Freely available on GitHub at https://github.com/sandialabs/poplar. Implemented using Python and supported on Linux.

杨树:一个系统基因组学管道。
动机:从基因组数据中生成系统基因组树对于理解生物系统至关重要。这一复杂进程的每一步都受到广泛关注,多年来已大大精简。鉴于数据的公开可用性,获得广泛选择物种的基因组是直截了当的。然而,分析这些数据以生成系统基因组树是一个多步骤的过程,具有合理的科学和技术挑战,通常需要领域科学家的大量投入。结果:我们提出了Poplar,一个新的,流线型计算管道,以解决在构建系统基因组树时出现的计算逻辑问题。它提供了一个框架,运行最先进的软件,用于系统基因组管道中的基本步骤,从带有或不带有注释的基因组开始,并产生物种树。运行Poplar不需要外部数据库。在执行中,它支持集群和云计算执行的并行性。杨树生成的树与最新出版的树非常匹配。Poplar的使用和性能比手动运行系统基因组学管道要简单和快速得多。可用性和实现:在GitHub上免费获得https://github.com/sandialabs/poplar。使用Python实现,在Linux上支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信