Phage Commander, an Application for Rapid Gene Identification in Bacteriophage Genomes Using Multiple Programs.

PHAGE (New Rochelle, N.Y.) Pub Date: 2021-12-01 Epub Date: 2021-12-16 DOI: 10.1089/phage.2020.0044
Matt Lazeroff, Geordie Ryder, Sarah L Harris, Philippos K Tsourkas
Pages: 204-213 PDF: https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/03/71/phage.2020.0044.PMC9041506.pdf
Citations: 10

Abstract

The number of sequenced bacteriophage genomes is growing at an exponential rate. The majority of sequenced bacteriophage genomes are annotated by one or more of several freely available gene identification programs (Glimmer, GeneMark, RAST, Prodigal, etc.). No program has been shown to consistently outperform the others; thus, the choice of which program to use is not obvious. We present the Phage Commander application for rapid identification of bacteriophage genes using multiple gene identification programs. Phage Commander runs a bacteriophage genome sequence through nine gene identification programs (and an additional program for identification of tRNAs) and integrates the results within a single output table. Phage Commander also generates formatted output files for direct export to National Center for Biotechnology Information GenBank or genome visualization programs such as DNA Master. Users can select the threshold for which genes to export (genes identified by at least one program, genes identified by at least two programs, etc.). Phage Commander was benchmarked using eight high-quality bacteriophage genomes whose genes are backed by experimental data. Our results show that the most accurate annotations are obtained by exporting genes identified by at least two or three programs. Many groups opt to manually curate the annotations obtained from gene identification programs, and Phage Commander was designed to facilitate manual curation of genome annotations. Our benchmarking results show that manual curation does indeed produce more accurate annotations than any individual gene identification program. The authors thus recommend manually curating the output of Phage Commander to generate maximally accurate annotations. Phage Commander is currently being used in the corresponding author's bacteriophage genome annotation class and has reduced the labor cost and improved the quality of genome annotations.
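
The export threshold described in the abstract (keep genes identified by at least one program, at least two programs, and so on) can be illustrated with a minimal sketch. This is not Phage Commander's own code: the function name `consensus_calls`, the program labels, and the coordinates are hypothetical, and the rule that two calls refer to the same gene when they share a stop coordinate and strand is an assumption made for this example. Where programs disagree on the start, the sketch reports every proposed start and leaves the choice to manual curation.

```python
from collections import defaultdict


def consensus_calls(calls_by_program, min_programs):
    """Return gene calls supported by at least `min_programs` programs.

    calls_by_program maps a program name to a list of (start, stop, strand)
    tuples. Assumption for this sketch: calls with the same stop coordinate
    and strand are treated as the same gene, even if their starts differ.
    """
    support = defaultdict(lambda: {"programs": set(), "starts": set()})
    for program, calls in calls_by_program.items():
        for start, stop, strand in calls:
            entry = support[(stop, strand)]
            entry["programs"].add(program)  # a set, so each program counts once
            entry["starts"].add(start)

    rows = []
    for (stop, strand), entry in support.items():
        if len(entry["programs"]) >= min_programs:
            rows.append({
                "stop": stop,
                "strand": strand,
                "starts": sorted(entry["starts"]),      # all proposed starts
                "programs": sorted(entry["programs"]),  # who called this gene
            })
    return sorted(rows, key=lambda row: (row["stop"], row["strand"]))


# Hypothetical calls from three unnamed programs on a made-up genome.
example_calls = {
    "program_a": [(100, 400, "+"), (900, 500, "-")],
    "program_b": [(130, 400, "+"), (1200, 1500, "+")],
    "program_c": [(100, 400, "+")],
}

for row in consensus_calls(example_calls, min_programs=2):
    print(row)
```

Raising `min_programs` trades sensitivity for specificity: a threshold of one keeps every call from every program, while higher thresholds keep only genes that several programs agree on.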
