基于注释基因组特征和神经网络的噬菌体宿主预测新工具vHULK

PHAGE (New Rochelle, N.Y.) Pub Date : 2022-12-01 Epub Date: 2022-12-19 DOI:10.1089/phage.2021.0016
Deyvid Amgarten, Bruno Koshin Vázquez Iha, Carlos Morais Piroupo, Aline Maria da Silva, João Carlos Setubal
{"title":"基于注释基因组特征和神经网络的噬菌体宿主预测新工具vHULK","authors":"Deyvid Amgarten, Bruno Koshin Vázquez Iha, Carlos Morais Piroupo, Aline Maria da Silva, João Carlos Setubal","doi":"10.1089/phage.2021.0016","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The experimental determination of a bacteriophage host is a laborious procedure. Thus, there is a pressing need for reliable computational predictions of bacteriophage hosts.</p><p><strong>Materials and methods: </strong>We developed the program vHULK for phage host prediction based on 9504 phage genome features, which consider alignment significance scores between predicted proteins and a curated database of viral protein families. The features were fed to a neural network, and two models were trained to predict 77 host genera and 118 host species.</p><p><strong>Results: </strong>In controlled random test sets with 90% redundancy reduction in terms of protein similarity, vHULK obtained on average 83% precision and 79% recall at the genus level, and 71% precision and 67% recall at the species level. The performance of vHULK was compared against three other tools on a test data set with 2153 phage genomes. On this data set, vHULK achieved better performance at both the genus and the species levels than the other tools.</p><p><strong>Conclusions: </strong>Our results suggest that vHULK represents an advance on the state of art in phage host prediction.</p>","PeriodicalId":74428,"journal":{"name":"PHAGE (New Rochelle, N.Y.)","volume":"3 4","pages":"204-212"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9917316/pdf/","citationCount":"3","resultStr":"{\"title\":\"vHULK, a New Tool for Bacteriophage Host Prediction Based on Annotated Genomic Features and Neural Networks.\",\"authors\":\"Deyvid Amgarten, Bruno Koshin Vázquez Iha, Carlos Morais Piroupo, Aline Maria da Silva, João Carlos Setubal\",\"doi\":\"10.1089/phage.2021.0016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The experimental determination of a bacteriophage host is a laborious procedure. Thus, there is a pressing need for reliable computational predictions of bacteriophage hosts.</p><p><strong>Materials and methods: </strong>We developed the program vHULK for phage host prediction based on 9504 phage genome features, which consider alignment significance scores between predicted proteins and a curated database of viral protein families. The features were fed to a neural network, and two models were trained to predict 77 host genera and 118 host species.</p><p><strong>Results: </strong>In controlled random test sets with 90% redundancy reduction in terms of protein similarity, vHULK obtained on average 83% precision and 79% recall at the genus level, and 71% precision and 67% recall at the species level. The performance of vHULK was compared against three other tools on a test data set with 2153 phage genomes. On this data set, vHULK achieved better performance at both the genus and the species levels than the other tools.</p><p><strong>Conclusions: </strong>Our results suggest that vHULK represents an advance on the state of art in phage host prediction.</p>\",\"PeriodicalId\":74428,\"journal\":{\"name\":\"PHAGE (New Rochelle, N.Y.)\",\"volume\":\"3 4\",\"pages\":\"204-212\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9917316/pdf/\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PHAGE (New Rochelle, N.Y.)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1089/phage.2021.0016\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2022/12/19 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PHAGE (New Rochelle, N.Y.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1089/phage.2021.0016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/12/19 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

背景:噬菌体宿主的实验测定是一个费力的过程。因此,迫切需要对噬菌体宿主进行可靠的计算预测。材料和方法:我们基于9504个噬菌体基因组特征开发了噬菌体宿主预测程序vHULK,该程序考虑了预测蛋白与病毒蛋白家族数据库之间的比对显著性评分。将这些特征输入到一个神经网络中,训练两个模型来预测77个宿主属和118个宿主种。结果:在蛋白质相似性冗余度降低90%的受控随机测试集中,vHULK在属水平上的平均准确率为83%,召回率为79%,在种水平上的平均准确率为71%,召回率为67%。在包含2153个噬菌体基因组的测试数据集上,将vHULK与其他三种工具的性能进行了比较。在该数据集上,vHULK在属和种水平上都比其他工具取得了更好的性能。结论:我们的研究结果表明,vHULK代表了噬菌体宿主预测的最新进展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
vHULK, a New Tool for Bacteriophage Host Prediction Based on Annotated Genomic Features and Neural Networks.

Background: The experimental determination of a bacteriophage host is a laborious procedure. Thus, there is a pressing need for reliable computational predictions of bacteriophage hosts.

Materials and methods: We developed the program vHULK for phage host prediction based on 9504 phage genome features, which consider alignment significance scores between predicted proteins and a curated database of viral protein families. The features were fed to a neural network, and two models were trained to predict 77 host genera and 118 host species.

Results: In controlled random test sets with 90% redundancy reduction in terms of protein similarity, vHULK obtained on average 83% precision and 79% recall at the genus level, and 71% precision and 67% recall at the species level. The performance of vHULK was compared against three other tools on a test data set with 2153 phage genomes. On this data set, vHULK achieved better performance at both the genus and the species levels than the other tools.

Conclusions: Our results suggest that vHULK represents an advance on the state of art in phage host prediction.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信