Emvirus: An embedding-based neural framework for human-virus protein-protein interactions prediction

IF 3.5 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Pengfei Xie , Jujuan Zhuang , Geng Tian , Jialiang Yang
{"title":"Emvirus: An embedding-based neural framework for human-virus protein-protein interactions prediction","authors":"Pengfei Xie ,&nbsp;Jujuan Zhuang ,&nbsp;Geng Tian ,&nbsp;Jialiang Yang","doi":"10.1016/j.bsheal.2023.04.003","DOIUrl":null,"url":null,"abstract":"<div><p>Human-virus protein-protein interactions (PPIs) play critical roles in viral infection. For example, the spike protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) binds primarily to human angiotensin-converting enzyme 2 (ACE2) protein to infect human cells. Thus, identifying and blocking these PPIs contribute to controlling and preventing viruses. However, wet-lab experiment-based identification of human-virus PPIs is usually expensive, labor-intensive, and time-consuming, which presents the need for computational methods. Many machine-learning methods have been proposed recently and achieved good results in predicting human-virus PPIs. However, most methods are based on protein sequence features and apply manually extracted features, such as statistical characteristics, phylogenetic profiles, and physicochemical properties. In this work, we present an embedding-based neural framework with convolutional neural network (CNN) and bi-directional long short-term memory unit (Bi-LSTM) architecture, named Emvirus, to predict human-virus PPIs (including human–SARS-CoV-2 PPIs). In addition, we conduct cross-viral experiments to explore the generalization ability of Emvirus. Compared to other feature extraction methods, Emvirus achieves better prediction accuracy.</p></div>","PeriodicalId":36178,"journal":{"name":"Biosafety and Health","volume":"5 3","pages":"Pages 152-158"},"PeriodicalIF":3.5000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10166638/pdf/","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biosafety and Health","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590053623000472","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 1

Abstract

Human-virus protein-protein interactions (PPIs) play critical roles in viral infection. For example, the spike protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) binds primarily to human angiotensin-converting enzyme 2 (ACE2) protein to infect human cells. Thus, identifying and blocking these PPIs contribute to controlling and preventing viruses. However, wet-lab experiment-based identification of human-virus PPIs is usually expensive, labor-intensive, and time-consuming, which presents the need for computational methods. Many machine-learning methods have been proposed recently and achieved good results in predicting human-virus PPIs. However, most methods are based on protein sequence features and apply manually extracted features, such as statistical characteristics, phylogenetic profiles, and physicochemical properties. In this work, we present an embedding-based neural framework with convolutional neural network (CNN) and bi-directional long short-term memory unit (Bi-LSTM) architecture, named Emvirus, to predict human-virus PPIs (including human–SARS-CoV-2 PPIs). In addition, we conduct cross-viral experiments to explore the generalization ability of Emvirus. Compared to other feature extraction methods, Emvirus achieves better prediction accuracy.

Abstract Image

Abstract Image

Abstract Image

Emvirus:一个基于嵌入的神经框架,用于预测人-病毒蛋白-蛋白相互作用
人类病毒蛋白质-蛋白质相互作用(PPIs)在病毒感染中起着至关重要的作用。例如,严重急性呼吸综合征冠状病毒2型(SARS-CoV-2)的刺突蛋白主要与人类血管紧张素转化酶2(ACE2)蛋白结合,以感染人类细胞。因此,识别和阻断这些PPI有助于控制和预防病毒。然而,基于湿实验室实验的人类病毒PPI识别通常成本高昂、劳动密集且耗时,这就需要计算方法。最近提出了许多机器学习方法,并在预测人类病毒PPI方面取得了良好的效果。然而,大多数方法都是基于蛋白质序列特征,并应用手动提取的特征,如统计特征、系统发育谱和物理化学特性。在这项工作中,我们提出了一个基于嵌入的神经框架,该框架具有卷积神经网络(CNN)和双向长短期记忆单元(bi-LSTM)架构,名为Emvirus,用于预测人类病毒PPI(包括人类严重急性呼吸系统综合征冠状病毒2型PPI)。此外,我们还进行了跨病毒实验来探索Emvirus的泛化能力。与其他特征提取方法相比,Emvirus具有更好的预测精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Biosafety and Health
Biosafety and Health Medicine-Infectious Diseases
CiteScore
7.60
自引率
0.00%
发文量
116
审稿时长
66 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信