A Transformer-Based Ensemble Framework for the Prediction of Protein-Protein Interaction Sites.

IF 11 1区 综合性期刊 Q1 Multidisciplinary
Research Pub Date : 2023-09-27 eCollection Date: 2023-01-01 DOI:10.34133/research.0240
Minjie Mou, Ziqi Pan, Zhimeng Zhou, Lingyan Zheng, Hanyu Zhang, Shuiyang Shi, Fengcheng Li, Xiuna Sun, Feng Zhu
{"title":"A Transformer-Based Ensemble Framework for the Prediction of Protein-Protein Interaction Sites.","authors":"Minjie Mou,&nbsp;Ziqi Pan,&nbsp;Zhimeng Zhou,&nbsp;Lingyan Zheng,&nbsp;Hanyu Zhang,&nbsp;Shuiyang Shi,&nbsp;Fengcheng Li,&nbsp;Xiuna Sun,&nbsp;Feng Zhu","doi":"10.34133/research.0240","DOIUrl":null,"url":null,"abstract":"<p><p>The identification of protein-protein interaction (PPI) sites is essential in the research of protein function and the discovery of new drugs. So far, a variety of computational tools based on machine learning have been developed to accelerate the identification of PPI sites. However, existing methods suffer from the low predictive accuracy or the limited scope of application. Specifically, some methods learned only global or local sequential features, leading to low predictive accuracy, while others achieved improved performance by extracting residue interactions from structures but were limited in their application scope for the serious dependence on precise structure information. There is an urgent need to develop a method that integrates comprehensive information to realize proteome-wide accurate profiling of PPI sites. Herein, a novel ensemble framework for PPI sites prediction, EnsemPPIS, was therefore proposed based on transformer and gated convolutional networks. EnsemPPIS can effectively capture not only global and local patterns but also residue interactions. Specifically, EnsemPPIS was unique in (a) extracting residue interactions from protein sequences with transformer and (b) further integrating global and local sequential features with the ensemble learning strategy. Compared with various existing methods, EnsemPPIS exhibited either superior performance or broader applicability on multiple PPI sites prediction tasks. Moreover, pattern analysis based on the interpretability of EnsemPPIS demonstrated that EnsemPPIS was fully capable of learning residue interactions within the local structure of PPI sites using only sequence information. The web server of EnsemPPIS is freely available at http://idrblab.org/ensemppis.</p>","PeriodicalId":21120,"journal":{"name":"Research","volume":"6 ","pages":"0240"},"PeriodicalIF":11.0000,"publicationDate":"2023-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10528219/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.34133/research.0240","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}
引用次数: 0

Abstract

The identification of protein-protein interaction (PPI) sites is essential in the research of protein function and the discovery of new drugs. So far, a variety of computational tools based on machine learning have been developed to accelerate the identification of PPI sites. However, existing methods suffer from the low predictive accuracy or the limited scope of application. Specifically, some methods learned only global or local sequential features, leading to low predictive accuracy, while others achieved improved performance by extracting residue interactions from structures but were limited in their application scope for the serious dependence on precise structure information. There is an urgent need to develop a method that integrates comprehensive information to realize proteome-wide accurate profiling of PPI sites. Herein, a novel ensemble framework for PPI sites prediction, EnsemPPIS, was therefore proposed based on transformer and gated convolutional networks. EnsemPPIS can effectively capture not only global and local patterns but also residue interactions. Specifically, EnsemPPIS was unique in (a) extracting residue interactions from protein sequences with transformer and (b) further integrating global and local sequential features with the ensemble learning strategy. Compared with various existing methods, EnsemPPIS exhibited either superior performance or broader applicability on multiple PPI sites prediction tasks. Moreover, pattern analysis based on the interpretability of EnsemPPIS demonstrated that EnsemPPIS was fully capable of learning residue interactions within the local structure of PPI sites using only sequence information. The web server of EnsemPPIS is freely available at http://idrblab.org/ensemppis.

Abstract Image

Abstract Image

Abstract Image

基于Transformer的蛋白质-蛋白质相互作用位点预测集成框架。
蛋白质-蛋白质相互作用(PPI)位点的鉴定在蛋白质功能研究和新药发现中至关重要。到目前为止,已经开发了各种基于机器学习的计算工具来加速PPI位点的识别。然而,现有的方法存在预测精度低或应用范围有限的问题。具体而言,一些方法只学习全局或局部序列特征,导致预测精度较低,而另一些方法通过从结构中提取残基相互作用来提高性能,但由于严重依赖于精确的结构信息,其应用范围有限。迫切需要开发一种整合综合信息的方法,以实现PPI位点的全蛋白质组精确图谱。因此,本文提出了一种新的基于变换器和门控卷积网络的PPI位点预测集成框架EnsemPPIS。EnsemPPIS不仅可以有效地捕捉全局和局部模式,还可以捕捉残留物的相互作用。具体而言,EnsemPPIS在(a)使用transformer从蛋白质序列中提取残基相互作用,以及(b)将全局和局部序列特征与集成学习策略进一步集成方面是独特的。与现有的各种方法相比,EnsemPPIS在多个PPI位点的预测任务中表现出优越的性能或更广泛的适用性。此外,基于EnsemPPIS可解释性的模式分析表明,EnsemPPIS完全能够仅使用序列信息学习PPI位点局部结构内的残基相互作用。EnsemPPIS的网络服务器可在http://idrblab.org/ensemppis.
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Research
Research Multidisciplinary-Multidisciplinary
CiteScore
13.40
自引率
3.60%
发文量
0
审稿时长
14 weeks
期刊介绍: Research serves as a global platform for academic exchange, collaboration, and technological advancements. This journal welcomes high-quality research contributions from any domain, with open arms to authors from around the globe. Comprising fundamental research in the life and physical sciences, Research also highlights significant findings and issues in engineering and applied science. The journal proudly features original research articles, reviews, perspectives, and editorials, fostering a diverse and dynamic scholarly environment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信