DP-site: A dual deep learning-based method for protein-peptide interaction site prediction

IF 4.2 3区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Shima Shafiee , Abdolhossein Fathi , Ghazaleh Taherzadeh
{"title":"DP-site: A dual deep learning-based method for protein-peptide interaction site prediction","authors":"Shima Shafiee ,&nbsp;Abdolhossein Fathi ,&nbsp;Ghazaleh Taherzadeh","doi":"10.1016/j.ymeth.2024.06.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Protein-peptide interaction prediction is an important topic for several applications including various biological processes, understanding drug discovery, protein function abnormal cellular behaviors, and treating diseases. Over the years, studies have shown that experimental methods have improved the identification of this bio-molecular interaction. However, predicting protein-peptide interactions using these methods is laborious, time-consuming, dependent on third-party tools, and costly.</p></div><div><h3>Method</h3><p>To address these previous drawbacks, this study introduces a computational framework called DP-Site. The proposed framework concentrates on using a compound of a dual pipeline along with a combination predictor. A deep convolutional neural network for feature extraction and classification is embedded in pipeline 1. In addition, pipeline 2 includes a deep long-short-term memory<strong>-</strong>based and a random forest classifier for feature extraction and classification. In this investigation, the evolutionary, structure-based, sequence-based, and physicochemical information of proteins is utilized for identifying protein-peptide interaction at the residue level.</p></div><div><h3>Results</h3><p>The proposed method is evaluated on both the ten-fold cross-validation and independent test sets. The robust and consistent results between cross-validation and independent test sets confirm the ability of the proposed method to predict peptide binding residues in proteins. Moreover, experimental findings demonstrate that DP-Site has significantly outperformed other state-of-the-art sequence-based and structure-based methods. The proposed method achieves a remarkable balance between a specificity of 0.799 and a sensitivity of 0.770, along with the best f-measure of 0.661 and the highest precision of 0.580 using an independent test set.</p></div><div><h3>Conclusions</h3><p>The outcome of various experiments confirms the proficiency of the proposed method and outperforms state-of-the-art sequence-based and structure-based methods in terms of the mentioned criteria. DP-Site can be accessed at <span>https://github.com/shafiee</span><svg><path></path></svg> 95/shima.shafiee.DP-Site.</p></div>","PeriodicalId":390,"journal":{"name":"Methods","volume":"229 ","pages":"Pages 17-29"},"PeriodicalIF":4.2000,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1046202324001439","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Protein-peptide interaction prediction is an important topic for several applications including various biological processes, understanding drug discovery, protein function abnormal cellular behaviors, and treating diseases. Over the years, studies have shown that experimental methods have improved the identification of this bio-molecular interaction. However, predicting protein-peptide interactions using these methods is laborious, time-consuming, dependent on third-party tools, and costly.

Method

To address these previous drawbacks, this study introduces a computational framework called DP-Site. The proposed framework concentrates on using a compound of a dual pipeline along with a combination predictor. A deep convolutional neural network for feature extraction and classification is embedded in pipeline 1. In addition, pipeline 2 includes a deep long-short-term memory-based and a random forest classifier for feature extraction and classification. In this investigation, the evolutionary, structure-based, sequence-based, and physicochemical information of proteins is utilized for identifying protein-peptide interaction at the residue level.

Results

The proposed method is evaluated on both the ten-fold cross-validation and independent test sets. The robust and consistent results between cross-validation and independent test sets confirm the ability of the proposed method to predict peptide binding residues in proteins. Moreover, experimental findings demonstrate that DP-Site has significantly outperformed other state-of-the-art sequence-based and structure-based methods. The proposed method achieves a remarkable balance between a specificity of 0.799 and a sensitivity of 0.770, along with the best f-measure of 0.661 and the highest precision of 0.580 using an independent test set.

Conclusions

The outcome of various experiments confirms the proficiency of the proposed method and outperforms state-of-the-art sequence-based and structure-based methods in terms of the mentioned criteria. DP-Site can be accessed at https://github.com/shafiee 95/shima.shafiee.DP-Site.

DP-site:基于双重深度学习的蛋白质-肽相互作用位点预测方法。
背景:蛋白质-肽相互作用预测是一个重要的课题,可用于多种应用,包括各种生物过程、了解药物发现、蛋白质功能异常细胞行为以及治疗疾病。多年来的研究表明,实验方法提高了对这种生物分子相互作用的识别能力。然而,使用这些方法预测蛋白质与肽的相互作用费力、费时、依赖第三方工具且成本高昂:为了解决这些弊端,本研究提出了一种名为 DP-Site 的计算框架。所提出的框架集中使用了双管道化合物和组合预测器。管道 1 中嵌入了用于特征提取和分类的深度卷积神经网络。此外,管道 2 还包括用于特征提取和分类的基于深度长短期记忆的分类器和随机森林分类器。在这项研究中,利用蛋白质的进化信息、结构信息、序列信息和理化信息,在残基水平上识别蛋白质与肽的相互作用:结果:在十倍交叉验证和独立测试集上对所提出的方法进行了评估。交叉验证和独立测试集之间稳健一致的结果证实了所提出的方法能够预测蛋白质中的肽结合残基。此外,实验结果表明,DP-Site 的性能明显优于其他基于序列和结构的先进方法。所提出的方法在特异性(0.799)和灵敏度(0.770)之间取得了显著的平衡,在使用独立测试集时,其最佳 f 测量值为 0.661,最高精确度为 0.580:各种实验结果证实了所提方法的准确性,在上述标准方面优于最先进的基于序列和基于结构的方法。DP-Site网址:https://github.com/shafiee 95/shima.shafiee.DP-Site。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Methods
Methods 生物-生化研究方法
CiteScore
9.80
自引率
2.10%
发文量
222
审稿时长
11.3 weeks
期刊介绍: Methods focuses on rapidly developing techniques in the experimental biological and medical sciences. Each topical issue, organized by a guest editor who is an expert in the area covered, consists solely of invited quality articles by specialist authors, many of them reviews. Issues are devoted to specific technical approaches with emphasis on clear detailed descriptions of protocols that allow them to be reproduced easily. The background information provided enables researchers to understand the principles underlying the methods; other helpful sections include comparisons of alternative methods giving the advantages and disadvantages of particular methods, guidance on avoiding potential pitfalls, and suggestions for troubleshooting.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信