A Bayesian approach for estimating protein–protein interactions by integrating structural and non-structural biological data†

IF 3.743 Q2 Biochemistry, Genetics and Molecular Biology
Hafeez Ur Rehman, Inam Bari, Anwar Ali and Haroon Mahmood
{"title":"A Bayesian approach for estimating protein–protein interactions by integrating structural and non-structural biological data†","authors":"Hafeez Ur Rehman, Inam Bari, Anwar Ali and Haroon Mahmood","doi":"10.1039/C7MB00484B","DOIUrl":null,"url":null,"abstract":"<p >Accurate elucidation of genome wide protein–protein interactions is crucial for understanding the regulatory processes of the cell. High-throughput techniques, such as the yeast-2-hybrid (Y2H) assay, co-immunoprecipitation (co-IP), mass spectrometric (MS) protein complex identification, affinity purification (AP) <em>etc.</em>, are generally relied upon to determine protein interactions. Unfortunately, each type of method is inherently subject to different types of noise and results in false positive interactions. On the other hand, precise understanding of proteins, especially knowledge of their functional associations is necessary for understanding how complex molecular machines function. To solve this problem, computational techniques are generally relied upon to precisely predict protein interactions. In this work, we present a novel method that combines structural and non-structural biological data to precisely predict protein interactions. The conceptual novelty of our approach lies in identifying and precisely associating biological information that provides substantial interaction clues. Our model combines structural and non-structural information using Bayesian statistics to calculate the likelihood of each interaction. The proposed model is tested on <em>Saccharomyces cerevisiae</em>'s interactions extracted from the <em>DIP</em> and <em>IntAct</em> databases and provides substantial improvements in terms of accuracy, precision, recall and F1 score, as compared with the most widely used related state-of-the-art techniques.</p>","PeriodicalId":90,"journal":{"name":"Molecular BioSystems","volume":" 12","pages":" 2592-2602"},"PeriodicalIF":3.7430,"publicationDate":"2017-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1039/C7MB00484B","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular BioSystems","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2017/mb/c7mb00484b","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate elucidation of genome wide protein–protein interactions is crucial for understanding the regulatory processes of the cell. High-throughput techniques, such as the yeast-2-hybrid (Y2H) assay, co-immunoprecipitation (co-IP), mass spectrometric (MS) protein complex identification, affinity purification (AP) etc., are generally relied upon to determine protein interactions. Unfortunately, each type of method is inherently subject to different types of noise and results in false positive interactions. On the other hand, precise understanding of proteins, especially knowledge of their functional associations is necessary for understanding how complex molecular machines function. To solve this problem, computational techniques are generally relied upon to precisely predict protein interactions. In this work, we present a novel method that combines structural and non-structural biological data to precisely predict protein interactions. The conceptual novelty of our approach lies in identifying and precisely associating biological information that provides substantial interaction clues. Our model combines structural and non-structural information using Bayesian statistics to calculate the likelihood of each interaction. The proposed model is tested on Saccharomyces cerevisiae's interactions extracted from the DIP and IntAct databases and provides substantial improvements in terms of accuracy, precision, recall and F1 score, as compared with the most widely used related state-of-the-art techniques.

Abstract Image

结合结构和非结构生物学数据估计蛋白质相互作用的贝叶斯方法
准确阐明基因组范围内蛋白质-蛋白质相互作用对于理解细胞的调控过程至关重要。高通量技术,如酵母-2-杂交(Y2H)测定,共免疫沉淀(co-IP),质谱(MS)蛋白质复合物鉴定,亲和纯化(AP)等,通常依赖于确定蛋白质相互作用。不幸的是,每种方法本身都受到不同类型的噪声的影响,并导致假阳性的相互作用。另一方面,对蛋白质的精确理解,特别是对它们的功能关联的了解,对于理解复杂的分子机器如何运作是必要的。为了解决这个问题,计算技术通常依赖于精确预测蛋白质相互作用。在这项工作中,我们提出了一种结合结构和非结构生物学数据来精确预测蛋白质相互作用的新方法。我们的方法在概念上的新颖之处在于识别和精确地关联提供实质性相互作用线索的生物信息。我们的模型结合了结构和非结构信息,使用贝叶斯统计来计算每个相互作用的可能性。通过对从DIP和完好数据库中提取的酿酒酵母相互作用进行测试,与目前最广泛使用的相关技术相比,该模型在准确性、精密度、召回率和F1分数方面有了显著提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Molecular BioSystems
Molecular BioSystems 生物-生化与分子生物学
CiteScore
2.94
自引率
0.00%
发文量
0
审稿时长
2.6 months
期刊介绍: Molecular Omics publishes molecular level experimental and bioinformatics research in the -omics sciences, including genomics, proteomics, transcriptomics and metabolomics. We will also welcome multidisciplinary papers presenting studies combining different types of omics, or the interface of omics and other fields such as systems biology or chemical biology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信