A Bayesian approach for estimating protein–protein interactions by integrating structural and non-structural biological data†

IF 3.743 Q2 Biochemistry, Genetics and Molecular Biology

Molecular BioSystems Pub Date : 2017-10-05 DOI:10.1039/C7MB00484B

Hafeez Ur Rehman, Inam Bari, Anwar Ali and Haroon Mahmood

{"title":"A Bayesian approach for estimating protein–protein interactions by integrating structural and non-structural biological data†","authors":"Hafeez Ur Rehman, Inam Bari, Anwar Ali and Haroon Mahmood","doi":"10.1039/C7MB00484B","DOIUrl":null,"url":null,"abstract":"Accurate elucidation of genome wide protein–protein interactions is crucial for understanding the regulatory processes of the cell. High-throughput techniques, such as the yeast-2-hybrid (Y2H) assay, co-immunoprecipitation (co-IP), mass spectrometric (MS) protein complex identification, affinity purification (AP) etc., are generally relied upon to determine protein interactions. Unfortunately, each type of method is inherently subject to different types of noise and results in false positive interactions. On the other hand, precise understanding of proteins, especially knowledge of their functional associations is necessary for understanding how complex molecular machines function. To solve this problem, computational techniques are generally relied upon to precisely predict protein interactions. In this work, we present a novel method that combines structural and non-structural biological data to precisely predict protein interactions. The conceptual novelty of our approach lies in identifying and precisely associating biological information that provides substantial interaction clues. Our model combines structural and non-structural information using Bayesian statistics to calculate the likelihood of each interaction. The proposed model is tested on Saccharomyces cerevisiae's interactions extracted from the DIP and IntAct databases and provides substantial improvements in terms of accuracy, precision, recall and F1 score, as compared with the most widely used related state-of-the-art techniques.","PeriodicalId":90,"journal":{"name":"Molecular BioSystems","volume":" 12","pages":" 2592-2602"},"PeriodicalIF":3.7430,"publicationDate":"2017-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1039/C7MB00484B","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular BioSystems","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2017/mb/c7mb00484b","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate elucidation of genome wide protein–protein interactions is crucial for understanding the regulatory processes of the cell. High-throughput techniques, such as the yeast-2-hybrid (Y2H) assay, co-immunoprecipitation (co-IP), mass spectrometric (MS) protein complex identification, affinity purification (AP) etc., are generally relied upon to determine protein interactions. Unfortunately, each type of method is inherently subject to different types of noise and results in false positive interactions. On the other hand, precise understanding of proteins, especially knowledge of their functional associations is necessary for understanding how complex molecular machines function. To solve this problem, computational techniques are generally relied upon to precisely predict protein interactions. In this work, we present a novel method that combines structural and non-structural biological data to precisely predict protein interactions. The conceptual novelty of our approach lies in identifying and precisely associating biological information that provides substantial interaction clues. Our model combines structural and non-structural information using Bayesian statistics to calculate the likelihood of each interaction. The proposed model is tested on Saccharomyces cerevisiae's interactions extracted from the DIP and IntAct databases and provides substantial improvements in terms of accuracy, precision, recall and F1 score, as compared with the most widely used related state-of-the-art techniques.

Abstract Image

查看原文本刊更多论文

结合结构和非结构生物学数据估计蛋白质相互作用的贝叶斯方法

准确阐明基因组范围内蛋白质-蛋白质相互作用对于理解细胞的调控过程至关重要。高通量技术，如酵母-2-杂交(Y2H)测定，共免疫沉淀(co-IP)，质谱(MS)蛋白质复合物鉴定，亲和纯化(AP)等，通常依赖于确定蛋白质相互作用。不幸的是，每种方法本身都受到不同类型的噪声的影响，并导致假阳性的相互作用。另一方面，对蛋白质的精确理解，特别是对它们的功能关联的了解，对于理解复杂的分子机器如何运作是必要的。为了解决这个问题，计算技术通常依赖于精确预测蛋白质相互作用。在这项工作中，我们提出了一种结合结构和非结构生物学数据来精确预测蛋白质相互作用的新方法。我们的方法在概念上的新颖之处在于识别和精确地关联提供实质性相互作用线索的生物信息。我们的模型结合了结构和非结构信息，使用贝叶斯统计来计算每个相互作用的可能性。通过对从DIP和完好数据库中提取的酿酒酵母相互作用进行测试，与目前最广泛使用的相关技术相比，该模型在准确性、精密度、召回率和F1分数方面有了显著提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Molecular BioSystems 生物-生化与分子生物学

CiteScore

2.94

自引率

0.00%

发文量

审稿时长

2.6 months

期刊介绍： Molecular Omics publishes molecular level experimental and bioinformatics research in the -omics sciences, including genomics, proteomics, transcriptomics and metabolomics. We will also welcome multidisciplinary papers presenting studies combining different types of omics, or the interface of omics and other fields such as systems biology or chemical biology.