Hafeez Ur Rehman, Inam Bari, Anwar Ali and Haroon Mahmood
{"title":"A Bayesian approach for estimating protein–protein interactions by integrating structural and non-structural biological data†","authors":"Hafeez Ur Rehman, Inam Bari, Anwar Ali and Haroon Mahmood","doi":"10.1039/C7MB00484B","DOIUrl":null,"url":null,"abstract":"<p >Accurate elucidation of genome wide protein–protein interactions is crucial for understanding the regulatory processes of the cell. High-throughput techniques, such as the yeast-2-hybrid (Y2H) assay, co-immunoprecipitation (co-IP), mass spectrometric (MS) protein complex identification, affinity purification (AP) <em>etc.</em>, are generally relied upon to determine protein interactions. Unfortunately, each type of method is inherently subject to different types of noise and results in false positive interactions. On the other hand, precise understanding of proteins, especially knowledge of their functional associations is necessary for understanding how complex molecular machines function. To solve this problem, computational techniques are generally relied upon to precisely predict protein interactions. In this work, we present a novel method that combines structural and non-structural biological data to precisely predict protein interactions. The conceptual novelty of our approach lies in identifying and precisely associating biological information that provides substantial interaction clues. Our model combines structural and non-structural information using Bayesian statistics to calculate the likelihood of each interaction. The proposed model is tested on <em>Saccharomyces cerevisiae</em>'s interactions extracted from the <em>DIP</em> and <em>IntAct</em> databases and provides substantial improvements in terms of accuracy, precision, recall and F1 score, as compared with the most widely used related state-of-the-art techniques.</p>","PeriodicalId":90,"journal":{"name":"Molecular BioSystems","volume":" 12","pages":" 2592-2602"},"PeriodicalIF":3.7430,"publicationDate":"2017-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1039/C7MB00484B","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular BioSystems","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2017/mb/c7mb00484b","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate elucidation of genome wide protein–protein interactions is crucial for understanding the regulatory processes of the cell. High-throughput techniques, such as the yeast-2-hybrid (Y2H) assay, co-immunoprecipitation (co-IP), mass spectrometric (MS) protein complex identification, affinity purification (AP) etc., are generally relied upon to determine protein interactions. Unfortunately, each type of method is inherently subject to different types of noise and results in false positive interactions. On the other hand, precise understanding of proteins, especially knowledge of their functional associations is necessary for understanding how complex molecular machines function. To solve this problem, computational techniques are generally relied upon to precisely predict protein interactions. In this work, we present a novel method that combines structural and non-structural biological data to precisely predict protein interactions. The conceptual novelty of our approach lies in identifying and precisely associating biological information that provides substantial interaction clues. Our model combines structural and non-structural information using Bayesian statistics to calculate the likelihood of each interaction. The proposed model is tested on Saccharomyces cerevisiae's interactions extracted from the DIP and IntAct databases and provides substantial improvements in terms of accuracy, precision, recall and F1 score, as compared with the most widely used related state-of-the-art techniques.
期刊介绍:
Molecular Omics publishes molecular level experimental and bioinformatics research in the -omics sciences, including genomics, proteomics, transcriptomics and metabolomics. We will also welcome multidisciplinary papers presenting studies combining different types of omics, or the interface of omics and other fields such as systems biology or chemical biology.