{"title":"Method for Essential Protein Prediction Based on the Naïve Bayesian Classifier and Bioinformation Fusion","authors":"Jingjuan Tan, Linai Kuang, Lei Wang","doi":"10.1145/3571532.3571533","DOIUrl":null,"url":null,"abstract":"In this article, a novel approach called EPNBC has been proposed by combining the biological information with the naïve Bayesian classifier and PageRank algorithm to predict potential essential proteins. In EPNBC, the naïve Bayesian classifier is used to process the original PPI network, and a new protein interaction network with more interaction relationships is obtained. Then, two similarity matrices were obtained by using Gaussian interaction profile kernel similarity based on the protein interaction relationship and gene expression data, and a weighted protein interaction network was obtained. The improved PageRank algorithm was used to score the nodes in the network and output the protein scores in descending order. Experimental results showed that EPNBC was superior to dozens of other methods in identifying essential proteins.","PeriodicalId":355088,"journal":{"name":"Proceedings of the 2022 11th International Conference on Bioinformatics and Biomedical Science","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 11th International Conference on Bioinformatics and Biomedical Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3571532.3571533","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this article, a novel approach called EPNBC has been proposed by combining the biological information with the naïve Bayesian classifier and PageRank algorithm to predict potential essential proteins. In EPNBC, the naïve Bayesian classifier is used to process the original PPI network, and a new protein interaction network with more interaction relationships is obtained. Then, two similarity matrices were obtained by using Gaussian interaction profile kernel similarity based on the protein interaction relationship and gene expression data, and a weighted protein interaction network was obtained. The improved PageRank algorithm was used to score the nodes in the network and output the protein scores in descending order. Experimental results showed that EPNBC was superior to dozens of other methods in identifying essential proteins.