Nazar Zaki, A. A. Dhaheri, Kalthoom A Alawar, Saad Harous
{"title":"Protein-protein Interaction Prediction using Arabic semantic analysis","authors":"Nazar Zaki, A. A. Dhaheri, Kalthoom A Alawar, Saad Harous","doi":"10.1109/INNOVATIONS.2013.6544426","DOIUrl":null,"url":null,"abstract":"Scientists are still far from unraveling the molecular mechanisms of most relevant diseases such as cancer and diabetes. A better understanding of protein interactions could provide a clue about the molecular mechanism of the processes leading to such diseases. Novel methodologies to understand diseases through their primary protein interactions are highly desired. In this paper we propose a simple method to predict protein-protein interaction based on Arabic semantic analysis model. The Arabic semantic model is an effective feature extraction method based on natural language processing. Two protein sequences may interact if they contain similar or related Arabic words. The semantic meaning will most likely provide us with a clue on how or why two proteins interact. To evaluate the ability of the proposed method to distinguish between “interacted” and “non-interacted” proteins pairs, we applied it on a dataset of 200 protein pairs from the available yeast saccharomyces cerevisiae protein interaction. The proposed method managed to get 100% sensitivity, 0.84% sensitivity and 92% overall accuracy. The method also showed moderate improvement over the existing well-known methods for PPI prediction such as PPI-PS and PIPE.","PeriodicalId":438270,"journal":{"name":"2013 9th International Conference on Innovations in Information Technology (IIT)","volume":"96 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 9th International Conference on Innovations in Information Technology (IIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INNOVATIONS.2013.6544426","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Scientists are still far from unraveling the molecular mechanisms of most relevant diseases such as cancer and diabetes. A better understanding of protein interactions could provide a clue about the molecular mechanism of the processes leading to such diseases. Novel methodologies to understand diseases through their primary protein interactions are highly desired. In this paper we propose a simple method to predict protein-protein interaction based on Arabic semantic analysis model. The Arabic semantic model is an effective feature extraction method based on natural language processing. Two protein sequences may interact if they contain similar or related Arabic words. The semantic meaning will most likely provide us with a clue on how or why two proteins interact. To evaluate the ability of the proposed method to distinguish between “interacted” and “non-interacted” proteins pairs, we applied it on a dataset of 200 protein pairs from the available yeast saccharomyces cerevisiae protein interaction. The proposed method managed to get 100% sensitivity, 0.84% sensitivity and 92% overall accuracy. The method also showed moderate improvement over the existing well-known methods for PPI prediction such as PPI-PS and PIPE.