Tomoki Ishino, I. Nishikawa, Y. Tohsato, S. Fukuchi, K. Nishikawa
{"title":"Prediction of protein phosphorylation sites by support vector machines","authors":"Tomoki Ishino, I. Nishikawa, Y. Tohsato, S. Fukuchi, K. Nishikawa","doi":"10.1109/BMEI.2013.6747053","DOIUrl":null,"url":null,"abstract":"Protein phosphorylation is one of the most important post-translational modifications, and revealing its mechanism is an important research topic. In this paper, phosphorylation sites in human proteins are predicted by support vector machine (SVM). First, two types of SVMs are constructed, each for phosphorylation sites in domain and in intrinsically disordered region (IDR). In domain, wide range of information of amino acid sequence is found effective, while it is not effective in IDR. As phosphorylation is abundant in IDR, the second part of the study focuses on the prediction of phosphorylation sites in IDR, especially, the phosphorylation sites with any known function. Then, it is found that the evolutionary conservation of each site is different in IDR, and multiple ortholog sequences which contain the conservation information is effective for the prediction compared with single sequence information.","PeriodicalId":163211,"journal":{"name":"2013 6th International Conference on Biomedical Engineering and Informatics","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 6th International Conference on Biomedical Engineering and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BMEI.2013.6747053","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Protein phosphorylation is one of the most important post-translational modifications, and revealing its mechanism is an important research topic. In this paper, phosphorylation sites in human proteins are predicted by support vector machine (SVM). First, two types of SVMs are constructed, each for phosphorylation sites in domain and in intrinsically disordered region (IDR). In domain, wide range of information of amino acid sequence is found effective, while it is not effective in IDR. As phosphorylation is abundant in IDR, the second part of the study focuses on the prediction of phosphorylation sites in IDR, especially, the phosphorylation sites with any known function. Then, it is found that the evolutionary conservation of each site is different in IDR, and multiple ortholog sequences which contain the conservation information is effective for the prediction compared with single sequence information.