{"title":"Prediction of Protein Sub-Cellular Localization through Weighted Combination of Classifiers","authors":"M. Fayyaz, A. Mujahid, Asifullah Khan, A. Bangash","doi":"10.1109/ICEE.2007.4287289","DOIUrl":null,"url":null,"abstract":"Prediction of subcellular localization of proteins is an important step in genome annotation and in search for achieving novel drug targets. Conducting experiments for extracting information about protein sub cellular localization is both time consuming and costly effort. Machine learning approaches, especially, ensemble of classifiers, providing efficient and reliable mechanism of computational prediction are thus highly desired In this context, we propose a modification to the approach proposed in [K. C. Chou, J. Cell. Biol. 99(2006)517]. We have used a weighted polling method to fuse the output of individual covariant discriminant classifiers. The individual classifiers are trained on features based on pseudo-amino add composition of proteins. Three methods of verifications; re-substitution, Jackknife, and independent data set tests have been employed and give over all accuracies of 87.13%, 71.15% and 74.90% respectively. The predicted accuracies are higher than that of the existing schemes.","PeriodicalId":291800,"journal":{"name":"2007 International Conference on Electrical Engineering","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 International Conference on Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEE.2007.4287289","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Prediction of subcellular localization of proteins is an important step in genome annotation and in search for achieving novel drug targets. Conducting experiments for extracting information about protein sub cellular localization is both time consuming and costly effort. Machine learning approaches, especially, ensemble of classifiers, providing efficient and reliable mechanism of computational prediction are thus highly desired In this context, we propose a modification to the approach proposed in [K. C. Chou, J. Cell. Biol. 99(2006)517]. We have used a weighted polling method to fuse the output of individual covariant discriminant classifiers. The individual classifiers are trained on features based on pseudo-amino add composition of proteins. Three methods of verifications; re-substitution, Jackknife, and independent data set tests have been employed and give over all accuracies of 87.13%, 71.15% and 74.90% respectively. The predicted accuracies are higher than that of the existing schemes.