Gulsah Altun, Hae-Jin Hu, D. Brinza, R. Harrison, A. Zelikovsky, Yi Pan
{"title":"基于混合支持向量机核的蛋白质二级结构预测","authors":"Gulsah Altun, Hae-Jin Hu, D. Brinza, R. Harrison, A. Zelikovsky, Yi Pan","doi":"10.1109/GRC.2006.1635912","DOIUrl":null,"url":null,"abstract":"The Support Vector Machine is a powerful methodology for solving problems in nonlinear classification, function estimation and density estimation. When data are not linearly separable, data are mapped to a high dimensional future space using a nonlinear function which can be computed through a positive definite kernel in the input space. Using a suitable kernel function for a particular problem and input data can change the prediction results remarkably and improve the accuracy. The goal of this work is to find the best kernel functions that can be applied to different types of data and problems. In this paper, we propose two hybrid kernels SVMSM+RBF and SVMEDIT+RBF. SVMSM+RBF is designed by combining the best performed RBF kernel with substitution matrix (SM) based kernel developed by Vanschoenwinkel and Manderick. SVMEDIT+RBF kernel combines the RBF kernel and the edit kernel devised by Li and Jiang. We tested these two hybrid kernels on one of the widely studied problems in bioinformatics which is the protein secondary structure prediction problem. For the protein secondary structure problem, our results were 91% accuracy on H/E binary classifier.","PeriodicalId":400997,"journal":{"name":"2006 IEEE International Conference on Granular Computing","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hybrid SVM kernels for protein secondary structure prediction\",\"authors\":\"Gulsah Altun, Hae-Jin Hu, D. Brinza, R. Harrison, A. Zelikovsky, Yi Pan\",\"doi\":\"10.1109/GRC.2006.1635912\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Support Vector Machine is a powerful methodology for solving problems in nonlinear classification, function estimation and density estimation. When data are not linearly separable, data are mapped to a high dimensional future space using a nonlinear function which can be computed through a positive definite kernel in the input space. Using a suitable kernel function for a particular problem and input data can change the prediction results remarkably and improve the accuracy. The goal of this work is to find the best kernel functions that can be applied to different types of data and problems. In this paper, we propose two hybrid kernels SVMSM+RBF and SVMEDIT+RBF. SVMSM+RBF is designed by combining the best performed RBF kernel with substitution matrix (SM) based kernel developed by Vanschoenwinkel and Manderick. SVMEDIT+RBF kernel combines the RBF kernel and the edit kernel devised by Li and Jiang. We tested these two hybrid kernels on one of the widely studied problems in bioinformatics which is the protein secondary structure prediction problem. For the protein secondary structure problem, our results were 91% accuracy on H/E binary classifier.\",\"PeriodicalId\":400997,\"journal\":{\"name\":\"2006 IEEE International Conference on Granular Computing\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-05-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2006 IEEE International Conference on Granular Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/GRC.2006.1635912\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 IEEE International Conference on Granular Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GRC.2006.1635912","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hybrid SVM kernels for protein secondary structure prediction
The Support Vector Machine is a powerful methodology for solving problems in nonlinear classification, function estimation and density estimation. When data are not linearly separable, data are mapped to a high dimensional future space using a nonlinear function which can be computed through a positive definite kernel in the input space. Using a suitable kernel function for a particular problem and input data can change the prediction results remarkably and improve the accuracy. The goal of this work is to find the best kernel functions that can be applied to different types of data and problems. In this paper, we propose two hybrid kernels SVMSM+RBF and SVMEDIT+RBF. SVMSM+RBF is designed by combining the best performed RBF kernel with substitution matrix (SM) based kernel developed by Vanschoenwinkel and Manderick. SVMEDIT+RBF kernel combines the RBF kernel and the edit kernel devised by Li and Jiang. We tested these two hybrid kernels on one of the widely studied problems in bioinformatics which is the protein secondary structure prediction problem. For the protein secondary structure problem, our results were 91% accuracy on H/E binary classifier.