{"title":"一种结合支持向量机和马尔可夫模型的剪接位置识别方法","authors":"Elham Pashaei, Alper Yilmaz, N. Aydin","doi":"10.1109/ICCKE.2016.7802140","DOIUrl":null,"url":null,"abstract":"Due to an exponential increase in biological sequence data, gene detection has become one of the challenging tasks in computational biology. Splice site prediction is an essential part of the gene detection. Thus, it has great significance to develop efficient methods for accurately identifying splice sites. This paper introduces a novel algorithm to predict the splice sites based on support vector machine (SVM) and a new type of Markov chain model, namely DMM2. The proposed method shows great improvement over most of the current state of art methods, including MM1-SVM, Reduced MM1-SVM, SVM-B, LVMM, MM1-RF, MM2F-SVM, MCM-SVM, DM-SVM and DM2-AdaBoost. The repeated 10-fold cross validation was used to assess the performance of the method on the HS3D dataset. In addition, we applied it to NN269 dataset to examine the stability of the proposed method. The experimental results indicate that the new approach is feasible and efficient.","PeriodicalId":205768,"journal":{"name":"2016 6th International Conference on Computer and Knowledge Engineering (ICCKE)","volume":"162 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"A combined SVM and Markov model approach for splice site identification\",\"authors\":\"Elham Pashaei, Alper Yilmaz, N. Aydin\",\"doi\":\"10.1109/ICCKE.2016.7802140\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Due to an exponential increase in biological sequence data, gene detection has become one of the challenging tasks in computational biology. Splice site prediction is an essential part of the gene detection. Thus, it has great significance to develop efficient methods for accurately identifying splice sites. This paper introduces a novel algorithm to predict the splice sites based on support vector machine (SVM) and a new type of Markov chain model, namely DMM2. The proposed method shows great improvement over most of the current state of art methods, including MM1-SVM, Reduced MM1-SVM, SVM-B, LVMM, MM1-RF, MM2F-SVM, MCM-SVM, DM-SVM and DM2-AdaBoost. The repeated 10-fold cross validation was used to assess the performance of the method on the HS3D dataset. In addition, we applied it to NN269 dataset to examine the stability of the proposed method. The experimental results indicate that the new approach is feasible and efficient.\",\"PeriodicalId\":205768,\"journal\":{\"name\":\"2016 6th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"volume\":\"162 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 6th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCKE.2016.7802140\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 6th International Conference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE.2016.7802140","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A combined SVM and Markov model approach for splice site identification
Due to an exponential increase in biological sequence data, gene detection has become one of the challenging tasks in computational biology. Splice site prediction is an essential part of the gene detection. Thus, it has great significance to develop efficient methods for accurately identifying splice sites. This paper introduces a novel algorithm to predict the splice sites based on support vector machine (SVM) and a new type of Markov chain model, namely DMM2. The proposed method shows great improvement over most of the current state of art methods, including MM1-SVM, Reduced MM1-SVM, SVM-B, LVMM, MM1-RF, MM2F-SVM, MCM-SVM, DM-SVM and DM2-AdaBoost. The repeated 10-fold cross validation was used to assess the performance of the method on the HS3D dataset. In addition, we applied it to NN269 dataset to examine the stability of the proposed method. The experimental results indicate that the new approach is feasible and efficient.