{"title":"基于k -收敛近邻和超球支持向量机的混合文本分类方法","authors":"Y. H. Chen, Y. F. Zheng, J. Pan, N. Yang","doi":"10.1109/ITA.2013.120","DOIUrl":null,"url":null,"abstract":"Our work implements a novel text classifier by combining k congener nearest neighbors-Support Vector Machine(KCNN-SVM) with hyper sphere Support Vector Machine(hyper sphere-SVM) training algorithm. Hyper plane Support Vector Machine has been widely used to divide the samples into two equal categories. However, the hyper sphere Support Vector Machine can not only separate the samples, but also divide them into two different parts. Since the probability inside and outside the hyper sphere is not same, hyper sphere-SVM is helpful to the classification when the datasets are imbalanced that we can control the radius of hyper sphere to get higher accuracy. The KCNN-SVM algorithm distinguishes a sample with its nearest neighbor's category label as well as the average distance between it and its k nearest same kind of neighbors which can enhance the accuracy when the samples are chaotic imbalanced. In this paper, we propose the hyper sphere-KCNN-SVM(HS-KCNN-SVM) hybrid approach which can validly improve the classification accuracy especially for those chaotic imbalanced samples.","PeriodicalId":285687,"journal":{"name":"2013 International Conference on Information Technology and Applications","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"A Hybrid Text Classification Method Based on K-Congener-Nearest-Neighbors and Hypersphere Support Vector Machine\",\"authors\":\"Y. H. Chen, Y. F. Zheng, J. Pan, N. Yang\",\"doi\":\"10.1109/ITA.2013.120\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Our work implements a novel text classifier by combining k congener nearest neighbors-Support Vector Machine(KCNN-SVM) with hyper sphere Support Vector Machine(hyper sphere-SVM) training algorithm. Hyper plane Support Vector Machine has been widely used to divide the samples into two equal categories. However, the hyper sphere Support Vector Machine can not only separate the samples, but also divide them into two different parts. Since the probability inside and outside the hyper sphere is not same, hyper sphere-SVM is helpful to the classification when the datasets are imbalanced that we can control the radius of hyper sphere to get higher accuracy. The KCNN-SVM algorithm distinguishes a sample with its nearest neighbor's category label as well as the average distance between it and its k nearest same kind of neighbors which can enhance the accuracy when the samples are chaotic imbalanced. In this paper, we propose the hyper sphere-KCNN-SVM(HS-KCNN-SVM) hybrid approach which can validly improve the classification accuracy especially for those chaotic imbalanced samples.\",\"PeriodicalId\":285687,\"journal\":{\"name\":\"2013 International Conference on Information Technology and Applications\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 International Conference on Information Technology and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITA.2013.120\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Information Technology and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITA.2013.120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Hybrid Text Classification Method Based on K-Congener-Nearest-Neighbors and Hypersphere Support Vector Machine
Our work implements a novel text classifier by combining k congener nearest neighbors-Support Vector Machine(KCNN-SVM) with hyper sphere Support Vector Machine(hyper sphere-SVM) training algorithm. Hyper plane Support Vector Machine has been widely used to divide the samples into two equal categories. However, the hyper sphere Support Vector Machine can not only separate the samples, but also divide them into two different parts. Since the probability inside and outside the hyper sphere is not same, hyper sphere-SVM is helpful to the classification when the datasets are imbalanced that we can control the radius of hyper sphere to get higher accuracy. The KCNN-SVM algorithm distinguishes a sample with its nearest neighbor's category label as well as the average distance between it and its k nearest same kind of neighbors which can enhance the accuracy when the samples are chaotic imbalanced. In this paper, we propose the hyper sphere-KCNN-SVM(HS-KCNN-SVM) hybrid approach which can validly improve the classification accuracy especially for those chaotic imbalanced samples.