{"title":"基于特征排序、关联分析和混沌二元粒子群优化的特征选择","authors":"Fei Wang, Yi Yang, Xianchao Lv, Jiao Xu, Lian Li","doi":"10.1109/ICSESS.2014.6933569","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a multi-stage feature selection algorithm, which focuses on the reduction of redundant features and the improvement of classification performance using feature ranking (FR), correlation analysis (CA) and chaotic binary particle swarm optimization (CBPSO). In the first stage, with the purpose of selecting the most effective features for classification, FR is introduced to select the top-ranked features according to the classification accuracies. In the second stage, CA is used to measure the correlation among the selected top-ranked features for reducing redundant features. In the third stage, in order to further eliminate redundant features and improve the classification performances, CBPSO is adopted to search the optimal feature subset. Ultimately, feature selection can be completed by using only some top-ranked features with less redundancy for classification. Support vector machine (SVM) with n-fold cross-validation is adopted to assess the classification performances on six datasets in the experiments. Experimental results show that the proposed algorithm can achieve better performance in terms of classification accuracy and the number of features than benchmark algorithms.","PeriodicalId":6473,"journal":{"name":"2014 IEEE 5th International Conference on Software Engineering and Service Science","volume":"50 1","pages":"305-309"},"PeriodicalIF":0.0000,"publicationDate":"2014-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Feature selection using feature ranking, correlation analysis and chaotic binary particle swarm optimization\",\"authors\":\"Fei Wang, Yi Yang, Xianchao Lv, Jiao Xu, Lian Li\",\"doi\":\"10.1109/ICSESS.2014.6933569\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a multi-stage feature selection algorithm, which focuses on the reduction of redundant features and the improvement of classification performance using feature ranking (FR), correlation analysis (CA) and chaotic binary particle swarm optimization (CBPSO). In the first stage, with the purpose of selecting the most effective features for classification, FR is introduced to select the top-ranked features according to the classification accuracies. In the second stage, CA is used to measure the correlation among the selected top-ranked features for reducing redundant features. In the third stage, in order to further eliminate redundant features and improve the classification performances, CBPSO is adopted to search the optimal feature subset. Ultimately, feature selection can be completed by using only some top-ranked features with less redundancy for classification. Support vector machine (SVM) with n-fold cross-validation is adopted to assess the classification performances on six datasets in the experiments. Experimental results show that the proposed algorithm can achieve better performance in terms of classification accuracy and the number of features than benchmark algorithms.\",\"PeriodicalId\":6473,\"journal\":{\"name\":\"2014 IEEE 5th International Conference on Software Engineering and Service Science\",\"volume\":\"50 1\",\"pages\":\"305-309\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 IEEE 5th International Conference on Software Engineering and Service Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSESS.2014.6933569\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE 5th International Conference on Software Engineering and Service Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS.2014.6933569","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Feature selection using feature ranking, correlation analysis and chaotic binary particle swarm optimization
In this paper, we propose a multi-stage feature selection algorithm, which focuses on the reduction of redundant features and the improvement of classification performance using feature ranking (FR), correlation analysis (CA) and chaotic binary particle swarm optimization (CBPSO). In the first stage, with the purpose of selecting the most effective features for classification, FR is introduced to select the top-ranked features according to the classification accuracies. In the second stage, CA is used to measure the correlation among the selected top-ranked features for reducing redundant features. In the third stage, in order to further eliminate redundant features and improve the classification performances, CBPSO is adopted to search the optimal feature subset. Ultimately, feature selection can be completed by using only some top-ranked features with less redundancy for classification. Support vector machine (SVM) with n-fold cross-validation is adopted to assess the classification performances on six datasets in the experiments. Experimental results show that the proposed algorithm can achieve better performance in terms of classification accuracy and the number of features than benchmark algorithms.