{"title":"基于ReliefF和二进制蜻蜓的高维数据混合特征选择","authors":"Atefe Asadi Karizaki, M. Tavassoli","doi":"10.1109/ICCKE48569.2019.8965106","DOIUrl":null,"url":null,"abstract":"High dimensionality is a common challenge in large datasets. Combination of the filter and wrapper methods is used to select the appropriate set of features in these datasets. The hybrid method is desirable, which uses the advantages of both the methods and covers the disadvantages. In this paper, a hybrid method for feature selection in high dimension data is presented. In proposed algorithm, the ReliefF algorithm is used as a filter method for ranking features. Next, the binary dragonfly algorithm (BDA) is applied as a wrapper method. The BDA algorithm uses the ranked features to find optimal set of features incrementally and iteratively. Minimizing the cross-validation loss and decreasing the number of features is considered to evaluate the solution, hierarchically. The proposed algorithm and other compared algorithms run over 5 datasets and the results indicated that the proposed algorithm not only reduce the dimension of dataset but also improve the performance of classifiers on the test data.","PeriodicalId":6685,"journal":{"name":"2019 9th International Conference on Computer and Knowledge Engineering (ICCKE)","volume":"34 1","pages":"300-304"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A novel hybrid feature selection based on ReliefF and binary dragonfly for high dimensional datasets\",\"authors\":\"Atefe Asadi Karizaki, M. Tavassoli\",\"doi\":\"10.1109/ICCKE48569.2019.8965106\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"High dimensionality is a common challenge in large datasets. Combination of the filter and wrapper methods is used to select the appropriate set of features in these datasets. The hybrid method is desirable, which uses the advantages of both the methods and covers the disadvantages. In this paper, a hybrid method for feature selection in high dimension data is presented. In proposed algorithm, the ReliefF algorithm is used as a filter method for ranking features. Next, the binary dragonfly algorithm (BDA) is applied as a wrapper method. The BDA algorithm uses the ranked features to find optimal set of features incrementally and iteratively. Minimizing the cross-validation loss and decreasing the number of features is considered to evaluate the solution, hierarchically. The proposed algorithm and other compared algorithms run over 5 datasets and the results indicated that the proposed algorithm not only reduce the dimension of dataset but also improve the performance of classifiers on the test data.\",\"PeriodicalId\":6685,\"journal\":{\"name\":\"2019 9th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"volume\":\"34 1\",\"pages\":\"300-304\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 9th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCKE48569.2019.8965106\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 9th International Conference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE48569.2019.8965106","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A novel hybrid feature selection based on ReliefF and binary dragonfly for high dimensional datasets
High dimensionality is a common challenge in large datasets. Combination of the filter and wrapper methods is used to select the appropriate set of features in these datasets. The hybrid method is desirable, which uses the advantages of both the methods and covers the disadvantages. In this paper, a hybrid method for feature selection in high dimension data is presented. In proposed algorithm, the ReliefF algorithm is used as a filter method for ranking features. Next, the binary dragonfly algorithm (BDA) is applied as a wrapper method. The BDA algorithm uses the ranked features to find optimal set of features incrementally and iteratively. Minimizing the cross-validation loss and decreasing the number of features is considered to evaluate the solution, hierarchically. The proposed algorithm and other compared algorithms run over 5 datasets and the results indicated that the proposed algorithm not only reduce the dimension of dataset but also improve the performance of classifiers on the test data.