{"title":"Feature subset selection using Information Energy and correlation coefficients of hesitant fuzzy sets","authors":"M. K. Ebrahimpour, M. Eftekhari","doi":"10.1109/IKT.2015.7288746","DOIUrl":null,"url":null,"abstract":"In this paper, a novel feature selection algorithm based on hesitant fuzzy sets (HFS) is proposed. For each feature two HFSs are defined.For generating the first HFS for each defining feature, the opinions of three different ranking algorithms are considered. For second HFS for each feature the opinions of three different proximity measures are considered. The Information Energy (IE) of the first HFS for each feature is considered as the relevancy measure of the feature to the class labels. Then the hesitant correlation coefficient matrix for features is calculated based on the second HFSs. After that the average of hesitant correlation coefficients is considered as the relevancy measure of selected features. By combining hesitant based relevancy and redundancy measures, a new feature selection merit is proposed. The proposed merit potentially is able to consider both the maximum relevancy and the minimum redundancy of selected features. The efficiency of this approach is proved through 9 UCI repository datasets. The approach demonstrates a significant performance in both number of selected features and classification accuracy by four different classifiers.","PeriodicalId":338953,"journal":{"name":"2015 7th Conference on Information and Knowledge Technology (IKT)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 7th Conference on Information and Knowledge Technology (IKT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IKT.2015.7288746","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
In this paper, a novel feature selection algorithm based on hesitant fuzzy sets (HFS) is proposed. For each feature two HFSs are defined.For generating the first HFS for each defining feature, the opinions of three different ranking algorithms are considered. For second HFS for each feature the opinions of three different proximity measures are considered. The Information Energy (IE) of the first HFS for each feature is considered as the relevancy measure of the feature to the class labels. Then the hesitant correlation coefficient matrix for features is calculated based on the second HFSs. After that the average of hesitant correlation coefficients is considered as the relevancy measure of selected features. By combining hesitant based relevancy and redundancy measures, a new feature selection merit is proposed. The proposed merit potentially is able to consider both the maximum relevancy and the minimum redundancy of selected features. The efficiency of this approach is proved through 9 UCI repository datasets. The approach demonstrates a significant performance in both number of selected features and classification accuracy by four different classifiers.