F. Mokdad, D. Bouchaffra, Nabil Zerrouki, A. Touazi
{"title":"Determination of an optimal feature selection method based on maximum Shapley value","authors":"F. Mokdad, D. Bouchaffra, Nabil Zerrouki, A. Touazi","doi":"10.1109/ISDA.2015.7489211","DOIUrl":null,"url":null,"abstract":"We propose a novel feature selection methodology based on game theory. In this context, the players are the various feature selection methods and the characteristic function (payoff) represents the feature ranking agreement within a coalition of players. The Shapley value assigned to each feature selection method is computed and ranked from higher to lower. The best feature selection method is identified as the one having the highest Shapley value. Finally, we have performed a score fusion scheme using the Borda Count (BC) consensus function as a benchmark to the maximum-Shapley value proposed approach. In order to validate the results obtained experimentally, we have performed a classification using a set of UCI and Statlog datasets by invoking an SVM classifier. Experimental results demonstrate the efficiency of the proposed methodology compared to some state-of-the-art approaches.","PeriodicalId":196743,"journal":{"name":"2015 15th International Conference on Intelligent Systems Design and Applications (ISDA)","volume":"11 18","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 15th International Conference on Intelligent Systems Design and Applications (ISDA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISDA.2015.7489211","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
We propose a novel feature selection methodology based on game theory. In this context, the players are the various feature selection methods and the characteristic function (payoff) represents the feature ranking agreement within a coalition of players. The Shapley value assigned to each feature selection method is computed and ranked from higher to lower. The best feature selection method is identified as the one having the highest Shapley value. Finally, we have performed a score fusion scheme using the Borda Count (BC) consensus function as a benchmark to the maximum-Shapley value proposed approach. In order to validate the results obtained experimentally, we have performed a classification using a set of UCI and Statlog datasets by invoking an SVM classifier. Experimental results demonstrate the efficiency of the proposed methodology compared to some state-of-the-art approaches.