{"title":"Feature selection based on network maximal correlation","authors":"Xiaokang Yang, Qiang Wang, Yi Wang","doi":"10.1109/WPMC.2017.8301854","DOIUrl":null,"url":null,"abstract":"Feature selection can effectively increase the accuracy of machine learning and improve the efficiency of the algorithm. Therefore, it has emerged as a critical technology related to data mining, machine learning, and has shown great impacts in many applications, including biomedical, financial and communication. However, the increase in data dimension poses a serious challenge to many existing feature selection methods in terms of effectiveness and efficiency. Hirschfeld-Gebelein-Renyi maximal correlation is a effective measure of the correlation between variables. In this paper, with this measure, we proposed a improved Network Maximal Correlation (NMC) model. It can quickly and effectively calculate the statistical dependence between feature set and label variable. Further, based on the Recursive Feature Elimination (RFE) algorithm, a new NMC-RFE feature selection method is Further proposed. The experimental results show that the proposed method can obtain much better feature subsets from high dimensional data sets with faster calculation speed and better accuracy.","PeriodicalId":239243,"journal":{"name":"2017 20th International Symposium on Wireless Personal Multimedia Communications (WPMC)","volume":"61 11","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 20th International Symposium on Wireless Personal Multimedia Communications (WPMC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WPMC.2017.8301854","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Feature selection can effectively increase the accuracy of machine learning and improve the efficiency of the algorithm. Therefore, it has emerged as a critical technology related to data mining, machine learning, and has shown great impacts in many applications, including biomedical, financial and communication. However, the increase in data dimension poses a serious challenge to many existing feature selection methods in terms of effectiveness and efficiency. Hirschfeld-Gebelein-Renyi maximal correlation is a effective measure of the correlation between variables. In this paper, with this measure, we proposed a improved Network Maximal Correlation (NMC) model. It can quickly and effectively calculate the statistical dependence between feature set and label variable. Further, based on the Recursive Feature Elimination (RFE) algorithm, a new NMC-RFE feature selection method is Further proposed. The experimental results show that the proposed method can obtain much better feature subsets from high dimensional data sets with faster calculation speed and better accuracy.