{"title":"An improved fuzzy mutual information feature selection for classification systems","authors":"Liwei Wang, Omar A. M. Salem","doi":"10.1109/ICIS.2017.7959980","DOIUrl":null,"url":null,"abstract":"Classification systems are sensitive to input data, especially for datasets with a IoT of undesirable features. Selecting relevant features and avoiding irrelevant or redundant features builds effective systems. Fuzzy Mutual Information measures the relevance and redundancy of features. Although it can deal directly with continuous data without discretization, it still requires more computation and storage space. In this paper, we propose an improved fuzzy mutual information to solve this problem. Furthermore, we integrate it with normalized max-relevance and min-redundancy (mRMR) approach. It does not only select the relevant features but also avoids the redundancies with respect to the domination between them. Our experiment was evaluated according to storage, stability, classification accuracy, and the number of selected features. Based on 12 benchmark datasets, experimental results confirm that our proposed method achieved better results.","PeriodicalId":301467,"journal":{"name":"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIS.2017.7959980","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Classification systems are sensitive to input data, especially for datasets with a IoT of undesirable features. Selecting relevant features and avoiding irrelevant or redundant features builds effective systems. Fuzzy Mutual Information measures the relevance and redundancy of features. Although it can deal directly with continuous data without discretization, it still requires more computation and storage space. In this paper, we propose an improved fuzzy mutual information to solve this problem. Furthermore, we integrate it with normalized max-relevance and min-redundancy (mRMR) approach. It does not only select the relevant features but also avoids the redundancies with respect to the domination between them. Our experiment was evaluated according to storage, stability, classification accuracy, and the number of selected features. Based on 12 benchmark datasets, experimental results confirm that our proposed method achieved better results.