{"title":"基于粗糙集的特征选择新方法","authors":"Nidhika Yadav, N. Chatterjee","doi":"10.1109/COMPTELIX.2017.8003963","DOIUrl":null,"url":null,"abstract":"Rough Set is a mathematical tool to find patterns hidden in data with uncertainty. A major step for reduction of high dimension data, present in various forms, is selection of appropriate features. In this work we propose a new indiscernibility relation based on clusters, and compare its effectiveness with that of classical Rough Set based indiscernibility. In particular, we study the proposed Rough Set based scheme for feature set reduction. Rough-Cluster (RC) based approximate algorithms are proposed. The major advantage of these algorithms over the classical method is that they work well even without data discretization. The accuracy, measured in terms of the proportion of correctly classified data samples, is obtained on various standard data sets. The results are found to be on par with those obtained through classical Rough Set based technique for the problem of feature selection.","PeriodicalId":6917,"journal":{"name":"2017 International Conference on Computer, Communications and Electronics (Comptelix)","volume":"18 1","pages":"195-199"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A novel approach for feature selection using Rough Sets\",\"authors\":\"Nidhika Yadav, N. Chatterjee\",\"doi\":\"10.1109/COMPTELIX.2017.8003963\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Rough Set is a mathematical tool to find patterns hidden in data with uncertainty. A major step for reduction of high dimension data, present in various forms, is selection of appropriate features. In this work we propose a new indiscernibility relation based on clusters, and compare its effectiveness with that of classical Rough Set based indiscernibility. In particular, we study the proposed Rough Set based scheme for feature set reduction. Rough-Cluster (RC) based approximate algorithms are proposed. The major advantage of these algorithms over the classical method is that they work well even without data discretization. The accuracy, measured in terms of the proportion of correctly classified data samples, is obtained on various standard data sets. The results are found to be on par with those obtained through classical Rough Set based technique for the problem of feature selection.\",\"PeriodicalId\":6917,\"journal\":{\"name\":\"2017 International Conference on Computer, Communications and Electronics (Comptelix)\",\"volume\":\"18 1\",\"pages\":\"195-199\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Computer, Communications and Electronics (Comptelix)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/COMPTELIX.2017.8003963\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Computer, Communications and Electronics (Comptelix)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COMPTELIX.2017.8003963","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A novel approach for feature selection using Rough Sets
Rough Set is a mathematical tool to find patterns hidden in data with uncertainty. A major step for reduction of high dimension data, present in various forms, is selection of appropriate features. In this work we propose a new indiscernibility relation based on clusters, and compare its effectiveness with that of classical Rough Set based indiscernibility. In particular, we study the proposed Rough Set based scheme for feature set reduction. Rough-Cluster (RC) based approximate algorithms are proposed. The major advantage of these algorithms over the classical method is that they work well even without data discretization. The accuracy, measured in terms of the proportion of correctly classified data samples, is obtained on various standard data sets. The results are found to be on par with those obtained through classical Rough Set based technique for the problem of feature selection.