{"title":"基于模糊信息测度的综合特征选择方法","authors":"B. Azhagusundari","doi":"10.1109/ICICES.2017.8070711","DOIUrl":null,"url":null,"abstract":"The information is flooded with heterogeneous data sources and it generates over 2.5 quintillion bytes every day from communication devices, social media, consumer transactions, online behaviour and streaming services. To overcome this difficulty irrelevant and redundant data are to be removed using the technique called feature selection. The goal of the feature selection is to find the minimum set of attribute. The results are implemented by MATLAB and WEKA tool for feature selection and classification respectively. This research work is validated using different datasets namely Pima Diabetic, Breast Cancer, Ecoli, Iris, Sonar and Student which are available in UCI repository. Model performance is evaluated by using Precision, Recall and F-Measure performance metrics. The selected subset features are used to compare different feature ranking methods like Gain Ratio, Relief, Chi Square and OneR. The experimental inference reveals that the proposed algorithms are efficient in selecting minimum features for the feature subset and gives higher accuracy rate.","PeriodicalId":134931,"journal":{"name":"2017 International Conference on Information Communication and Embedded Systems (ICICES)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An integrated method for feature selection using fuzzy information measure\",\"authors\":\"B. Azhagusundari\",\"doi\":\"10.1109/ICICES.2017.8070711\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The information is flooded with heterogeneous data sources and it generates over 2.5 quintillion bytes every day from communication devices, social media, consumer transactions, online behaviour and streaming services. To overcome this difficulty irrelevant and redundant data are to be removed using the technique called feature selection. The goal of the feature selection is to find the minimum set of attribute. The results are implemented by MATLAB and WEKA tool for feature selection and classification respectively. This research work is validated using different datasets namely Pima Diabetic, Breast Cancer, Ecoli, Iris, Sonar and Student which are available in UCI repository. Model performance is evaluated by using Precision, Recall and F-Measure performance metrics. The selected subset features are used to compare different feature ranking methods like Gain Ratio, Relief, Chi Square and OneR. The experimental inference reveals that the proposed algorithms are efficient in selecting minimum features for the feature subset and gives higher accuracy rate.\",\"PeriodicalId\":134931,\"journal\":{\"name\":\"2017 International Conference on Information Communication and Embedded Systems (ICICES)\",\"volume\":\"78 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Information Communication and Embedded Systems (ICICES)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICICES.2017.8070711\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Information Communication and Embedded Systems (ICICES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICES.2017.8070711","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An integrated method for feature selection using fuzzy information measure
The information is flooded with heterogeneous data sources and it generates over 2.5 quintillion bytes every day from communication devices, social media, consumer transactions, online behaviour and streaming services. To overcome this difficulty irrelevant and redundant data are to be removed using the technique called feature selection. The goal of the feature selection is to find the minimum set of attribute. The results are implemented by MATLAB and WEKA tool for feature selection and classification respectively. This research work is validated using different datasets namely Pima Diabetic, Breast Cancer, Ecoli, Iris, Sonar and Student which are available in UCI repository. Model performance is evaluated by using Precision, Recall and F-Measure performance metrics. The selected subset features are used to compare different feature ranking methods like Gain Ratio, Relief, Chi Square and OneR. The experimental inference reveals that the proposed algorithms are efficient in selecting minimum features for the feature subset and gives higher accuracy rate.