Feature Selection on High Dimensional Data Using Wrapper Based Subset Selection
G. Manikandan, E. Susi, S. Abirami
2017 Second International Conference on Recent Trends and Challenges in Computational Models (ICRTCCM), 2017-02-01
DOI: 10.1109/ICRTCCM.2017.58 (https://doi.org/10.1109/ICRTCCM.2017.58)
Citations: 11
Abstract
In recent years, feature subset selection and classification in high-dimensional data have become a major challenge for researchers. The main aim of feature subset selection is to find the most informative features among the vast number of features in high-dimensional data. Filter, wrapper, and embedded methods are currently used to address this problem. In this paper, we apply a wrapper-based subset selection technique to select a feature subset from high-dimensional datasets. To find the optimal threshold value, candidate feature subsets are passed to the classifier iteratively until the maximum accuracy is obtained. Symmetrical uncertainty is used to weight the features and identify the predominant ones. To validate the algorithm, we use 10-fold cross-validation with two standard classifiers, Naive Bayes and Support Vector Machine (SVM), and the results are tabulated and compared. The comparison shows that the proposed method achieves better accuracy.
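The abstract does not spell out the procedure, so the following is only a minimal Python sketch of one plausible reading: features are weighted by symmetrical uncertainty, SU(X, Y) = 2 I(X; Y) / (H(X) + H(Y)), and a threshold on that weight is swept while a classifier scores each resulting subset. All function names, the toy data, and the threshold sweep over distinct SU values are assumptions made for illustration. The leave-one-out lookup classifier is a small stand-in for the Naive Bayes and SVM classifiers with 10-fold cross-validation named in the abstract, and features are assumed to be discrete.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def symmetrical_uncertainty(x, y):
    """SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y)), a value in [0, 1]."""
    hx, hy = entropy(x), entropy(y)
    if hx + hy == 0:
        return 0.0  # both variables are constant
    mutual_info = max(0.0, hx + hy - entropy(list(zip(x, y))))
    return 2.0 * mutual_info / (hx + hy)


def rank_features(columns, labels):
    """Return (feature name, SU with the class label), highest SU first."""
    scores = [(name, symmetrical_uncertainty(col, labels))
              for name, col in columns.items()]
    return sorted(scores, key=lambda item: item[1], reverse=True)


def select_by_threshold(ranked, evaluate):
    """Sweep the SU threshold (assumed: one candidate per distinct SU value)
    and keep the subset with the highest evaluate() score; ties favour the
    higher threshold, i.e. the smaller subset."""
    best_subset, best_score = [], float("-inf")
    for threshold in sorted({su for _, su in ranked}, reverse=True):
        subset = [name for name, su in ranked if su >= threshold]
        score = evaluate(subset)
        if score > best_score:
            best_subset, best_score = subset, score
    return best_subset, best_score


def loo_accuracy(columns, labels, subset):
    """Stand-in evaluator (not the paper's classifiers): leave-one-out
    accuracy of a majority-vote lookup on the selected feature values."""
    rows = list(zip(*(columns[name] for name in subset)))
    hits = 0
    for i, row in enumerate(rows):
        votes = Counter(lab for j, (r, lab) in enumerate(zip(rows, labels))
                        if j != i and r == row)
        if not votes:  # unseen value combination: fall back to majority class
            votes = Counter(lab for j, lab in enumerate(labels) if j != i)
        hits += votes.most_common(1)[0][0] == labels[i]
    return hits / len(labels)


# Hypothetical toy data: one perfect, one partly informative, one irrelevant feature.
labels = [0, 0, 1, 1, 0, 0, 1, 1]
columns = {
    "f_copy": [0, 0, 1, 1, 0, 0, 1, 1],
    "f_partial": [0, 0, 1, 1, 0, 0, 1, 0],
    "f_noise": [0, 1, 0, 1, 0, 1, 0, 1],
}

ranked = rank_features(columns, labels)
best_subset, best_acc = select_by_threshold(
    ranked, lambda subset: loo_accuracy(columns, labels, subset))
print(ranked)
print(best_subset, best_acc)
```

In practice, the `evaluate` callback would be replaced by the cross-validated accuracy of the chosen classifier, and continuous features would need to be discretized before computing symmetrical uncertainty.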