{"title":"Improving classification accuracy through feature selection","authors":"C. V. Bratu, T. Muresan, R. Potolea","doi":"10.1109/ICCP.2008.4648350","DOIUrl":null,"url":null,"abstract":"High accuracy is essential to any data mining process. A large part of the factors which influence the success of a data mining problem reside in the quality of the data used. Feature selection represents one of the tools which can refine a dataset before presenting it to a learning scheme. This paper analyzes a wrapper approach for feature selection, with the purpose of boosting the classification accuracy. A wrapper is viewed as a 3-tuple consisting of a generation procedure, an evaluation function and a validation procedure. Experimental evaluations have been performed for several combinations of the three components. The results have shown that feature selection improves the classification accuracy and speeds up the training process. Moreover, two robust combinations are proposed: one that constantly achieves highest accuracy, and one which significantly boosts the initial accuracy of the inducer.","PeriodicalId":169031,"journal":{"name":"2008 4th International Conference on Intelligent Computer Communication and Processing","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"29","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 4th International Conference on Intelligent Computer Communication and Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCP.2008.4648350","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 29
Abstract
High accuracy is essential to any data mining process. A large part of the factors which influence the success of a data mining problem reside in the quality of the data used. Feature selection represents one of the tools which can refine a dataset before presenting it to a learning scheme. This paper analyzes a wrapper approach for feature selection, with the purpose of boosting the classification accuracy. A wrapper is viewed as a 3-tuple consisting of a generation procedure, an evaluation function and a validation procedure. Experimental evaluations have been performed for several combinations of the three components. The results have shown that feature selection improves the classification accuracy and speeds up the training process. Moreover, two robust combinations are proposed: one that constantly achieves highest accuracy, and one which significantly boosts the initial accuracy of the inducer.