{"title":"Feature selection for classification using particle swarm optimization","authors":"Lucija Brezočnik","doi":"10.1109/EUROCON.2017.8011255","DOIUrl":null,"url":null,"abstract":"This paper proposes a method for the problem of processing high-dimensional data. When one has thousands of features (attributes) in a dataset, it is hard to achieve an efficient feature selection. To cope with this problem, we propose the use of a binary particle swarm optimization algorithm combined with the C4.5 as a classifier in the fitness function for the selection of informative attributes. The results obtained on 11 datasets were analyzed statistically and reveal that the proposed method, called BPSO+C4.5, outperforms known classifiers, i.e., C4.5, Naive Bayes, and SVM.","PeriodicalId":114100,"journal":{"name":"IEEE EUROCON 2017 -17th International Conference on Smart Technologies","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE EUROCON 2017 -17th International Conference on Smart Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EUROCON.2017.8011255","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 21
Abstract
This paper proposes a method for the problem of processing high-dimensional data. When one has thousands of features (attributes) in a dataset, it is hard to achieve an efficient feature selection. To cope with this problem, we propose the use of a binary particle swarm optimization algorithm combined with the C4.5 as a classifier in the fitness function for the selection of informative attributes. The results obtained on 11 datasets were analyzed statistically and reveal that the proposed method, called BPSO+C4.5, outperforms known classifiers, i.e., C4.5, Naive Bayes, and SVM.