R. Murata, Yohei Mishina, Yuji Yamauchi, Takayoshi Yamashita, H. Fujiyoshi
{"title":"Efficient feature selection method using contribution ratio by random forest","authors":"R. Murata, Yohei Mishina, Yuji Yamauchi, Takayoshi Yamashita, H. Fujiyoshi","doi":"10.1109/FCV.2015.7103746","DOIUrl":null,"url":null,"abstract":"In the field of image recognition, a high-dimensional feature vector is often used to construct a classifier. This presents a problem, however, since using a large number of features can slow down training and degrade model readability. To alleviate this problem, sequential backward selection (SBS) has come to be used as a method for selecting an effective number of features for classification. However, as a type of wrapper method, SBS iteratively constructs and evaluates classifiers when selecting features, which is computationally intensive. In this study, we define the contribution ratio of features by random forest and use it to create an efficient feature selection method. We performed an evaluation experiment to compare the proposed method with SBS and found that the former could significantly reduce feature selection time for the same dimension reduction rate.","PeriodicalId":424974,"journal":{"name":"2015 21st Korea-Japan Joint Workshop on Frontiers of Computer Vision (FCV)","volume":"43 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 21st Korea-Japan Joint Workshop on Frontiers of Computer Vision (FCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FCV.2015.7103746","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In the field of image recognition, a high-dimensional feature vector is often used to construct a classifier. This presents a problem, however, since using a large number of features can slow down training and degrade model readability. To alleviate this problem, sequential backward selection (SBS) has come to be used as a method for selecting an effective number of features for classification. However, as a type of wrapper method, SBS iteratively constructs and evaluates classifiers when selecting features, which is computationally intensive. In this study, we define the contribution ratio of features by random forest and use it to create an efficient feature selection method. We performed an evaluation experiment to compare the proposed method with SBS and found that the former could significantly reduce feature selection time for the same dimension reduction rate.