{"title":"Feature selection for pattern recognition by LASSO and thresholding methods - a comparison","authors":"Urszula Libal","doi":"10.1109/MMAR.2011.6031338","DOIUrl":null,"url":null,"abstract":"For high-dimensional data processing, like pattern recognition, it seems desirable to precede with a reduction of the number of describing features. Our aim is a comparison of various feature selection methods for pattern recognition. We consider two-class supervised classification problem for signals decomposed in wavelet bases. We test kNN classification rule with soft and hard thresholding, performed in two stages: (1) wavelet detail coefficient thresholding (noise reduction) and (2) searching for the most differentiating coefficients between classes (selection of discriminating coefficients). We present a new classification rule based on LARS/LASSO. We compare criteria for L1-norm regularization of wavelet coefficients: AIC, BIC and the thresh derived for kNN rule. There were performed simulations for noisy signals with SNR in the range from 0 to 22 [dB], approximated for all possible wavelet resolutions. The quality of pattern recognition for the presented algorithms was measured by the estimated recognition risk and the size of reduced model.","PeriodicalId":440376,"journal":{"name":"2011 16th International Conference on Methods & Models in Automation & Robotics","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 16th International Conference on Methods & Models in Automation & Robotics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMAR.2011.6031338","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
For high-dimensional data processing, like pattern recognition, it seems desirable to precede with a reduction of the number of describing features. Our aim is a comparison of various feature selection methods for pattern recognition. We consider two-class supervised classification problem for signals decomposed in wavelet bases. We test kNN classification rule with soft and hard thresholding, performed in two stages: (1) wavelet detail coefficient thresholding (noise reduction) and (2) searching for the most differentiating coefficients between classes (selection of discriminating coefficients). We present a new classification rule based on LARS/LASSO. We compare criteria for L1-norm regularization of wavelet coefficients: AIC, BIC and the thresh derived for kNN rule. There were performed simulations for noisy signals with SNR in the range from 0 to 22 [dB], approximated for all possible wavelet resolutions. The quality of pattern recognition for the presented algorithms was measured by the estimated recognition risk and the size of reduced model.