{"title":"Stability-Based Multivariate Mapping Using SCoRS","authors":"J. Rondina, J. Shawe-Taylor, J. Miranda","doi":"10.1109/PRNI.2013.58","DOIUrl":null,"url":null,"abstract":"Recently we proposed a feature selection method based on stability theory (SCoRS - Survival Count on Random Subspaces) and showed that the proposed approach was able to improve classification accuracy using different datasets. In the present work we propose: (i) an extension of SCoRS using reproducibility instead of model accuracy as the parameter optimization criterion and (ii) a procedure to estimate the rate of false positive selection associated with the set of features obtained. Our results using the proposed framework showed that, as expected, the optimal parameter was more stable across the cross-validation folds, the spatial map displaying the features selected was less noisy and there was no decrease in classification accuracy. In addition, our results suggest that the estimated false positive rate for the features selected by SCoRS is under 0.05 for both optimization approaches, nevertheless lower when optimizing reproducibility in comparison with the standard optimization approach.","PeriodicalId":144007,"journal":{"name":"2013 International Workshop on Pattern Recognition in Neuroimaging","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2013-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Workshop on Pattern Recognition in Neuroimaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PRNI.2013.58","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Recently we proposed a feature selection method based on stability theory (SCoRS - Survival Count on Random Subspaces) and showed that the proposed approach was able to improve classification accuracy using different datasets. In the present work we propose: (i) an extension of SCoRS using reproducibility instead of model accuracy as the parameter optimization criterion and (ii) a procedure to estimate the rate of false positive selection associated with the set of features obtained. Our results using the proposed framework showed that, as expected, the optimal parameter was more stable across the cross-validation folds, the spatial map displaying the features selected was less noisy and there was no decrease in classification accuracy. In addition, our results suggest that the estimated false positive rate for the features selected by SCoRS is under 0.05 for both optimization approaches, nevertheless lower when optimizing reproducibility in comparison with the standard optimization approach.
最近,我们提出了一种基于稳定性理论的特征选择方法(SCoRS - Survival Count on Random Subspaces),并表明该方法能够在不同的数据集上提高分类精度。在目前的工作中,我们提出:(i)使用可重复性而不是模型精度作为参数优化标准的SCoRS扩展;(ii)估计与所获得的特征集相关的假阳性选择率的程序。我们使用该框架的结果表明,正如预期的那样,最优参数在交叉验证折叠中更加稳定,显示所选特征的空间地图噪声更小,分类精度没有下降。此外,我们的结果表明,两种优化方法对SCoRS选择的特征的估计假阳性率都在0.05以下,但在优化再现性时,与标准优化方法相比,假阳性率较低。