{"title":"基于边际的排列变量重要性:随机森林的稳定重要性度量","authors":"Fan Yang, Peng Piao, Yongxuan Lai, Liu Pei","doi":"10.1109/ISKE.2017.8258842","DOIUrl":null,"url":null,"abstract":"Permutation based variable importance measure (VIM) has been widely used in various research fields. For example, in gene expression studies, it has been regarded as a screening tool to select a subset of relevant genes for subsequent analysis or better predictive performance. However, little effort has been devoted to the stability of variable importance measures. In this paper, margin based permutation variable importance measures (VIM-MDs) are proposed, which utilize the similarity between margin distribution before and after random permutation to evaluate the importance of variables. Experiments on six benchmark datasets show that the VIM-MDs outperform permutation based variable importance measure in terms of both global stability and predictive accuracy, which indicates that the proposed method could be used as an effective and stable variable importance measure for random forest.","PeriodicalId":208009,"journal":{"name":"2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Margin based permutation variable importance: A stable importance measure for random forest\",\"authors\":\"Fan Yang, Peng Piao, Yongxuan Lai, Liu Pei\",\"doi\":\"10.1109/ISKE.2017.8258842\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Permutation based variable importance measure (VIM) has been widely used in various research fields. For example, in gene expression studies, it has been regarded as a screening tool to select a subset of relevant genes for subsequent analysis or better predictive performance. However, little effort has been devoted to the stability of variable importance measures. In this paper, margin based permutation variable importance measures (VIM-MDs) are proposed, which utilize the similarity between margin distribution before and after random permutation to evaluate the importance of variables. Experiments on six benchmark datasets show that the VIM-MDs outperform permutation based variable importance measure in terms of both global stability and predictive accuracy, which indicates that the proposed method could be used as an effective and stable variable importance measure for random forest.\",\"PeriodicalId\":208009,\"journal\":{\"name\":\"2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISKE.2017.8258842\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISKE.2017.8258842","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Margin based permutation variable importance: A stable importance measure for random forest
Permutation based variable importance measure (VIM) has been widely used in various research fields. For example, in gene expression studies, it has been regarded as a screening tool to select a subset of relevant genes for subsequent analysis or better predictive performance. However, little effort has been devoted to the stability of variable importance measures. In this paper, margin based permutation variable importance measures (VIM-MDs) are proposed, which utilize the similarity between margin distribution before and after random permutation to evaluate the importance of variables. Experiments on six benchmark datasets show that the VIM-MDs outperform permutation based variable importance measure in terms of both global stability and predictive accuracy, which indicates that the proposed method could be used as an effective and stable variable importance measure for random forest.