{"title":"堆叠随机森林:更准确和更好的校准","authors":"R. Hänsch","doi":"10.1109/IGARSS39084.2020.9324475","DOIUrl":null,"url":null,"abstract":"Stacked Random Forests (SRFs) sequentially apply multiple Random Forests (RFs) where each instance uses the estimate of the predecessor as additional input to further refine the prediction. They have been shown to improve the performance for semantic segmentation of Polarimetric Synthetic Aperture Radar (PolSAR) images. Both, RFs and SRFs, not only provide an estimate of the class label of a query sample, but instead make a probabilistic prediction, i.e. provide the full class posterior. The probabilistic predictions of RFs are known to be usually well calibrated (i.e. the predictions match the expected probability distributions of each class). This paper answers the question whether stacking leads to overfitting on the training data or decreases the calibration quality of RFs. Results indicate that neither is the case. Instead, classification accuracy steadily increases and then saturates quickly after only a few stacking levels. The predicted probabilities are generally well calibrated where calibration quality also increases slightly for higher stacking levels.","PeriodicalId":444267,"journal":{"name":"IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Stacked Random Forests: More Accurate and Better Calibrated\",\"authors\":\"R. Hänsch\",\"doi\":\"10.1109/IGARSS39084.2020.9324475\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Stacked Random Forests (SRFs) sequentially apply multiple Random Forests (RFs) where each instance uses the estimate of the predecessor as additional input to further refine the prediction. They have been shown to improve the performance for semantic segmentation of Polarimetric Synthetic Aperture Radar (PolSAR) images. Both, RFs and SRFs, not only provide an estimate of the class label of a query sample, but instead make a probabilistic prediction, i.e. provide the full class posterior. The probabilistic predictions of RFs are known to be usually well calibrated (i.e. the predictions match the expected probability distributions of each class). This paper answers the question whether stacking leads to overfitting on the training data or decreases the calibration quality of RFs. Results indicate that neither is the case. Instead, classification accuracy steadily increases and then saturates quickly after only a few stacking levels. The predicted probabilities are generally well calibrated where calibration quality also increases slightly for higher stacking levels.\",\"PeriodicalId\":444267,\"journal\":{\"name\":\"IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IGARSS39084.2020.9324475\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IGARSS39084.2020.9324475","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Stacked Random Forests: More Accurate and Better Calibrated
Stacked Random Forests (SRFs) sequentially apply multiple Random Forests (RFs) where each instance uses the estimate of the predecessor as additional input to further refine the prediction. They have been shown to improve the performance for semantic segmentation of Polarimetric Synthetic Aperture Radar (PolSAR) images. Both, RFs and SRFs, not only provide an estimate of the class label of a query sample, but instead make a probabilistic prediction, i.e. provide the full class posterior. The probabilistic predictions of RFs are known to be usually well calibrated (i.e. the predictions match the expected probability distributions of each class). This paper answers the question whether stacking leads to overfitting on the training data or decreases the calibration quality of RFs. Results indicate that neither is the case. Instead, classification accuracy steadily increases and then saturates quickly after only a few stacking levels. The predicted probabilities are generally well calibrated where calibration quality also increases slightly for higher stacking levels.