{"title":"将可解释集合树(E2Tree)扩展到回归情境中","authors":"Massimo Aria, Agostino Gnasso, Carmela Iorio, Marjolein Fokkema","doi":"arxiv-2409.06439","DOIUrl":null,"url":null,"abstract":"Ensemble methods such as random forests have transformed the landscape of\nsupervised learning, offering highly accurate prediction through the\naggregation of multiple weak learners. However, despite their effectiveness,\nthese methods often lack transparency, impeding users' comprehension of how RF\nmodels arrive at their predictions. Explainable ensemble trees (E2Tree) is a\nnovel methodology for explaining random forests, that provides a graphical\nrepresentation of the relationship between response variables and predictors. A\nstriking characteristic of E2Tree is that it not only accounts for the effects\nof predictor variables on the response but also accounts for associations\nbetween the predictor variables through the computation and use of\ndissimilarity measures. The E2Tree methodology was initially proposed for use\nin classification tasks. In this paper, we extend the methodology to encompass\nregression contexts. To demonstrate the explanatory power of the proposed\nalgorithm, we illustrate its use on real-world datasets.","PeriodicalId":501340,"journal":{"name":"arXiv - STAT - Machine Learning","volume":"13 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Extending Explainable Ensemble Trees (E2Tree) to regression contexts\",\"authors\":\"Massimo Aria, Agostino Gnasso, Carmela Iorio, Marjolein Fokkema\",\"doi\":\"arxiv-2409.06439\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Ensemble methods such as random forests have transformed the landscape of\\nsupervised learning, offering highly accurate prediction through the\\naggregation of multiple weak learners. However, despite their effectiveness,\\nthese methods often lack transparency, impeding users' comprehension of how RF\\nmodels arrive at their predictions. Explainable ensemble trees (E2Tree) is a\\nnovel methodology for explaining random forests, that provides a graphical\\nrepresentation of the relationship between response variables and predictors. A\\nstriking characteristic of E2Tree is that it not only accounts for the effects\\nof predictor variables on the response but also accounts for associations\\nbetween the predictor variables through the computation and use of\\ndissimilarity measures. The E2Tree methodology was initially proposed for use\\nin classification tasks. In this paper, we extend the methodology to encompass\\nregression contexts. To demonstrate the explanatory power of the proposed\\nalgorithm, we illustrate its use on real-world datasets.\",\"PeriodicalId\":501340,\"journal\":{\"name\":\"arXiv - STAT - Machine Learning\",\"volume\":\"13 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.06439\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.06439","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Extending Explainable Ensemble Trees (E2Tree) to regression contexts
Ensemble methods such as random forests have transformed the landscape of
supervised learning, offering highly accurate prediction through the
aggregation of multiple weak learners. However, despite their effectiveness,
these methods often lack transparency, impeding users' comprehension of how RF
models arrive at their predictions. Explainable ensemble trees (E2Tree) is a
novel methodology for explaining random forests, that provides a graphical
representation of the relationship between response variables and predictors. A
striking characteristic of E2Tree is that it not only accounts for the effects
of predictor variables on the response but also accounts for associations
between the predictor variables through the computation and use of
dissimilarity measures. The E2Tree methodology was initially proposed for use
in classification tasks. In this paper, we extend the methodology to encompass
regression contexts. To demonstrate the explanatory power of the proposed
algorithm, we illustrate its use on real-world datasets.