Parikshit Solunke, Vitoria Guardieiro, Joao Rulff, Peter Xenopoulos, Gromit Yeuk-Yin Chan, Brian Barr, Luis Gustavo Nonato, Claudio Silva
{"title":"MOUNTAINEER: Topology-Driven Visual Analytics for Comparing Local Explanations","authors":"Parikshit Solunke, Vitoria Guardieiro, Joao Rulff, Peter Xenopoulos, Gromit Yeuk-Yin Chan, Brian Barr, Luis Gustavo Nonato, Claudio Silva","doi":"arxiv-2406.15613","DOIUrl":null,"url":null,"abstract":"With the increasing use of black-box Machine Learning (ML) techniques in\ncritical applications, there is a growing demand for methods that can provide\ntransparency and accountability for model predictions. As a result, a large\nnumber of local explainability methods for black-box models have been developed\nand popularized. However, machine learning explanations are still hard to\nevaluate and compare due to the high dimensionality, heterogeneous\nrepresentations, varying scales, and stochastic nature of some of these\nmethods. Topological Data Analysis (TDA) can be an effective method in this\ndomain since it can be used to transform attributions into uniform graph\nrepresentations, providing a common ground for comparison across different\nexplanation methods. We present a novel topology-driven visual analytics tool, Mountaineer, that\nallows ML practitioners to interactively analyze and compare these\nrepresentations by linking the topological graphs back to the original data\ndistribution, model predictions, and feature attributions. Mountaineer\nfacilitates rapid and iterative exploration of ML explanations, enabling\nexperts to gain deeper insights into the explanation techniques, understand the\nunderlying data distributions, and thus reach well-founded conclusions about\nmodel behavior. Furthermore, we demonstrate the utility of Mountaineer through\ntwo case studies using real-world data. In the first, we show how Mountaineer\nenabled us to compare black-box ML explanations and discern regions of and\ncauses of disagreements between different explanations. In the second, we\ndemonstrate how the tool can be used to compare and understand ML models\nthemselves. Finally, we conducted interviews with three industry experts to\nhelp us evaluate our work.","PeriodicalId":501119,"journal":{"name":"arXiv - MATH - Algebraic Topology","volume":"164 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - MATH - Algebraic Topology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.15613","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the increasing use of black-box Machine Learning (ML) techniques in
critical applications, there is a growing demand for methods that can provide
transparency and accountability for model predictions. As a result, a large
number of local explainability methods for black-box models have been developed
and popularized. However, machine learning explanations are still hard to
evaluate and compare due to the high dimensionality, heterogeneous
representations, varying scales, and stochastic nature of some of these
methods. Topological Data Analysis (TDA) can be an effective method in this
domain since it can be used to transform attributions into uniform graph
representations, providing a common ground for comparison across different
explanation methods. We present a novel topology-driven visual analytics tool, Mountaineer, that
allows ML practitioners to interactively analyze and compare these
representations by linking the topological graphs back to the original data
distribution, model predictions, and feature attributions. Mountaineer
facilitates rapid and iterative exploration of ML explanations, enabling
experts to gain deeper insights into the explanation techniques, understand the
underlying data distributions, and thus reach well-founded conclusions about
model behavior. Furthermore, we demonstrate the utility of Mountaineer through
two case studies using real-world data. In the first, we show how Mountaineer
enabled us to compare black-box ML explanations and discern regions of and
causes of disagreements between different explanations. In the second, we
demonstrate how the tool can be used to compare and understand ML models
themselves. Finally, we conducted interviews with three industry experts to
help us evaluate our work.
随着黑盒机器学习(ML)技术在关键应用中的使用越来越多,人们对能够为模型预测提供透明度和责任感的方法的需求也越来越大。因此,大量针对黑盒模型的局部可解释性方法得到了开发和推广。然而,由于一些方法的高维性、异质性、不同尺度和随机性,机器学习解释仍然难以评估和比较。拓扑数据分析(Topological Data Analysis,TDA)是这一领域的有效方法,因为它可以用来将归因转化为统一的图表示,为不同解释方法之间的比较提供共同基础。我们介绍了一种新颖的拓扑驱动可视化分析工具 Mountaineer,它允许人工智能从业人员通过将拓扑图与原始数据分布、模型预测和特征归因联系起来,以交互方式分析和比较这些表示。登山者有助于对 ML 解释进行快速、反复的探索,使专家能够深入了解解释技术,理解基本数据分布,从而对模型行为得出有理有据的结论。此外,我们还通过两个使用真实世界数据的案例研究展示了 Mountaineer 的实用性。在第一个案例中,我们展示了登山者如何帮助我们比较黑盒子 ML 解释,并找出不同解释之间存在分歧的区域和原因。其次,我们展示了如何使用该工具来比较和理解 ML 模型本身。最后,我们对三位行业专家进行了访谈,以帮助我们评估自己的工作。