Himabindu Lakkaraju, Ece Kamar, R. Caruana, J. Leskovec
{"title":"Faithful and Customizable Explanations of Black Box Models","authors":"Himabindu Lakkaraju, Ece Kamar, R. Caruana, J. Leskovec","doi":"10.1145/3306618.3314229","DOIUrl":null,"url":null,"abstract":"As predictive models increasingly assist human experts (e.g., doctors) in day-to-day decision making, it is crucial for experts to be able to explore and understand how such models behave in different feature subspaces in order to know if and when to trust them. To this end, we propose Model Understanding through Subspace Explanations (MUSE), a novel model agnostic framework which facilitates understanding of a given black box model by explaining how it behaves in subspaces characterized by certain features of interest. Our framework provides end users (e.g., doctors) with the flexibility of customizing the model explanations by allowing them to input the features of interest. The construction of explanations is guided by a novel objective function that we propose to simultaneously optimize for fidelity to the original model, unambiguity and interpretability of the explanation. More specifically, our objective allows us to learn, with optimality guarantees, a small number of compact decision sets each of which captures the behavior of a given black box model in unambiguous, well-defined regions of the feature space. Experimental evaluation with real-world datasets and user studies demonstrate that our approach can generate customizable, highly compact, easy-to-understand, yet accurate explanations of various kinds of predictive models compared to state-of-the-art baselines.","PeriodicalId":418125,"journal":{"name":"Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society","volume":"108 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"224","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3306618.3314229","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 224
Abstract
As predictive models increasingly assist human experts (e.g., doctors) in day-to-day decision making, it is crucial for experts to be able to explore and understand how such models behave in different feature subspaces in order to know if and when to trust them. To this end, we propose Model Understanding through Subspace Explanations (MUSE), a novel model-agnostic framework which facilitates understanding of a given black box model by explaining how it behaves in subspaces characterized by certain features of interest. Our framework provides end users (e.g., doctors) with the flexibility of customizing the model explanations by allowing them to input the features of interest. The construction of explanations is guided by a novel objective function that we propose to simultaneously optimize for fidelity to the original model, unambiguity, and interpretability of the explanation. More specifically, our objective allows us to learn, with optimality guarantees, a small number of compact decision sets each of which captures the behavior of a given black box model in unambiguous, well-defined regions of the feature space. Experimental evaluation with real-world datasets and user studies demonstrates that our approach can generate customizable, highly compact, easy-to-understand, yet accurate explanations of various kinds of predictive models compared to state-of-the-art baselines.
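To make the abstract's description more concrete, the sketch below illustrates the general idea of a two-level decision-set explanation and a fidelity measure against a black box. It is a minimal, illustrative sketch only: the names (`Rule`, `fidelity`, the predicate representation) are hypothetical and do not reflect the authors' actual MUSE implementation or its optimization procedure.

```python
# Illustrative sketch (not the authors' code): a two-level explanation pairs a
# subspace descriptor, built from user-chosen features of interest, with an
# inner condition and the label the black box predicts in that region.
# Fidelity is the fraction of covered points on which the explanation agrees
# with the black box's predictions.

from dataclasses import dataclass
from typing import Callable, List
import numpy as np

# A predicate maps a feature matrix (n_samples, n_features) to a boolean row mask.
Predicate = Callable[[np.ndarray], np.ndarray]

@dataclass
class Rule:
    subspace: Predicate   # e.g. "age >= 50", drawn from the features of interest
    condition: Predicate  # inner condition describing model behavior in that subspace
    label: int            # label the explanation assigns to points it covers

def fidelity(rules: List[Rule], X: np.ndarray, blackbox_preds: np.ndarray) -> float:
    """Fraction of covered points where the explanation agrees with the black box."""
    agree, covered = 0, 0
    for r in rules:
        mask = r.subspace(X) & r.condition(X)
        covered += int(mask.sum())
        agree += int((blackbox_preds[mask] == r.label).sum())
    return agree / max(covered, 1)

# Hypothetical usage: explain a black box on a toy matrix with columns [age, dose].
X = np.array([[55, 1.0], [60, 0.2], [30, 0.9], [42, 0.1]])
blackbox_preds = np.array([1, 0, 1, 0])
rules = [
    Rule(subspace=lambda x: x[:, 0] >= 50,      # feature of interest: age >= 50
         condition=lambda x: x[:, 1] >= 0.5,    # model behavior there: high dose -> 1
         label=1),
]
print(fidelity(rules, X, blackbox_preds))  # agreement on the points this rule covers
```

The paper's actual objective additionally penalizes ambiguity (overlapping rules assigning conflicting labels) and explanation size, and is optimized with approximation guarantees; the snippet above only shows the fidelity component to ground the terminology.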