Exploiting auto-encoders for explaining black-box classifiers

IF 1.9 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Riccardo Guidotti
{"title":"Exploiting auto-encoders for explaining black-box classifiers","authors":"Riccardo Guidotti","doi":"10.3233/ia-220139","DOIUrl":null,"url":null,"abstract":"Recent years have witnessed the rise of accurate but obscure classification models that hide the logic of their internal decision processes. In this paper, we present a framework to locally explain any type of black-box classifiers working on any data type through a rule-based model. In the literature already exists local explanation approaches able to accomplish this task. However, they suffer from a significant limitation that implies representing data as a binary vectors and constraining the local surrogate model to be trained on synthetic instances that are not representative of the real world. We overcome these deficiencies by using autoencoder-based approaches. The proposed framework first allows to generate synthetic instances in the latent feature space and learn a latent decision tree classifier. After that, it selects and decodes the synthetic instances respecting local decision rules. Independently from the data type under analysis, such synthetic instances belonging to different classes can unveil the reasons for the classification. Also, depending on the data type, they can be exploited to provide the most useful kind of explanation. Experiments show that the proposed framework advances the state-of-the-art towards a comprehensive and widely usable approach that is able to successfully guarantee various properties besides interpretability.","PeriodicalId":42055,"journal":{"name":"Intelligenza Artificiale","volume":"16 1","pages":"115-129"},"PeriodicalIF":1.9000,"publicationDate":"2022-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligenza Artificiale","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/ia-220139","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Recent years have witnessed the rise of accurate but obscure classification models that hide the logic of their internal decision processes. In this paper, we present a framework to locally explain any type of black-box classifiers working on any data type through a rule-based model. In the literature already exists local explanation approaches able to accomplish this task. However, they suffer from a significant limitation that implies representing data as a binary vectors and constraining the local surrogate model to be trained on synthetic instances that are not representative of the real world. We overcome these deficiencies by using autoencoder-based approaches. The proposed framework first allows to generate synthetic instances in the latent feature space and learn a latent decision tree classifier. After that, it selects and decodes the synthetic instances respecting local decision rules. Independently from the data type under analysis, such synthetic instances belonging to different classes can unveil the reasons for the classification. Also, depending on the data type, they can be exploited to provide the most useful kind of explanation. Experiments show that the proposed framework advances the state-of-the-art towards a comprehensive and widely usable approach that is able to successfully guarantee various properties besides interpretability.
利用自动编码器解释黑盒分类器
近年来,准确但模糊的分类模型逐渐兴起,它们隐藏了内部决策过程的逻辑。在本文中,我们提出了一个框架,通过基于规则的模型来局部解释在任何数据类型上工作的任何类型的黑盒分类器。在文献中已经存在能够完成这一任务的局部解释方法。然而,它们受到了一个显著的限制,这意味着将数据表示为二进制向量,并约束局部代理模型在不代表真实世界的合成实例上进行训练。我们通过使用基于自动编码器的方法克服了这些不足。所提出的框架首先允许在潜在特征空间中生成合成实例,并学习潜在决策树分类器。然后,它根据局部决策规则选择并解码合成实例。与所分析的数据类型无关,属于不同类别的此类合成实例可以揭示分类的原因。此外,根据数据类型,可以利用它们来提供最有用的解释。实验表明,所提出的框架将最先进的技术推向了一种全面且广泛可用的方法,该方法能够成功地保证除可解释性之外的各种性质。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Intelligenza Artificiale
Intelligenza Artificiale COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-
CiteScore
3.50
自引率
6.70%
发文量
13
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信