Exploiting auto-encoders for explaining black-box classifiers

Impact Factor: 1.9 | Q3 | Computer Science, Artificial Intelligence
Riccardo Guidotti
{"title":"利用自动编码器解释黑盒分类器","authors":"Riccardo Guidotti","doi":"10.3233/ia-220139","DOIUrl":null,"url":null,"abstract":"Recent years have witnessed the rise of accurate but obscure classification models that hide the logic of their internal decision processes. In this paper, we present a framework to locally explain any type of black-box classifiers working on any data type through a rule-based model. In the literature already exists local explanation approaches able to accomplish this task. However, they suffer from a significant limitation that implies representing data as a binary vectors and constraining the local surrogate model to be trained on synthetic instances that are not representative of the real world. We overcome these deficiencies by using autoencoder-based approaches. The proposed framework first allows to generate synthetic instances in the latent feature space and learn a latent decision tree classifier. After that, it selects and decodes the synthetic instances respecting local decision rules. Independently from the data type under analysis, such synthetic instances belonging to different classes can unveil the reasons for the classification. Also, depending on the data type, they can be exploited to provide the most useful kind of explanation. Experiments show that the proposed framework advances the state-of-the-art towards a comprehensive and widely usable approach that is able to successfully guarantee various properties besides interpretability.","PeriodicalId":42055,"journal":{"name":"Intelligenza Artificiale","volume":"16 1","pages":"115-129"},"PeriodicalIF":1.9000,"publicationDate":"2022-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploiting auto-encoders for explaining black-box classifiers\",\"authors\":\"Riccardo Guidotti\",\"doi\":\"10.3233/ia-220139\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent years have witnessed the rise of accurate but obscure classification models that hide the logic of their internal decision processes. In this paper, we present a framework to locally explain any type of black-box classifiers working on any data type through a rule-based model. In the literature already exists local explanation approaches able to accomplish this task. However, they suffer from a significant limitation that implies representing data as a binary vectors and constraining the local surrogate model to be trained on synthetic instances that are not representative of the real world. We overcome these deficiencies by using autoencoder-based approaches. The proposed framework first allows to generate synthetic instances in the latent feature space and learn a latent decision tree classifier. After that, it selects and decodes the synthetic instances respecting local decision rules. Independently from the data type under analysis, such synthetic instances belonging to different classes can unveil the reasons for the classification. Also, depending on the data type, they can be exploited to provide the most useful kind of explanation. 
Experiments show that the proposed framework advances the state-of-the-art towards a comprehensive and widely usable approach that is able to successfully guarantee various properties besides interpretability.\",\"PeriodicalId\":42055,\"journal\":{\"name\":\"Intelligenza Artificiale\",\"volume\":\"16 1\",\"pages\":\"115-129\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2022-07-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Intelligenza Artificiale\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3233/ia-220139\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligenza Artificiale","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/ia-220139","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Citations: 0

Abstract

Recent years have witnessed the rise of accurate but obscure classification models that hide the logic of their internal decision processes. In this paper, we present a framework that locally explains any type of black-box classifier working on any data type through a rule-based model. Local explanation approaches able to accomplish this task already exist in the literature. However, they suffer from a significant limitation: they represent data as binary vectors and constrain the local surrogate model to be trained on synthetic instances that are not representative of the real world. We overcome these deficiencies by using autoencoder-based approaches. The proposed framework first generates synthetic instances in the latent feature space and learns a latent decision tree classifier. It then selects and decodes the synthetic instances that respect the local decision rules. Independently of the data type under analysis, such synthetic instances belonging to different classes can unveil the reasons for the classification; moreover, depending on the data type, they can be exploited to provide the most useful kind of explanation. Experiments show that the proposed framework advances the state of the art towards a comprehensive and widely usable approach that successfully guarantees various properties besides interpretability.
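To make the pipeline concrete, below is a minimal sketch of the four steps the abstract describes: encode, perturb in latent space, fit a latent surrogate tree, then decode rule-respecting instances. It is not the paper's implementation: PCA stands in for the learned autoencoder purely to keep the sketch self-contained, a random forest plays the black box, and the dataset, the locality scale, and all names (`encode`, `decode`, `Z_syn`, `surrogate`, ...) are illustrative assumptions.

```python
# Hedged sketch of an autoencoder-style local explanation pipeline.
# Assumptions: PCA replaces the paper's learned autoencoder; the black box,
# dataset, and neighborhood scale are illustrative, not from the paper.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X, y = load_breast_cancer(return_X_y=True)

# The black box: we only query its predict(), never its internals.
black_box = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

# Stand-in "autoencoder": PCA provides an encode/decode pair over a latent
# space; the paper learns an autoencoder instead of this linear surrogate.
ae = PCA(n_components=5).fit(X)
encode, decode = ae.transform, ae.inverse_transform

# Step 1: encode the instance to explain and sample a synthetic neighborhood
# directly in the latent space, so decoded neighbors stay data-like.
x = X[:1]                      # instance to explain, shape (1, n_features)
z = encode(x)                  # latent representation, shape (1, 5)
z_std = encode(X).std(axis=0)  # per-dimension spread of the latent space
Z_syn = z + rng.normal(scale=0.5 * z_std, size=(1000, 5))  # 0.5: locality knob

# Step 2: label the decoded synthetic instances with the black box and fit
# the local surrogate, a shallow decision tree over the *latent* features.
y_syn = black_box.predict(decode(Z_syn))
surrogate = DecisionTreeClassifier(max_depth=4, random_state=0).fit(Z_syn, y_syn)

# Step 3: read the local decision rule off the root-to-leaf path followed by z.
node_ids = surrogate.decision_path(z).indices  # nodes visited by the one row
feat, thr = surrogate.tree_.feature, surrogate.tree_.threshold
premises = [
    f"latent_{feat[n]} {'<=' if z[0, feat[n]] <= thr[n] else '>'} {thr[n]:.3f}"
    for n in node_ids[:-1]                     # skip the terminal leaf node
]
pred = surrogate.predict(z)[0]
print(" AND ".join(premises), "->", f"class {pred}")

# Step 4: select synthetic latent points that satisfy the rule (same leaf) as
# exemplars, and differently classified ones as counter-exemplars, then
# decode both back to the original feature space for inspection.
same_leaf = surrogate.apply(Z_syn) == surrogate.apply(z)[0]
exemplars = decode(Z_syn[same_leaf][:5])
counter_exemplars = decode(Z_syn[surrogate.predict(Z_syn) != pred][:5])
```

The point of sampling in the latent space, as the abstract argues, is that decoded neighbors lie near the manifold of real data, whereas the binary-vector perturbations used by earlier local explainers produce synthetic instances unrepresentative of the real world.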
Source journal
Intelligenza Artificiale (COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE)
CiteScore: 3.50
Self-citation rate: 6.70%
Articles published per year: 13