Interpretable Machine Learning for Self-Service High-Risk Decision-Making

2022 26th International Conference Information Visualisation (IV) Pub Date : 2022-05-09 DOI:10.1109/IV56949.2022.00061

Charles Recaido, B. Kovalerchuk

{"title":"Interpretable Machine Learning for Self-Service High-Risk Decision-Making","authors":"Charles Recaido, B. Kovalerchuk","doi":"10.1109/IV56949.2022.00061","DOIUrl":null,"url":null,"abstract":"This paper contributes to interpretable machine learning via visual knowledge discovery in general line coordinates (GLC). The concepts of hyperblocks as interpretable dataset units and general line coordinates are combined to create a visual self-service machine learning model. Dynamic Scaffolding Coordinates as lossless multidimensional coordinate systems are proposed, and their applications as visual models is shown. DSC1 and DSC2 can map multiple dataset attributes to a single two-dimensional (X, Y) Cartesian plane using a graph construction algorithm. The hyperblock analysis was used to determine visually appealing dataset attribute orders and to reduce line occlusion. It is shown that hyperblocks can generalize decision tree rules and a series of DSC1 or DSC2 plots can visualize a decision tree. The DSC1 and DSC2 plots were tested on benchmark datasets from the UCI ML repository. They allowed for visual classification of data. Additionally, areas of hyperblock impurity were discovered and used to establish dataset splits that highlight the upper estimate of worst-case model accuracy to guide model selection for high-risk decision-making. Major benefits of DSC1 and DSC2 is their highly interpretable nature. They allow domain experts to control or establish new machine learning models through visual pattern discovery.","PeriodicalId":153161,"journal":{"name":"2022 26th International Conference Information Visualisation (IV)","volume":"101 9","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 26th International Conference Information Visualisation (IV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IV56949.2022.00061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

This paper contributes to interpretable machine learning via visual knowledge discovery in general line coordinates (GLC). The concepts of hyperblocks as interpretable dataset units and general line coordinates are combined to create a visual self-service machine learning model. Dynamic Scaffolding Coordinates as lossless multidimensional coordinate systems are proposed, and their applications as visual models is shown. DSC1 and DSC2 can map multiple dataset attributes to a single two-dimensional (X, Y) Cartesian plane using a graph construction algorithm. The hyperblock analysis was used to determine visually appealing dataset attribute orders and to reduce line occlusion. It is shown that hyperblocks can generalize decision tree rules and a series of DSC1 or DSC2 plots can visualize a decision tree. The DSC1 and DSC2 plots were tested on benchmark datasets from the UCI ML repository. They allowed for visual classification of data. Additionally, areas of hyperblock impurity were discovered and used to establish dataset splits that highlight the upper estimate of worst-case model accuracy to guide model selection for high-risk decision-making. Major benefits of DSC1 and DSC2 is their highly interpretable nature. They allow domain experts to control or establish new machine learning models through visual pattern discovery.

查看原文本刊更多论文

自助式高风险决策的可解释机器学习

本文通过在一般直线坐标(GLC)中的视觉知识发现为可解释的机器学习做出了贡献。超块作为可解释数据集单元的概念和一般的直线坐标相结合，创建了一个可视化的自助机器学习模型。提出了动态脚手架坐标系作为无损多维坐标系，并给出了动态脚手架坐标系作为可视化模型的应用。DSC1和DSC2可以使用图构建算法将多个数据集属性映射到单个二维(X, Y)笛卡尔平面上。使用超块分析来确定视觉上吸引人的数据集属性顺序并减少线遮挡。结果表明，超块可以泛化决策树规则，一系列DSC1或DSC2图可以可视化决策树。DSC1和DSC2图在来自UCI ML存储库的基准数据集上进行测试。它们允许对数据进行可视化分类。此外，超块杂质区域被发现并用于建立数据集分割，突出最坏情况模型精度的上限估计，以指导高风险决策的模型选择。DSC1和DSC2的主要优点是它们的高度可解释性。它们允许领域专家通过视觉模式发现来控制或建立新的机器学习模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 26th International Conference Information Visualisation (IV)

自引率

0.00%

发文量