VIS4SL: A visual analytic approach for interpreting and diagnosing shortcut learning

IF 7.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Knowledge-Based Systems Pub Date : 2025-05-06 DOI:10.1016/j.knosys.2025.113598

Xiyu Meng , Tan Tang , Yuhua Zhou , Zihan Yan , Dazhen Deng , Yongheng Wang , Yuhan Wu , Yingcai Wu

{"title":"VIS4SL: A visual analytic approach for interpreting and diagnosing shortcut learning","authors":"Xiyu Meng , Tan Tang , Yuhua Zhou , Zihan Yan , Dazhen Deng , Yongheng Wang , Yuhan Wu , Yingcai Wu","doi":"10.1016/j.knosys.2025.113598","DOIUrl":null,"url":null,"abstract":"<div><div>Shortcut learning, a phenomenon where deep neural networks inadvertently learn irrelevant features, has been extensively discussed due to its impact on model generalization and unexpected failures. Interpreting and diagnosing shortcut learning is challenging due to its diverse manifestations and multiple influencing factors. To assist data scientists in these tasks, we introduce VIS4SL, an interactive visual analytics approach that harnesses both human intelligence and computational power. VIS4SL integrates a perturbation-based method with comprehensive visualizations to facilitate an understandable analysis of learned features. We also present a set of comparative visualizations that allow for the evaluation of model explanations against robust proxies, particularly human explanations, to quantify the degree of shortcut learning and assess model components. Two case studies, involving natural image classification and visualization classification, demonstrate the efficacy of VIS4SL in practical applications. Our findings reveal that the model uses the orientation of bars to differentiate between bar charts and Pareto charts. Furthermore, we explore how interactive visualizations enhance data scientists’ understanding of shortcut learning, enabling the development of more precise deep learning models.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"320 ","pages":"Article 113598"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125006446","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Shortcut learning, a phenomenon where deep neural networks inadvertently learn irrelevant features, has been extensively discussed due to its impact on model generalization and unexpected failures. Interpreting and diagnosing shortcut learning is challenging due to its diverse manifestations and multiple influencing factors. To assist data scientists in these tasks, we introduce VIS4SL, an interactive visual analytics approach that harnesses both human intelligence and computational power. VIS4SL integrates a perturbation-based method with comprehensive visualizations to facilitate an understandable analysis of learned features. We also present a set of comparative visualizations that allow for the evaluation of model explanations against robust proxies, particularly human explanations, to quantify the degree of shortcut learning and assess model components. Two case studies, involving natural image classification and visualization classification, demonstrate the efficacy of VIS4SL in practical applications. Our findings reveal that the model uses the orientation of bars to differentiate between bar charts and Pareto charts. Furthermore, we explore how interactive visualizations enhance data scientists’ understanding of shortcut learning, enabling the development of more precise deep learning models.

查看原文本刊更多论文

VIS4SL：一种用于解释和诊断快捷学习的可视化分析方法

快速学习是一种深度神经网络无意中学习不相关特征的现象，由于其对模型泛化和意外故障的影响而被广泛讨论。捷径学习的表现形式多样，影响因素多，对其进行解释和诊断具有一定的挑战性。为了帮助数据科学家完成这些任务，我们介绍了VIS4SL，一种利用人类智能和计算能力的交互式可视化分析方法。VIS4SL将基于微扰的方法与全面的可视化相结合，以促进对学习特征的可理解分析。我们还提出了一组比较可视化，允许根据稳健代理评估模型解释，特别是人类解释，以量化快捷学习的程度并评估模型组件。两个涉及自然图像分类和可视化分类的案例研究证明了VIS4SL在实际应用中的有效性。我们的研究结果表明，该模型使用柱状图的方向来区分柱状图和帕累托图。此外，我们还探讨了交互式可视化如何增强数据科学家对快捷学习的理解，从而开发更精确的深度学习模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Knowledge-Based Systems 工程技术-计算机：人工智能

CiteScore

14.80

自引率

12.50%

发文量

1245

审稿时长

7.8 months

期刊介绍： Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.