Xiyu Meng , Tan Tang , Yuhua Zhou , Zihan Yan , Dazhen Deng , Yongheng Wang , Yuhan Wu , Yingcai Wu
{"title":"VIS4SL: A visual analytic approach for interpreting and diagnosing shortcut learning","authors":"Xiyu Meng , Tan Tang , Yuhua Zhou , Zihan Yan , Dazhen Deng , Yongheng Wang , Yuhan Wu , Yingcai Wu","doi":"10.1016/j.knosys.2025.113598","DOIUrl":null,"url":null,"abstract":"<div><div>Shortcut learning, a phenomenon where deep neural networks inadvertently learn irrelevant features, has been extensively discussed due to its impact on model generalization and unexpected failures. Interpreting and diagnosing shortcut learning is challenging due to its diverse manifestations and multiple influencing factors. To assist data scientists in these tasks, we introduce VIS4SL, an interactive visual analytics approach that harnesses both human intelligence and computational power. VIS4SL integrates a perturbation-based method with comprehensive visualizations to facilitate an understandable analysis of learned features. We also present a set of comparative visualizations that allow for the evaluation of model explanations against robust proxies, particularly human explanations, to quantify the degree of shortcut learning and assess model components. Two case studies, involving natural image classification and visualization classification, demonstrate the efficacy of VIS4SL in practical applications. Our findings reveal that the model uses the orientation of bars to differentiate between bar charts and Pareto charts. Furthermore, we explore how interactive visualizations enhance data scientists’ understanding of shortcut learning, enabling the development of more precise deep learning models.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"320 ","pages":"Article 113598"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125006446","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Shortcut learning, a phenomenon where deep neural networks inadvertently learn irrelevant features, has been extensively discussed due to its impact on model generalization and unexpected failures. Interpreting and diagnosing shortcut learning is challenging due to its diverse manifestations and multiple influencing factors. To assist data scientists in these tasks, we introduce VIS4SL, an interactive visual analytics approach that harnesses both human intelligence and computational power. VIS4SL integrates a perturbation-based method with comprehensive visualizations to facilitate an understandable analysis of learned features. We also present a set of comparative visualizations that allow for the evaluation of model explanations against robust proxies, particularly human explanations, to quantify the degree of shortcut learning and assess model components. Two case studies, involving natural image classification and visualization classification, demonstrate the efficacy of VIS4SL in practical applications. Our findings reveal that the model uses the orientation of bars to differentiate between bar charts and Pareto charts. Furthermore, we explore how interactive visualizations enhance data scientists’ understanding of shortcut learning, enabling the development of more precise deep learning models.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.