Anna N. Gummadi , Osvaldo Arreche , Mustafa Abdallah
{"title":"A systematic evaluation of white-box explainable AI methods for anomaly detection in IoT systems","authors":"Anna N. Gummadi , Osvaldo Arreche , Mustafa Abdallah","doi":"10.1016/j.iot.2025.101505","DOIUrl":null,"url":null,"abstract":"<div><div>The rapid evolution of Internet of Things (IoT) systems has created new avenues for studying how artificial intelligence (AI) might be used to improve anomaly detection in these systems. Using AI for finding and predicting abnormalities or malfunctions in IoT devices with precision is crucial to highlight the salient characteristics of these anomalies to save maintenance expenses and downtime. Pairing these AI-based anomaly detection methods with explanations will make them applicable in real-world IoT systems. However, the systematic evaluation of various explainable AI (XAI) strategies used for anomaly detection in IoT systems is still lacking. To help address this gap, we propose an evaluation framework for different white-box XAI techniques in which we consider six evaluation metrics that are essential for evaluating XAI methods and have mappings to AI and security domains. In particular, we use these metrics to evaluate Integrated Gradients (IG), Layer-wise Relevance Propagation (LRP), and Deep Learning - SHAP (Deep-SHAP), through a single deep neural network (DNN) model. These metrics are descriptive accuracy, sparsity, stability, efficiency, robustness, and completeness. We used these six metrics to evaluate the three aforementioned white-box XAI methods considered in our work. Our evaluation framework is tested on two different IoT datasets: the N-BaIoT dataset, which focuses on detecting IoT botnet attacks on commercial IoT devices infected by the Mirai and Gafgyt botnets, and the MEMS dataset, which contains sensor data from manufacturing processes aimed at detecting abnormal readings. In the context of IoT systems, we illustrate the advantages and disadvantages of the state-of-the-art white-box XAI techniques through this study. Our evaluation framework’s source code is made accessible to the general public, providing a common benchmark for upcoming studies that explore applying explainable AI for enhancing anomaly detection in the emerging IoT field.</div></div>","PeriodicalId":29968,"journal":{"name":"Internet of Things","volume":"30 ","pages":"Article 101505"},"PeriodicalIF":6.0000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet of Things","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2542660525000186","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The rapid evolution of Internet of Things (IoT) systems has created new avenues for studying how artificial intelligence (AI) might be used to improve anomaly detection in these systems. Using AI for finding and predicting abnormalities or malfunctions in IoT devices with precision is crucial to highlight the salient characteristics of these anomalies to save maintenance expenses and downtime. Pairing these AI-based anomaly detection methods with explanations will make them applicable in real-world IoT systems. However, the systematic evaluation of various explainable AI (XAI) strategies used for anomaly detection in IoT systems is still lacking. To help address this gap, we propose an evaluation framework for different white-box XAI techniques in which we consider six evaluation metrics that are essential for evaluating XAI methods and have mappings to AI and security domains. In particular, we use these metrics to evaluate Integrated Gradients (IG), Layer-wise Relevance Propagation (LRP), and Deep Learning - SHAP (Deep-SHAP), through a single deep neural network (DNN) model. These metrics are descriptive accuracy, sparsity, stability, efficiency, robustness, and completeness. We used these six metrics to evaluate the three aforementioned white-box XAI methods considered in our work. Our evaluation framework is tested on two different IoT datasets: the N-BaIoT dataset, which focuses on detecting IoT botnet attacks on commercial IoT devices infected by the Mirai and Gafgyt botnets, and the MEMS dataset, which contains sensor data from manufacturing processes aimed at detecting abnormal readings. In the context of IoT systems, we illustrate the advantages and disadvantages of the state-of-the-art white-box XAI techniques through this study. Our evaluation framework’s source code is made accessible to the general public, providing a common benchmark for upcoming studies that explore applying explainable AI for enhancing anomaly detection in the emerging IoT field.
期刊介绍:
Internet of Things; Engineering Cyber Physical Human Systems is a comprehensive journal encouraging cross collaboration between researchers, engineers and practitioners in the field of IoT & Cyber Physical Human Systems. The journal offers a unique platform to exchange scientific information on the entire breadth of technology, science, and societal applications of the IoT.
The journal will place a high priority on timely publication, and provide a home for high quality.
Furthermore, IOT is interested in publishing topical Special Issues on any aspect of IOT.