{"title":"Fast Estimation of Shapley Value by Stratified Sampling and Its Application in Explaining Fault Diagnosis Neural Network","authors":"Biao He;Yongfang Mao;Yi Qin","doi":"10.1109/JIOT.2024.3520804","DOIUrl":null,"url":null,"abstract":"There are two problems when the Shapley value is employed to interpret deep neural networks. The first issue is that the computational complexity increases exponentially with the number of players. The other issue is that the contribution evaluation index cannot effectively reflect the nonlinearity of the classification function (i.e., SoftMax), which is often neglected in previous studies. To address these challenges, a method for fast estimating the Shapley value based on the stratified sampling and the Mann–Whitney test (SSMW-Shap) is proposed in this work. In SSMW-Shap, a new contribution index is designed to accurately measure the contribution of each player by leveraging the distance between the outputs of two specific neurons, accounting for the nonlinearity of SoftMax and the efficiency of the Shapley value. Based on the proposed index, a simplified two-player coalition evaluation method is built to select important affiliates for each player, significantly reducing the computational complexity of the Shapley value. Then, the Shapley value is fast estimated by combining the stratified sampling and the Mann–Whitney test. In this process, the Mann–Whitney test is employed to estimate the difference between the samples and the population, and sample expansion is executed for the failed test, improving the estimation accuracy. Finally, a simple but reasonable method based on the proposed index is designed to quantitatively evaluate the explanation accuracy of each method. The proposed method is verified using two classic classification networks trained on two bearing datasets.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 9","pages":"12410-12418"},"PeriodicalIF":8.9000,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10810348/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
There are two problems when the Shapley value is employed to interpret deep neural networks. The first issue is that the computational complexity increases exponentially with the number of players. The other issue is that the contribution evaluation index cannot effectively reflect the nonlinearity of the classification function (i.e., SoftMax), which is often neglected in previous studies. To address these challenges, a method for fast estimating the Shapley value based on the stratified sampling and the Mann–Whitney test (SSMW-Shap) is proposed in this work. In SSMW-Shap, a new contribution index is designed to accurately measure the contribution of each player by leveraging the distance between the outputs of two specific neurons, accounting for the nonlinearity of SoftMax and the efficiency of the Shapley value. Based on the proposed index, a simplified two-player coalition evaluation method is built to select important affiliates for each player, significantly reducing the computational complexity of the Shapley value. Then, the Shapley value is fast estimated by combining the stratified sampling and the Mann–Whitney test. In this process, the Mann–Whitney test is employed to estimate the difference between the samples and the population, and sample expansion is executed for the failed test, improving the estimation accuracy. Finally, a simple but reasonable method based on the proposed index is designed to quantitatively evaluate the explanation accuracy of each method. The proposed method is verified using two classic classification networks trained on two bearing datasets.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.