What to Blame? On the Granularity of Fault Localization for Deep Neural Networks

Matias Duran, Xiaoyi Zhang, Paolo Arcaini, F. Ishikawa
DOI: 10.1109/ISSRE52982.2021.00037
Published in: 2021 IEEE 32nd International Symposium on Software Reliability Engineering (ISSRE), October 2021
Citations: 4

Abstract

Validating Deep Neural Networks (DNNs) used for classification is of paramount importance; one approach consists of (i) executing the DNN over the test dataset, (ii) collecting information about the classifications, and (iii) applying fault localization (FL) techniques to identify the neurons responsible for the misclassifications. A DNN can exhibit multiple misclassification types, so the neurons responsible for one type may differ from those responsible for another. However, depending on the granularity of the analyzed dataset, FL may not reveal these differences: failure types that are more frequent in the dataset may mask less frequent ones. We propose a way to perform FL for DNNs that avoids this masking effect by selecting test data at a finer granularity. We conduct an empirical study, using a spectrum-based FL approach for DNNs, to assess how FL results change with the granularity of the analyzed test data. Specifically, we perform FL using test data at two granularities: a state-of-the-art approach that considers all misclassifications for a given class together, and the proposed fine-grained approach. Results show that FL should be done for each misclassification, so that practitioners obtain a more detailed analysis of the DNN's faults and can make a more informed decision on what to fix in the DNN.
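To make the pipeline above concrete, the following is a minimal sketch of fine-grained spectrum-based FL: test inputs are first grouped by misclassification type (so that a frequent failure type cannot mask a rare one), and a suspiciousness score is then computed per neuron from its hit spectrum within each group. This is an illustration, not the paper's implementation: the choice of the Ochiai formula, the set-based hit-spectrum representation, and all function names are assumptions.

```python
import math
from collections import defaultdict


def group_by_misclassification(true_labels, pred_labels):
    """Fine-grained grouping: one bucket of test-input indices per
    (true class, predicted class) pair, so that frequent failure
    types cannot mask rarer ones in the later FL step."""
    groups = defaultdict(list)
    for i, (t, p) in enumerate(zip(true_labels, pred_labels)):
        if t != p:
            groups[(t, p)].append(i)
    return groups


def ochiai(ef, ep, nf):
    """One common spectrum-based suspiciousness metric (assumed here):
    ef/ep = failing/passing inputs that activated the neuron,
    nf = failing inputs that did not activate it."""
    denom = math.sqrt((ef + nf) * (ef + ep))
    return ef / denom if denom else 0.0


def localize(activations, passing, failing):
    """activations[i] is the set of neurons active on input i (its hit
    spectrum). Returns neurons ranked by descending suspiciousness."""
    scores = {}
    neurons = set().union(*(activations[i] for i in passing + failing))
    for n in neurons:
        ef = sum(n in activations[i] for i in failing)
        ep = sum(n in activations[i] for i in passing)
        scores[n] = ochiai(ef, ep, len(failing) - ef)
    return sorted(scores, key=scores.get, reverse=True)
```

Running `localize` once per group returned by `group_by_misclassification` (rather than once per class) is the granularity difference the study investigates: each misclassification type gets its own neuron ranking.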