Self-Checking Deep Neural Networks in Deployment

Yan Xiao, Ivan Beschastnikh, David S. Rosenblum, Changsheng Sun, Sebastian G. Elbaum, Yun Lin, J. Dong
{"title":"Self-Checking Deep Neural Networks in Deployment","authors":"Yan Xiao, Ivan Beschastnikh, David S. Rosenblum, Changsheng Sun, Sebastian G. Elbaum, Yun Lin, J. Dong","doi":"10.1109/ICSE43902.2021.00044","DOIUrl":null,"url":null,"abstract":"The widespread adoption of Deep Neural Networks (DNNs) in important domains raises questions about the trustworthiness of DNN outputs. Even a highly accurate DNN will make mistakes some of the time, and in settings like self-driving vehicles these mistakes must be quickly detected and properly dealt with in deployment. Just as our community has developed effective techniques and mechanisms to monitor and check programmed components, we believe it is now necessary to do the same for DNNs. In this paper we present DNN self-checking as a process by which internal DNN layer features are used to check DNN predictions. We detail SelfChecker, a self-checking system that monitors DNN outputs and triggers an alarm if the internal layer features of the model are inconsistent with the final prediction. SelfChecker also provides advice in the form of an alternative prediction. We evaluated SelfChecker on four popular image datasets and three DNN models and found that SelfChecker triggers correct alarms on 60.56% of wrong DNN predictions, and false alarms on 2.04% of correct DNN predictions. This is a substantial improvement over prior work (SelfOracle, Dissector, and ConfidNet). In experiments with self-driving car scenarios, SelfChecker triggers more correct alarms than SelfOracle for two DNN models (DAVE-2 and Chauffeur) with comparable false alarms. Our implementation is available as open source.","PeriodicalId":305167,"journal":{"name":"2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSE43902.2021.00044","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 25

Abstract

The widespread adoption of Deep Neural Networks (DNNs) in important domains raises questions about the trustworthiness of DNN outputs. Even a highly accurate DNN will make mistakes some of the time, and in settings like self-driving vehicles these mistakes must be quickly detected and properly dealt with in deployment. Just as our community has developed effective techniques and mechanisms to monitor and check programmed components, we believe it is now necessary to do the same for DNNs. In this paper we present DNN self-checking as a process by which internal DNN layer features are used to check DNN predictions. We detail SelfChecker, a self-checking system that monitors DNN outputs and triggers an alarm if the internal layer features of the model are inconsistent with the final prediction. SelfChecker also provides advice in the form of an alternative prediction. We evaluated SelfChecker on four popular image datasets and three DNN models and found that SelfChecker triggers correct alarms on 60.56% of wrong DNN predictions, and false alarms on 2.04% of correct DNN predictions. This is a substantial improvement over prior work (SelfOracle, Dissector, and ConfidNet). In experiments with self-driving car scenarios, SelfChecker triggers more correct alarms than SelfOracle for two DNN models (DAVE-2 and Chauffeur) with comparable false alarms. Our implementation is available as open source.
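The core idea, checking a model's final prediction against what its internal layers suggest, can be illustrated with a short sketch. The code below is a simplified illustration, not the authors' implementation: the class and callable names are hypothetical, and SelfChecker itself builds kernel density estimates of per-layer features for each class and performs layer selection, both of which are omitted here.

```python
# A minimal sketch of the self-checking idea, assuming each monitored layer
# already has a predictor (e.g. fit on training-set layer features) that maps
# that layer's features to a class label. Not the SelfChecker implementation.
from collections import Counter
from typing import Callable, List, Sequence, Tuple

import numpy as np


class SelfCheckSketch:
    """Raise an alarm when internal-layer predictions disagree with the final one."""

    def __init__(self, layer_predictors: Sequence[Callable[[np.ndarray], int]]):
        # One callable per monitored internal layer; each maps the layer's
        # features for a single input to a predicted class label.
        self.layer_predictors = layer_predictors

    def check(
        self,
        layer_features: Sequence[np.ndarray],
        final_prediction: int,
    ) -> Tuple[bool, int]:
        # Infer a class from each monitored layer's features.
        layer_votes: List[int] = [
            predict(feats)
            for predict, feats in zip(self.layer_predictors, layer_features)
        ]
        votes = Counter(layer_votes)
        majority_class, _ = votes.most_common(1)[0]

        # Alarm if fewer than half of the layers agree with the model's output;
        # the advice (alternative prediction) is the layers' majority vote.
        alarm = votes.get(final_prediction, 0) < len(layer_votes) / 2
        advice = majority_class if alarm else final_prediction
        return alarm, advice
```

In deployment, such a checker would run alongside the DNN: the layer features and final prediction for each input are passed to `check`, and a raised alarm signals that the prediction should not be trusted, with the majority vote offered as an alternative.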