Efficient Online Testing for DNN-Enabled Systems using Surrogate-Assisted and Many-Objective Optimization

2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE) Pub Date : 2022-05-01 DOI:10.1145/3510003.3510188

Fitash Ul Haq, Donghwan Shin, L. Briand

{"title":"Efficient Online Testing for DNN-Enabled Systems using Surrogate-Assisted and Many-Objective Optimization","authors":"Fitash Ul Haq, Donghwan Shin, L. Briand","doi":"10.1145/3510003.3510188","DOIUrl":null,"url":null,"abstract":"With the recent advances of Deep Neural Networks (DNNs) in real-world applications, such as Automated Driving Systems (ADS) for self-driving cars, ensuring the reliability and safety of such DNN-enabled Systems emerges as a fundamental topic in software testing. One of the essential testing phases of such DNN-enabled systems is online testing, where the system under test is embedded into a specific and often simulated application environment (e.g., a driving environment) and tested in a closed-loop mode in interaction with the environment. However, despite the importance of online testing for detecting safety violations, automatically generating new and diverse test data that lead to safety violations presents the following challenges: (1) there can be many safety requirements to be considered at the same time, (2) running a high-fidelity simulator is often very computationally-intensive, and (3) the space of all possible test data that may trigger safety violations is too large to be exhaustively explored. In this paper, we address the challenges by proposing a novel approach, called SAMOTA (Surrogate-Assisted Many-Objective Testing Approach), extending existing many-objective search algorithms for test suite generation to efficiently utilize surrogate models that mimic the simulator, but are much less expensive to run. Empirical evaluation results on Pylot, an advanced ADS composed of multiple DNNs, using CARLA, a high-fidelity driving simulator, show that SAMOTA is significantly more effective and efficient at detecting unknown safety requirement violations than state-of-the-art many-objective test suite generation algorithms and random search. In other words, SAMOTA appears to be a key enabler technology for online testing in practice.","PeriodicalId":202896,"journal":{"name":"2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3510003.3510188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 28

Abstract

With the recent advances of Deep Neural Networks (DNNs) in real-world applications, such as Automated Driving Systems (ADS) for self-driving cars, ensuring the reliability and safety of such DNN-enabled Systems emerges as a fundamental topic in software testing. One of the essential testing phases of such DNN-enabled systems is online testing, where the system under test is embedded into a specific and often simulated application environment (e.g., a driving environment) and tested in a closed-loop mode in interaction with the environment. However, despite the importance of online testing for detecting safety violations, automatically generating new and diverse test data that lead to safety violations presents the following challenges: (1) there can be many safety requirements to be considered at the same time, (2) running a high-fidelity simulator is often very computationally-intensive, and (3) the space of all possible test data that may trigger safety violations is too large to be exhaustively explored. In this paper, we address the challenges by proposing a novel approach, called SAMOTA (Surrogate-Assisted Many-Objective Testing Approach), extending existing many-objective search algorithms for test suite generation to efficiently utilize surrogate models that mimic the simulator, but are much less expensive to run. Empirical evaluation results on Pylot, an advanced ADS composed of multiple DNNs, using CARLA, a high-fidelity driving simulator, show that SAMOTA is significantly more effective and efficient at detecting unknown safety requirement violations than state-of-the-art many-objective test suite generation algorithms and random search. In other words, SAMOTA appears to be a key enabler technology for online testing in practice.

查看原文本刊更多论文

使用代理辅助和多目标优化的dnn支持系统的高效在线测试

随着深度神经网络(dnn)在实际应用中的最新进展，例如自动驾驶汽车的自动驾驶系统(ADS)，确保这种支持dnn的系统的可靠性和安全性成为软件测试中的一个基本主题。这种支持dnn的系统的基本测试阶段之一是在线测试，其中被测系统被嵌入到特定且通常是模拟的应用环境(例如，驾驶环境)中，并在与环境交互的闭环模式下进行测试。然而，尽管在线测试对于检测安全违规具有重要意义，但自动生成导致安全违规的新的和多样化的测试数据带来了以下挑战:(1)同时需要考虑许多安全要求;(2)运行高保真模拟器通常是非常密集的计算;(3)所有可能引发安全违规的测试数据的空间太大，无法进行详尽的探索。在本文中，我们通过提出一种称为SAMOTA(代理辅助多目标测试方法)的新方法来解决这些挑战，该方法扩展了现有的多目标搜索算法，用于生成测试套件，以有效地利用模拟模拟器的代理模型，但运行成本要低得多。使用高保真驾驶模拟器CARLA对由多个dnn组成的先进ADS Pylot进行的实证评估结果表明，SAMOTA在检测未知安全要求违规方面比最先进的多目标测试套件生成算法和随机搜索算法更加有效和高效。换句话说，SAMOTA在实践中似乎是在线测试的关键使能技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)

自引率

0.00%

发文量