Semantic Image Fuzzing of AI Perception Systems

2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE) Pub Date : 2022-05-01 DOI:10.1145/3510003.3510212

Trey Woodlief, Sebastian G. Elbaum, Kevin Sullivan

{"title":"Semantic Image Fuzzing of AI Perception Systems","authors":"Trey Woodlief, Sebastian G. Elbaum, Kevin Sullivan","doi":"10.1145/3510003.3510212","DOIUrl":null,"url":null,"abstract":"Perception systems enable autonomous systems to interpret raw sensor readings of the physical world. Testing of perception systems aims to reveal misinterpretations that could cause system failures. Current testing methods, however, are inadequate. The cost of human interpretation and annotation of real-world input data is high, so manual test suites tend to be small. The simulation-reality gap reduces the validity of test results based on simulated worlds. And methods for synthesizing test inputs do not provide corresponding expected interpretations. To address these limitations, we developed semSensFuzz, a new approach to fuzz testing of perception systems based on semantic mutation of test cases that pair realworld sensor readings with their ground-truth interpretations. We implemented our approach to assess its feasibility and potential to improve software testing for perception systems. We used it to generate 150,000 semantically mutated image inputs for five state-of-the-art perception systems. We found that it synthesized tests with novel and subjectively realistic image inputs, and that it discovered inputs that revealed significant inconsistencies between the specified and computed interpretations. We also found that it produced such test cases at a cost that was very low compared to that of manual semantic annotation of real-world images.","PeriodicalId":202896,"journal":{"name":"2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3510003.3510212","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

Perception systems enable autonomous systems to interpret raw sensor readings of the physical world. Testing of perception systems aims to reveal misinterpretations that could cause system failures. Current testing methods, however, are inadequate. The cost of human interpretation and annotation of real-world input data is high, so manual test suites tend to be small. The simulation-reality gap reduces the validity of test results based on simulated worlds. And methods for synthesizing test inputs do not provide corresponding expected interpretations. To address these limitations, we developed semSensFuzz, a new approach to fuzz testing of perception systems based on semantic mutation of test cases that pair realworld sensor readings with their ground-truth interpretations. We implemented our approach to assess its feasibility and potential to improve software testing for perception systems. We used it to generate 150,000 semantically mutated image inputs for five state-of-the-art perception systems. We found that it synthesized tests with novel and subjectively realistic image inputs, and that it discovered inputs that revealed significant inconsistencies between the specified and computed interpretations. We also found that it produced such test cases at a cost that was very low compared to that of manual semantic annotation of real-world images.

查看原文本刊更多论文

人工智能感知系统的语义图像模糊

感知系统使自主系统能够解释物理世界的原始传感器读数。感知系统的测试旨在揭示可能导致系统故障的误解。然而，目前的测试方法是不够的。人工解释和注释实际输入数据的成本很高，因此手动测试套件往往很小。仿真与现实的差距降低了基于仿真世界的测试结果的有效性。综合测试输入的方法不能提供相应的预期解释。为了解决这些限制，我们开发了semSensFuzz，这是一种基于测试用例的语义突变对感知系统进行模糊测试的新方法，该测试用例将真实世界的传感器读数与其基础真值解释配对。我们实施了我们的方法来评估其可行性和潜力，以改善感知系统的软件测试。我们用它为五个最先进的感知系统生成了150,000个语义突变的图像输入。我们发现它用新颖和主观逼真的图像输入合成了测试，并且它发现输入显示了指定和计算解释之间的显着不一致。我们还发现，与对真实世界的图像进行手动语义注释相比，它产生这样的测试用例的成本非常低。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)

自引率

0.00%

发文量