Repairing Order-Dependent Flaky Tests via Test Generation

2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE) Pub Date : 2022-05-01 DOI:10.1145/3510003.3510173

Chengpeng Li, Chenguang Zhu, Wenxi Wang, A. Shi

{"title":"Repairing Order-Dependent Flaky Tests via Test Generation","authors":"Chengpeng Li, Chenguang Zhu, Wenxi Wang, A. Shi","doi":"10.1145/3510003.3510173","DOIUrl":null,"url":null,"abstract":"Flaky tests are tests that pass or fail nondeterministically on the same version of code. These tests can mislead developers concerning the quality of their code changes during regression testing. A common kind of flaky tests are order-dependent tests, whose pass/ fail outcomes depend on the test order in which they are run. Such tests have different outcomes because other tests running before them pollute shared state. Prior work has proposed repairing order-dependent tests by searching for existing tests, known as “cleaners”, that reset the shared state, allowing the order-dependent test to pass when run after a polluted shared state. The code within a cleaner represents a patch to repair the order-dependent test. However, this technique requires cleaners to already exist in the test suite. We propose ODRepair, an automated technique to repair order-dependent tests even without existing cleaners. The idea is to first determine the exact polluted shared state that results in the order-dependent test to fail and then generate code that can modify and reset the shared state so that the order-dependent test can pass. We focus on shared state through internal heap memory, in particular shared state reachable from static fields. Once we know which static field leads to the pollution, we search for reset-methods in the code-base that can potentially access and modify state reachable from that static field. We then apply an automatic test-generation tool to generate method-call sequences, targeting these reset-methods. Our evaluation on 327 order-dependent tests from a publicly available dataset shows that ODRepair automatically identifies the polluted static field for 181 tests, and it can generate patches for 141 of these tests. Compared against state-of-the-art iFixFlakies, ODRepair can generate patches for 24 tests that iFixFlakies cannot.","PeriodicalId":202896,"journal":{"name":"2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3510003.3510173","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

Flaky tests are tests that pass or fail nondeterministically on the same version of code. These tests can mislead developers concerning the quality of their code changes during regression testing. A common kind of flaky tests are order-dependent tests, whose pass/ fail outcomes depend on the test order in which they are run. Such tests have different outcomes because other tests running before them pollute shared state. Prior work has proposed repairing order-dependent tests by searching for existing tests, known as “cleaners”, that reset the shared state, allowing the order-dependent test to pass when run after a polluted shared state. The code within a cleaner represents a patch to repair the order-dependent test. However, this technique requires cleaners to already exist in the test suite. We propose ODRepair, an automated technique to repair order-dependent tests even without existing cleaners. The idea is to first determine the exact polluted shared state that results in the order-dependent test to fail and then generate code that can modify and reset the shared state so that the order-dependent test can pass. We focus on shared state through internal heap memory, in particular shared state reachable from static fields. Once we know which static field leads to the pollution, we search for reset-methods in the code-base that can potentially access and modify state reachable from that static field. We then apply an automatic test-generation tool to generate method-call sequences, targeting these reset-methods. Our evaluation on 327 order-dependent tests from a publicly available dataset shows that ODRepair automatically identifies the polluted static field for 181 tests, and it can generate patches for 141 of these tests. Compared against state-of-the-art iFixFlakies, ODRepair can generate patches for 24 tests that iFixFlakies cannot.

查看原文本刊更多论文

通过测试生成修复顺序相关的片状测试

不稳定的测试是在同一版本的代码上不确定地通过或失败的测试。在回归测试期间，这些测试可能会误导开发人员对代码更改的质量的认识。一种常见的片状测试是顺序相关的测试，其通过/失败结果取决于它们运行的测试顺序。这些测试有不同的结果，因为在它们之前运行的其他测试会污染共享状态。先前的工作建议通过搜索重置共享状态的现有测试(称为“清除器”)来修复依赖顺序的测试，从而允许依赖顺序的测试在受污染的共享状态之后运行时通过。清理程序中的代码表示修复与顺序相关的测试的补丁。然而，这种技术要求测试套件中已经存在清理器。我们提出ODRepair，这是一种自动化技术，即使没有现有的清洁器，也可以修复依赖于顺序的测试。其思想是首先确定导致依赖顺序的测试失败的受污染的共享状态，然后生成可以修改和重置共享状态的代码，以便依赖顺序的测试可以通过。我们关注通过内部堆内存的共享状态，特别是从静态字段可访问的共享状态。一旦我们知道哪个静态字段导致了污染，我们就在代码库中搜索可能访问和修改可从该静态字段访问的状态的重置方法。然后我们应用一个自动测试生成工具来生成方法调用序列，以这些重置方法为目标。我们对来自公开可用数据集的327个顺序相关测试的评估表明，ODRepair为181个测试自动识别受污染的静态场，并为其中141个测试生成补丁。与最先进的iFixFlakies相比，ODRepair可以为iFixFlakies无法生成的24个测试生成补丁。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE)

自引率

0.00%

发文量