A Systematic Evaluation of Problematic Tests Generated by EvoSuite

2019 IEEE/ACM 41st International Conference on Software Engineering: Companion Proceedings (ICSE-Companion) Pub Date : 2019-05-25 DOI:10.1109/ICSE-Companion.2019.00068

Zhiyu Fan

引用次数: 4

Abstract

With the rapidly growing scale of modern software, the reliability of software systems has become essential. To ease the developers' pressure of writing unit tests manually, test generation tools such as EvoSuite and Randoop were proposed. Although these approaches have been shown to be able to automatically generate tests for achieving high coverage, the generated tests may be ineffective in detecting real faults. Particularly, these automatically generated tests may suffer from several problems (we call them problematic tests): (1) incorrect oracle. (2) unexpected exception/error. (3) flaky test. We present a comprehensive study of EvoSuite in Defects4j, and performed a detailed analysis of the reasons behind these automatically generated problematic tests. Our analysis identifies 528 problematic tests: 208 (39.4%) of them are caused by incorrect oracle, 319 (60.4%) are caused by unexpected exception/error, and one flaky test.

查看原文本刊更多论文

对EvoSuite生成的问题测试进行系统评估

随着现代软件规模的迅速增长，软件系统的可靠性变得至关重要。为了减轻开发人员手工编写单元测试的压力，开发人员提出了EvoSuite和Randoop等测试生成工具。尽管这些方法已经被证明能够自动生成测试以获得高覆盖率，但是生成的测试在检测真正的故障时可能是无效的。特别是，这些自动生成的测试可能会遇到几个问题(我们称之为有问题的测试):(1)不正确的oracle。(2)意外异常/错误。(3)片状试验。我们在缺陷4j中对EvoSuite进行了全面的研究，并对这些自动生成的问题测试背后的原因进行了详细的分析。我们的分析确定了528个有问题的测试:其中208个(39.4%)是由不正确的oracle引起的，319个(60.4%)是由意外异常/错误引起的，还有一个零散的测试。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE/ACM 41st International Conference on Software Engineering: Companion Proceedings (ICSE-Companion)

自引率

0.00%

发文量