Assessing and Improving the Mutation Testing Practice of PIT

2017 IEEE International Conference on Software Testing, Verification and Validation (ICST) Pub Date : 2016-01-11 DOI:10.1109/ICST.2017.47

Thomas Laurent, Anthony Ventresque, Mike Papadakis, Christopher Henard, Yves Le Traon

{"title":"Assessing and Improving the Mutation Testing Practice of PIT","authors":"Thomas Laurent, Anthony Ventresque, Mike Papadakis, Christopher Henard, Yves Le Traon","doi":"10.1109/ICST.2017.47","DOIUrl":null,"url":null,"abstract":"Mutation testing is extensively used in software testing studies. However, popular mutation testing tools use a restrictive set of mutants which does not conform to the community standards and mutation testing literature. This can be problematic since the effectiveness of mutation strongly depends on the used mutants. To investigate this issue we form an extended set of mutants and implement it on a popular mutation testing tool named PIT. We then show that in real-world projects the original mutants of PIT are easier to kill and lead to tests that score statistically lower than those of the extended set of mutants for a range of 35% to 70% of the studied classes. These results raise serious concerns regarding the validity of mutation-based experiments that use PIT. To further show the strengths of the extended mutants we also performed an analysis using a benchmark with mutation-adequate test cases and identified equivalent mutants. Our results confirmed that the extended mutants are more effective than a) the original version of PIT and b) two other popular mutation testing tools (major and muJava). In particular, our results demonstrate that the extended mutants are more effective by 23%, 12% and 7% than the mutants of the original PIT, major and muJava. They also show that the extended mutants are at least as strong as the mutants of all the other three tools together. To support future research, we make the new version of PIT, which is equipped with the extended mutants, publicly available.","PeriodicalId":112258,"journal":{"name":"2017 IEEE International Conference on Software Testing, Verification and Validation (ICST)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"53","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Software Testing, Verification and Validation (ICST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICST.2017.47","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 53

Abstract

Mutation testing is extensively used in software testing studies. However, popular mutation testing tools use a restrictive set of mutants which does not conform to the community standards and mutation testing literature. This can be problematic since the effectiveness of mutation strongly depends on the used mutants. To investigate this issue we form an extended set of mutants and implement it on a popular mutation testing tool named PIT. We then show that in real-world projects the original mutants of PIT are easier to kill and lead to tests that score statistically lower than those of the extended set of mutants for a range of 35% to 70% of the studied classes. These results raise serious concerns regarding the validity of mutation-based experiments that use PIT. To further show the strengths of the extended mutants we also performed an analysis using a benchmark with mutation-adequate test cases and identified equivalent mutants. Our results confirmed that the extended mutants are more effective than a) the original version of PIT and b) two other popular mutation testing tools (major and muJava). In particular, our results demonstrate that the extended mutants are more effective by 23%, 12% and 7% than the mutants of the original PIT, major and muJava. They also show that the extended mutants are at least as strong as the mutants of all the other three tools together. To support future research, we make the new version of PIT, which is equipped with the extended mutants, publicly available.

查看原文本刊更多论文

PIT突变检测实践的评估与改进

突变测试在软件测试研究中被广泛使用。然而，流行的突变检测工具使用的是一组限制性的突变体，不符合社区标准和突变检测文献。这可能是有问题的，因为突变的有效性在很大程度上取决于使用的突变体。为了研究这个问题，我们形成了一个扩展的突变集，并在一个名为PIT的流行突变测试工具上实现它。然后，我们表明，在现实世界的项目中，PIT的原始突变体更容易被杀死，并且在35%至70%的研究类别中导致测试得分低于扩展突变集的测试得分。这些结果引起了对使用PIT的基于突变的实验有效性的严重关注。为了进一步显示扩展突变的优势，我们还使用具有足够突变的测试用例的基准执行了分析，并确定了等效突变。我们的结果证实，扩展突变比a)原始版本的PIT和b)其他两种流行的突变测试工具(major和muJava)更有效。特别是，我们的研究结果表明，扩展突变体比原始PIT, major和muJava突变体的有效性分别提高了23%，12%和7%。他们还表明，扩展突变体的强度至少与其他三种工具的突变体加在一起一样强。为了支持未来的研究，我们公开了配备扩展突变体的新版本PIT。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE International Conference on Software Testing, Verification and Validation (ICST)

自引率

0.00%

发文量