An Extensive Study on Cross-Project Predictive Mutation Testing

2019 12th IEEE Conference on Software Testing, Validation and Verification (ICST) Pub Date : 2019-04-01 DOI:10.1109/ICST.2019.00025

Dongyu Mao, Lingchao Chen, Lingming Zhang

{"title":"An Extensive Study on Cross-Project Predictive Mutation Testing","authors":"Dongyu Mao, Lingchao Chen, Lingming Zhang","doi":"10.1109/ICST.2019.00025","DOIUrl":null,"url":null,"abstract":"Mutation testing is a powerful technique for evaluating the quality of test suite which plays a key role in ensuring software quality. The concept of mutation testing has also been widely used in other software engineering studies, e.g., test generation, fault localization, and program repair. During the process of mutation testing, large number of mutants may be generated and then executed against the test suite to examine whether they can be killed, making the process extremely computational expensive. Several techniques have been proposed to speed up this process, including selective, weakened, and predictive mutation testing. Among those techniques, Predictive Mutation Testing (PMT) tries to build a classification model based on an amount of mutant execution records to predict whether coming new mutants would be killed or alive without mutant execution, and can achieve significant mutation cost reduction. In PMT, each mutant is represented as a list of features related to the mutant itself and the test suite, transforming the mutation testing problem to a binary classification problem. In this paper, we perform an extensive study on the effectiveness and efficiency of the promising PMT technique under the cross-project setting using a total 654 real world projects with more than 4 Million mutants. Our work also complements the original PMT work by considering more features and the powerful deep learning models. The experimental results show an average of over 0.85 prediction accuracy on 654 projects using cross validation, demonstrating the effectiveness of PMT. Meanwhile, a clear speed up is also observed with an average of 28.7X compared to traditional mutation testing with 5 threads. In addition, we analyze the importance of different groups of features in classification model, which provides important implications for the future research.","PeriodicalId":446827,"journal":{"name":"2019 12th IEEE Conference on Software Testing, Validation and Verification (ICST)","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"31","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 12th IEEE Conference on Software Testing, Validation and Verification (ICST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICST.2019.00025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 31

Abstract

Mutation testing is a powerful technique for evaluating the quality of test suite which plays a key role in ensuring software quality. The concept of mutation testing has also been widely used in other software engineering studies, e.g., test generation, fault localization, and program repair. During the process of mutation testing, large number of mutants may be generated and then executed against the test suite to examine whether they can be killed, making the process extremely computational expensive. Several techniques have been proposed to speed up this process, including selective, weakened, and predictive mutation testing. Among those techniques, Predictive Mutation Testing (PMT) tries to build a classification model based on an amount of mutant execution records to predict whether coming new mutants would be killed or alive without mutant execution, and can achieve significant mutation cost reduction. In PMT, each mutant is represented as a list of features related to the mutant itself and the test suite, transforming the mutation testing problem to a binary classification problem. In this paper, we perform an extensive study on the effectiveness and efficiency of the promising PMT technique under the cross-project setting using a total 654 real world projects with more than 4 Million mutants. Our work also complements the original PMT work by considering more features and the powerful deep learning models. The experimental results show an average of over 0.85 prediction accuracy on 654 projects using cross validation, demonstrating the effectiveness of PMT. Meanwhile, a clear speed up is also observed with an average of 28.7X compared to traditional mutation testing with 5 threads. In addition, we analyze the importance of different groups of features in classification model, which provides important implications for the future research.

查看原文本刊更多论文

跨项目预测突变检测的广泛研究

突变测试是一种强有力的测试套件质量评估技术，在保证软件质量方面起着关键作用。突变测试的概念也被广泛应用于其他软件工程研究中，例如测试生成、故障定位和程序修复。在突变测试过程中，可能会生成大量的突变体，然后对测试套件执行以检查它们是否可以被杀死，这使得该过程的计算成本非常高。已经提出了几种加速这一过程的技术，包括选择性、弱化和预测性突变检测。其中，预测性突变测试(Predictive Mutation Testing, PMT)试图根据大量的突变执行记录建立一个分类模型，预测即将到来的新突变体在不执行突变的情况下是被杀死还是存活，可以显著降低突变成本。在PMT中，每个突变被表示为与突变本身和测试套件相关的特征列表，将突变测试问题转换为二元分类问题。在本文中，我们对跨项目设置下有前途的PMT技术的有效性和效率进行了广泛的研究，使用了总共654个真实世界的项目，超过400万个突变体。我们的工作还通过考虑更多的特征和强大的深度学习模型来补充原始的PMT工作。实验结果表明，对654个项目进行交叉验证，平均预测精度超过0.85，证明了PMT的有效性。同时，与使用5个线程的传统突变测试相比，还观察到明显的速度提高，平均提高了28.7倍。此外，我们还分析了不同组的特征在分类模型中的重要性，为未来的研究提供了重要的启示。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 12th IEEE Conference on Software Testing, Validation and Verification (ICST)

自引率

0.00%

发文量