A3Test: Assertion-Augmented Automated Test case generation

IF 3.8 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information and Software Technology Pub Date : 2024-08-30 DOI:10.1016/j.infsof.2024.107565

Saranya Alagarsamy, Chakkrit Tantithamthavorn, Aldeida Aleti

{"title":"A3Test: Assertion-Augmented Automated Test case generation","authors":"Saranya Alagarsamy, Chakkrit Tantithamthavorn, Aldeida Aleti","doi":"10.1016/j.infsof.2024.107565","DOIUrl":null,"url":null,"abstract":"<div><h3>Context:</h3><p>Test case generation is a critical yet challenging task in software development. Recently, AthenaTest – a Deep Learning (DL) approach for generating unit test cases has been proposed. However, our revisiting study reveals that AthenaTest can generate less than one-fifth of the test cases correctly, due to a lack of assertion knowledge and test signature verification.</p></div><div><h3>Objective:</h3><p>This paper introduces A3Test, a novel DL-based approach to the generation of test cases, enhanced with assertion knowledge and a mechanism to verify consistency of the name and signatures of the tests. A3Test aims to adapt domain knowledge from assertion generation to test case generation.</p></div><div><h3>Method:</h3><p>A3Test employs domain adaptation principles and introduces a verification approach to name consistency and test signatures. We evaluate its effectiveness using 5,278 focal methods from the Defects4j dataset.</p></div><div><h3>Results:</h3><p>Our findings indicate that A3Test outperforms AthenaTest and ChatUniTest. A3Test generates 2.16% to 395.43% more correct test cases, achieves 2.17% to 34.29% higher method coverage, and 25.64% higher line coverage. A3Test achieves 2.13% to 12.20% higher branch coverage, 2.22% to 12.20% higher mutation scores, and 2.44% to 55.56% more correct assertions compared to both ChatUniTest and AthenaTest respectively for one iteration. When generating multiple test cases per method A3Test still shows improvements and comparable efficacy to ChatUnitTest. A survey of developers reveals that the majority of the participants 70.51% agree that test cases generated by A3Test are more readable than those generated by EvoSuite.</p></div><div><h3>Conclusions:</h3><p>A3Test significantly enhances test case generation through its incorporation of assertion knowledge and test signature verification, contributing to the generation of correct test cases.</p></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"176 ","pages":"Article 107565"},"PeriodicalIF":3.8000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0950584924001708/pdfft?md5=76cf4413a9abbc636fe62d1466a54e41&pid=1-s2.0-S0950584924001708-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information and Software Technology","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950584924001708","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Context:

Test case generation is a critical yet challenging task in software development. Recently, AthenaTest – a Deep Learning (DL) approach for generating unit test cases has been proposed. However, our revisiting study reveals that AthenaTest can generate less than one-fifth of the test cases correctly, due to a lack of assertion knowledge and test signature verification.

Objective:

This paper introduces A3Test, a novel DL-based approach to the generation of test cases, enhanced with assertion knowledge and a mechanism to verify consistency of the name and signatures of the tests. A3Test aims to adapt domain knowledge from assertion generation to test case generation.

Method:

A3Test employs domain adaptation principles and introduces a verification approach to name consistency and test signatures. We evaluate its effectiveness using 5,278 focal methods from the Defects4j dataset.

Results:

Our findings indicate that A3Test outperforms AthenaTest and ChatUniTest. A3Test generates 2.16% to 395.43% more correct test cases, achieves 2.17% to 34.29% higher method coverage, and 25.64% higher line coverage. A3Test achieves 2.13% to 12.20% higher branch coverage, 2.22% to 12.20% higher mutation scores, and 2.44% to 55.56% more correct assertions compared to both ChatUniTest and AthenaTest respectively for one iteration. When generating multiple test cases per method A3Test still shows improvements and comparable efficacy to ChatUnitTest. A survey of developers reveals that the majority of the participants 70.51% agree that test cases generated by A3Test are more readable than those generated by EvoSuite.

Conclusions:

A3Test significantly enhances test case generation through its incorporation of assertion knowledge and test signature verification, contributing to the generation of correct test cases.

查看原文本刊更多论文

A3Test：断言增强自动测试案例生成

背景：生成测试用例是软件开发中一项关键而又具有挑战性的任务。最近，有人提出了一种用于生成单元测试用例的深度学习（DL）方法--AthenaTest。目标：本文介绍了 A3Test，一种基于深度学习的生成测试用例的新方法，它增强了断言知识和测试名称与签名一致性验证机制。方法：A3Test 采用了领域适应原则，并引入了名称一致性和测试签名的验证方法。我们使用 Defects4j 数据集中的 5278 个焦点方法对其有效性进行了评估。结果：我们的研究结果表明，A3Test 优于 AthenaTest 和 ChatUniTest。A3Test 生成的测试用例正确率提高了 2.16% 到 395.43%，方法覆盖率提高了 2.17% 到 34.29%，线路覆盖率提高了 25.64%。在一次迭代中，与 ChatUniTest 和 AthenaTest 相比，A3Test 的分支覆盖率高出 2.13% 到 12.20%，突变分数高出 2.22% 到 12.20%，断言正确率高出 2.44% 到 55.56%。当为每个方法生成多个测试用例时，A3Test 仍有改进，其功效与 ChatUnitTest 相当。对开发人员的调查显示，70.51%的参与者认为 A3Test 生成的测试用例比 EvoSuite 生成的测试用例更具可读性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information and Software Technology 工程技术-计算机：软件工程

CiteScore

9.10

自引率

7.70%

发文量

164

审稿时长

9.6 weeks

期刊介绍： Information and Software Technology is the international archival journal focusing on research and experience that contributes to the improvement of software development practices. The journal''s scope includes methods and techniques to better engineer software and manage its development. Articles submitted for review should have a clear component of software engineering or address ways to improve the engineering and management of software development. Areas covered by the journal include: • Software management, quality and metrics, • Software processes, • Software architecture, modelling, specification, design and programming • Functional and non-functional software requirements • Software testing and verification & validation • Empirical studies of all aspects of engineering and managing software development Short Communications is a new section dedicated to short papers addressing new ideas, controversial opinions, "Negative" results and much more. Read the Guide for authors for more information. The journal encourages and welcomes submissions of systematic literature studies (reviews and maps) within the scope of the journal. Information and Software Technology is the premiere outlet for systematic literature studies in software engineering.