Automatic test case generation using natural language processing: A systematic mapping study

IF 4.3 2区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information and Software Technology Pub Date : 2025-10-22 DOI:10.1016/j.infsof.2025.107929

Jordy Navarro, Ronald Ibarra

{"title":"Automatic test case generation using natural language processing: A systematic mapping study","authors":"Jordy Navarro, Ronald Ibarra","doi":"10.1016/j.infsof.2025.107929","DOIUrl":null,"url":null,"abstract":"<div><h3>Context:</h3><div>Artificial intelligence (AI) has made significant progress in recent years, which has motivated its use in many disciplines and industrial domains, including software engineering, especially in the testing process, where many research efforts have been made. These studies focus on the automatic test case generation using natural language processing (NLP), an emerging branch of AI. Despite these efforts, the literature lacks a structured and systematic approach, since reported mappings and systematic literature reviews have limitations in their scope.</div></div><div><h3>Objective:</h3><div>This study aims to systematically organize and synthesize the existing literature to establish the state of the art in the automatic generation of test cases using NLP.</div></div><div><h3>Methodology:</h3><div>We conducted systematic mapping following Kai Petersen’s methodology, exploring five databases. The initial search yielded 1262 articles, of which 61 were selected. 16 thematic questions and 4 non-thematic questions were posed.</div></div><div><h3>Results:</h3><div>The findings reveal an increase in the number of articles published in journals starting in 2022. Among the most reported NLP techniques are POS tagging, dependency parsing and tokenization, implemented with tools such as Stanford Core NLP and NLTK. The reported approaches mostly achieved a medium level of automation, using natural and formal language requirements as main inputs. Only 9 articles explicitly mention the use of test case design techniques, such as boundary value analysis, equivalent class partitioning, state transition and decision tables.</div></div><div><h3>Conclusions:</h3><div>We systematically identified and organized the reported primary studies on the automatic or semi-automatic generation of software test cases applying NLP.</div></div>","PeriodicalId":54983,"journal":{"name":"Information and Software Technology","volume":"189 ","pages":"Article 107929"},"PeriodicalIF":4.3000,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information and Software Technology","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095058492500268X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Context:

Artificial intelligence (AI) has made significant progress in recent years, which has motivated its use in many disciplines and industrial domains, including software engineering, especially in the testing process, where many research efforts have been made. These studies focus on the automatic test case generation using natural language processing (NLP), an emerging branch of AI. Despite these efforts, the literature lacks a structured and systematic approach, since reported mappings and systematic literature reviews have limitations in their scope.

Objective:

This study aims to systematically organize and synthesize the existing literature to establish the state of the art in the automatic generation of test cases using NLP.

Methodology:

We conducted systematic mapping following Kai Petersen’s methodology, exploring five databases. The initial search yielded 1262 articles, of which 61 were selected. 16 thematic questions and 4 non-thematic questions were posed.

Results:

The findings reveal an increase in the number of articles published in journals starting in 2022. Among the most reported NLP techniques are POS tagging, dependency parsing and tokenization, implemented with tools such as Stanford Core NLP and NLTK. The reported approaches mostly achieved a medium level of automation, using natural and formal language requirements as main inputs. Only 9 articles explicitly mention the use of test case design techniques, such as boundary value analysis, equivalent class partitioning, state transition and decision tables.

Conclusions:

We systematically identified and organized the reported primary studies on the automatic or semi-automatic generation of software test cases applying NLP.

查看原文本刊更多论文

使用自然语言处理的自动测试用例生成：系统的映射研究

背景：人工智能（AI）近年来取得了重大进展，这推动了它在许多学科和工业领域的应用，包括软件工程，特别是在测试过程中，已经做出了许多研究工作。这些研究集中在使用自然语言处理（NLP）自动生成测试用例，这是人工智能的一个新兴分支。尽管做出了这些努力，但文献缺乏结构化和系统的方法，因为报告的映射和系统的文献综述在其范围内存在局限性。目的：本研究旨在系统地整理和综合现有文献，建立基于NLP的测试用例自动生成的最新技术。方法：我们按照Kai Petersen的方法进行了系统的映射，探索了五个数据库。最初的搜索产生了1262篇文章，其中61篇被选中。提出了16个专题问题和4个非专题问题。结果：研究结果显示，从2022年开始，在期刊上发表的文章数量有所增加。在报道最多的NLP技术中，有词性标注、依赖解析和标记化，这些技术是用斯坦福核心NLP和NLTK等工具实现的。所报告的方法大多实现了中等程度的自动化，使用自然和正式语言需求作为主要输入。只有9篇文章明确提到了测试用例设计技术的使用，比如边界值分析、等价类划分、状态转换和决策表。结论：我们系统地识别和组织了关于应用NLP自动或半自动生成软件测试用例的报告的主要研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information and Software Technology 工程技术-计算机：软件工程

CiteScore

9.10

自引率

7.70%

发文量

164

审稿时长

9.6 weeks

期刊介绍： Information and Software Technology is the international archival journal focusing on research and experience that contributes to the improvement of software development practices. The journal''s scope includes methods and techniques to better engineer software and manage its development. Articles submitted for review should have a clear component of software engineering or address ways to improve the engineering and management of software development. Areas covered by the journal include: • Software management, quality and metrics, • Software processes, • Software architecture, modelling, specification, design and programming • Functional and non-functional software requirements • Software testing and verification & validation • Empirical studies of all aspects of engineering and managing software development Short Communications is a new section dedicated to short papers addressing new ideas, controversial opinions, "Negative" results and much more. Read the Guide for authors for more information. The journal encourages and welcomes submissions of systematic literature studies (reviews and maps) within the scope of the journal. Information and Software Technology is the premiere outlet for systematic literature studies in software engineering.