{"title":"Improving test suite generation quality through machine learning-driven boundary value analysis","authors":"Xiujing Guo , Hiroyuki Okamura , Tadashi Dohi","doi":"10.1016/j.array.2025.100496","DOIUrl":null,"url":null,"abstract":"<div><div>Boundary value analysis (BVA) is a widely used method in software testing to identify errors at the boundaries of input domains. However, traditional BVA is resource intensive and often impractical for complex systems with expansive input spaces. Recent advances in machine learning offer potential for automating BVA, improving efficiency and fault detection capabilities. This paper introduces a machine learning based approach for the automatic generation of boundary test inputs. The research focuses on addressing the automation of BVA processes, with a particular emphasis on white-box testing scenarios. The proposed methodology consists of two main steps. First, a ML-based discriminator is trained to identify the existence of a boundary between two test inputs. Based on the discriminator’s output, we calculate the boundary density using two proposed methods: “pointDensity” and “pairDensity.” In the second step, Markov Chain Monte Carlo (MCMC) techniques are applied to generate test inputs guided by the calculated boundary densities. Experiments were conducted to evaluate the fault detection capabilities of the ML-based approach compared to concolic testing and manual boundary analysis. The results show that our proposed method surpasses manual boundary analysis in five of the eight programs and outperforms concolic testing in four out of eight programs.</div></div>","PeriodicalId":8417,"journal":{"name":"Array","volume":"27 ","pages":"Article 100496"},"PeriodicalIF":4.5000,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Array","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590005625001237","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Boundary value analysis (BVA) is a widely used method in software testing to identify errors at the boundaries of input domains. However, traditional BVA is resource intensive and often impractical for complex systems with expansive input spaces. Recent advances in machine learning offer potential for automating BVA, improving efficiency and fault detection capabilities. This paper introduces a machine learning based approach for the automatic generation of boundary test inputs. The research focuses on addressing the automation of BVA processes, with a particular emphasis on white-box testing scenarios. The proposed methodology consists of two main steps. First, a ML-based discriminator is trained to identify the existence of a boundary between two test inputs. Based on the discriminator’s output, we calculate the boundary density using two proposed methods: “pointDensity” and “pairDensity.” In the second step, Markov Chain Monte Carlo (MCMC) techniques are applied to generate test inputs guided by the calculated boundary densities. Experiments were conducted to evaluate the fault detection capabilities of the ML-based approach compared to concolic testing and manual boundary analysis. The results show that our proposed method surpasses manual boundary analysis in five of the eight programs and outperforms concolic testing in four out of eight programs.