An investigation of the applicability of design of experiments to software testing

27th Annual NASA Goddard/IEEE Software Engineering Workshop, 2002. Proceedings. Pub Date : 2002-12-05 DOI:10.1109/SEW.2002.1199454

D. Kuhn, michael. reilly, michael. reilly

{"title":"An investigation of the applicability of design of experiments to software testing","authors":"D. Kuhn, michael. reilly, michael. reilly","doi":"10.1109/SEW.2002.1199454","DOIUrl":null,"url":null,"abstract":"Approaches to software testing based on methods from the field of design of experiments have been advocated as a means of providing high coverage at relatively low cost. Tools to generate all pairs, or higher n-degree combinations, of input values have been developed and demonstrated in a few applications, but little empirical evidence is available to aid developers in evaluating the effectiveness of these tools for particular problems. We investigate error reports from two large open-source software projects, a browser and Web server, to provide preliminary answers to three questions: Is there a point of diminishing returns at which generating all n-degree combinations is nearly as effective as all n+1-degree combinations? What is the appropriate value of n for particular classes of software? Does this value differ for different types of software, and by how much? Our findings suggest that more than 95% of errors in the software studied would be detected by test cases that cover all 4-way combinations of values, and that the browser and server software were similar in the percentage of errors detectable by combinations of degree 2 through 6.","PeriodicalId":146269,"journal":{"name":"27th Annual NASA Goddard/IEEE Software Engineering Workshop, 2002. Proceedings.","volume":"127 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"329","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"27th Annual NASA Goddard/IEEE Software Engineering Workshop, 2002. Proceedings.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SEW.2002.1199454","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 329

Abstract

Approaches to software testing based on methods from the field of design of experiments have been advocated as a means of providing high coverage at relatively low cost. Tools to generate all pairs, or higher n-degree combinations, of input values have been developed and demonstrated in a few applications, but little empirical evidence is available to aid developers in evaluating the effectiveness of these tools for particular problems. We investigate error reports from two large open-source software projects, a browser and Web server, to provide preliminary answers to three questions: Is there a point of diminishing returns at which generating all n-degree combinations is nearly as effective as all n+1-degree combinations? What is the appropriate value of n for particular classes of software? Does this value differ for different types of software, and by how much? Our findings suggest that more than 95% of errors in the software studied would be detected by test cases that cover all 4-way combinations of values, and that the browser and server software were similar in the percentage of errors detectable by combinations of degree 2 through 6.

查看原文本刊更多论文

实验设计在软件测试中的适用性研究

基于实验设计领域的方法的软件测试方法被提倡为一种以相对较低的成本提供高覆盖率的方法。已经开发并在一些应用程序中演示了生成输入值的所有对或更高n度组合的工具，但很少有经验证据可用于帮助开发人员评估这些工具对特定问题的有效性。我们调查了来自两个大型开源软件项目(浏览器和Web服务器)的错误报告，为三个问题提供了初步答案:是否存在一个收益递减点，在这个点上生成所有n度组合几乎与生成所有n+1度组合一样有效?对于特定类型的软件，n的合适值是多少?对于不同类型的软件，这个值是否不同?差异有多大?我们的研究结果表明，所研究的软件中超过95%的错误将被覆盖所有4种组合值的测试用例检测到，并且浏览器和服务器软件在通过2到6度组合检测到的错误百分比方面是相似的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

27th Annual NASA Goddard/IEEE Software Engineering Workshop, 2002. Proceedings.

自引率

0.00%

发文量