Test Case Prioritization for Compilers: A Text-Vector Based Approach

2016 IEEE International Conference on Software Testing, Verification and Validation (ICST) Pub Date : 2016-04-11 DOI:10.1109/ICST.2016.19

Junjie Chen, Y. Bai, Dan Hao, Yingfei Xiong, Hongyu Zhang, Lu Zhang, Bing Xie

{"title":"Test Case Prioritization for Compilers: A Text-Vector Based Approach","authors":"Junjie Chen, Y. Bai, Dan Hao, Yingfei Xiong, Hongyu Zhang, Lu Zhang, Bing Xie","doi":"10.1109/ICST.2016.19","DOIUrl":null,"url":null,"abstract":"Test case prioritization aims to schedule the execution order of test cases so as to detect bugs as early as possible. For compiler testing, the demand for both effectiveness and efficiency imposes challenge to test case prioritization. In the literature, most existing approaches prioritize test cases by using some coverage information (e.g., statement coverage or branch coverage), which is collected with considerable extra effort. Although input-based test case prioritization relies only on test inputs, it can hardly be applied when test inputs are programs. In this paper we propose a novel text-vector based test case prioritization approach, which prioritizes test cases for C compilers without coverage information. Our approach first transforms each test case into a text-vector by extracting its tokens which reflect fault-relevant characteristics and then prioritizes test cases based on these text-vectors. In particular, in our approach we present three prioritization strategies: greedy strategy, adaptive random strategy, and search strategy. To investigate the efficiency and effectiveness of our approach, we conduct an experiment on two C compilers (i.e., GCC and LLVM), and find that our approach is much more efficient than the existing approaches and is effective in prioritizing test cases.","PeriodicalId":155554,"journal":{"name":"2016 IEEE International Conference on Software Testing, Verification and Validation (ICST)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"53","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Software Testing, Verification and Validation (ICST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICST.2016.19","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 53

Abstract

Test case prioritization aims to schedule the execution order of test cases so as to detect bugs as early as possible. For compiler testing, the demand for both effectiveness and efficiency imposes challenge to test case prioritization. In the literature, most existing approaches prioritize test cases by using some coverage information (e.g., statement coverage or branch coverage), which is collected with considerable extra effort. Although input-based test case prioritization relies only on test inputs, it can hardly be applied when test inputs are programs. In this paper we propose a novel text-vector based test case prioritization approach, which prioritizes test cases for C compilers without coverage information. Our approach first transforms each test case into a text-vector by extracting its tokens which reflect fault-relevant characteristics and then prioritizes test cases based on these text-vectors. In particular, in our approach we present three prioritization strategies: greedy strategy, adaptive random strategy, and search strategy. To investigate the efficiency and effectiveness of our approach, we conduct an experiment on two C compilers (i.e., GCC and LLVM), and find that our approach is much more efficient than the existing approaches and is effective in prioritizing test cases.

查看原文本刊更多论文

编译器的测试用例优先级:基于文本向量的方法

测试用例优先级的目的是安排测试用例的执行顺序，以便尽早发现错误。对于编译器测试，对有效性和效率的要求对测试用例的优先级提出了挑战。在文献中，大多数现有的方法通过使用一些覆盖信息(例如，语句覆盖或分支覆盖)来确定测试用例的优先级，这些信息是通过相当多的额外工作来收集的。尽管基于输入的测试用例优先级只依赖于测试输入，但是当测试输入是程序时，它很难被应用。在本文中，我们提出了一种新的基于文本向量的测试用例优先排序方法，该方法可以在没有覆盖信息的情况下对C编译器的测试用例进行优先排序。我们的方法首先通过提取反映故障相关特征的标记将每个测试用例转换为文本向量，然后基于这些文本向量对测试用例进行优先级排序。在我们的方法中，我们提出了三种优先级策略:贪婪策略、自适应随机策略和搜索策略。为了研究我们的方法的效率和有效性，我们在两个C编译器(即GCC和LLVM)上进行了实验，发现我们的方法比现有的方法要高效得多，并且在优先级测试用例方面是有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 IEEE International Conference on Software Testing, Verification and Validation (ICST)

自引率

0.00%

发文量