Using Machine Learning to Support Debugging with Tarantula

The 18th IEEE International Symposium on Software Reliability (ISSRE '07) Pub Date : 2007-11-05 DOI:10.1109/ISSRE.2007.31

L. Briand, Y. Labiche, Xuetao Liu

{"title":"Using Machine Learning to Support Debugging with Tarantula","authors":"L. Briand, Y. Labiche, Xuetao Liu","doi":"10.1109/ISSRE.2007.31","DOIUrl":null,"url":null,"abstract":"Using a specific machine learning technique, this paper proposes a way to identify suspicious statements during debugging. The technique is based on principles similar to Tarantula but addresses its main flaw: its difficulty to deal with the presence of multiple faults as it assumes that failing test cases execute the same fault(s). The improvement we present in this paper results from the use of C4.5 decision trees to identify various failure conditions based on information regarding the test cases' inputs and outputs. Failing test cases executing under similar conditions are then assumed to fail due to the same fault(s). Statements are then considered suspicious if they are covered by a large proportion of failing test cases that execute under similar conditions. We report on a case study that demonstrates improvement over the original Tarantula technique in terms of statement ranking. Another contribution of this paper is to show that failure conditions as modeled by a C4.5 decision tree accurately predict failures and can therefore be used as well to help debugging.","PeriodicalId":193805,"journal":{"name":"The 18th IEEE International Symposium on Software Reliability (ISSRE '07)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"96","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 18th IEEE International Symposium on Software Reliability (ISSRE '07)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSRE.2007.31","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 96

Abstract

Using a specific machine learning technique, this paper proposes a way to identify suspicious statements during debugging. The technique is based on principles similar to Tarantula but addresses its main flaw: its difficulty to deal with the presence of multiple faults as it assumes that failing test cases execute the same fault(s). The improvement we present in this paper results from the use of C4.5 decision trees to identify various failure conditions based on information regarding the test cases' inputs and outputs. Failing test cases executing under similar conditions are then assumed to fail due to the same fault(s). Statements are then considered suspicious if they are covered by a large proportion of failing test cases that execute under similar conditions. We report on a case study that demonstrates improvement over the original Tarantula technique in terms of statement ranking. Another contribution of this paper is to show that failure conditions as modeled by a C4.5 decision tree accurately predict failures and can therefore be used as well to help debugging.

查看原文本刊更多论文

使用机器学习来支持狼蛛的调试

本文利用一种特定的机器学习技术，提出了一种在调试过程中识别可疑语句的方法。该技术基于类似于Tarantula的原则，但解决了它的主要缺陷:它难以处理多个错误的存在，因为它假设失败的测试用例执行相同的错误。我们在本文中提出的改进源于使用C4.5决策树来根据有关测试用例的输入和输出的信息识别各种故障条件。失败的测试用例在类似的条件下执行，然后假定由于相同的错误而失败。如果语句被大量在相似条件下执行的失败测试用例所覆盖，那么语句就被认为是可疑的。我们报告了一个案例研究，证明了原始的狼蛛技术在语句排名方面的改进。本文的另一个贡献是展示了由C4.5决策树建模的故障条件可以准确地预测故障，因此也可以用于帮助调试。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The 18th IEEE International Symposium on Software Reliability (ISSRE '07)

自引率

0.00%

发文量