Beyond support and confidence: Exploring interestingness measures for rule-based specification mining

2015 IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER) Pub Date : 2015-03-01 DOI:10.1109/SANER.2015.7081843

Tien-Duy B. Le, D. Lo

{"title":"Beyond support and confidence: Exploring interestingness measures for rule-based specification mining","authors":"Tien-Duy B. Le, D. Lo","doi":"10.1109/SANER.2015.7081843","DOIUrl":null,"url":null,"abstract":"Numerous rule-based specification mining approaches have been proposed in the literature. Many of these approaches analyze a set of execution traces to discover interesting usage rules, e.g., whenever lock() is invoked, eventually unlock() is invoked. These techniques often generate and enumerate a set of candidate rules and compute some interestingness scores. Rules whose interestingness scores are above a certain threshold would then be output. In past studies, two measures, namely support and confidence, which are well-known measures, are often used to compute these scores. However, aside from these two, many other interestingness measures have been proposed. It is thus unclear if support and confidence are the best interestingness measures for specification mining. In this work, we perform an empirical study that investigates the utility of 38 interestingness measures in recovering correct specifications of classes from Java libraries. We used a ground truth dataset consisting of 683 rules and recorded execution traces that are produced when we run the DaCapo test suite. We apply 38 different interestingness measures to identify correct rules from a pool of candidate rules. Our study highlights that many measures are on par to support and confidence. Some of the measures are even better than support or confidence and at least one of the measures is statistically significantly better than the two measures. We also find that compositions of several measures with support statistically significantly outperform the composition of support and confidence. Our findings highlight the need to look beyond standard support and confidence to find interesting rules.","PeriodicalId":355949,"journal":{"name":"2015 IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"44","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SANER.2015.7081843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 44

Abstract

Numerous rule-based specification mining approaches have been proposed in the literature. Many of these approaches analyze a set of execution traces to discover interesting usage rules, e.g., whenever lock() is invoked, eventually unlock() is invoked. These techniques often generate and enumerate a set of candidate rules and compute some interestingness scores. Rules whose interestingness scores are above a certain threshold would then be output. In past studies, two measures, namely support and confidence, which are well-known measures, are often used to compute these scores. However, aside from these two, many other interestingness measures have been proposed. It is thus unclear if support and confidence are the best interestingness measures for specification mining. In this work, we perform an empirical study that investigates the utility of 38 interestingness measures in recovering correct specifications of classes from Java libraries. We used a ground truth dataset consisting of 683 rules and recorded execution traces that are produced when we run the DaCapo test suite. We apply 38 different interestingness measures to identify correct rules from a pool of candidate rules. Our study highlights that many measures are on par to support and confidence. Some of the measures are even better than support or confidence and at least one of the measures is statistically significantly better than the two measures. We also find that compositions of several measures with support statistically significantly outperform the composition of support and confidence. Our findings highlight the need to look beyond standard support and confidence to find interesting rules.

查看原文本刊更多论文

超越支持和信任:探索基于规则的规范挖掘的有趣度量

文献中已经提出了许多基于规则的规范挖掘方法。这些方法中有许多分析一组执行跟踪来发现有趣的使用规则，例如，无论何时调用lock()，最终都会调用unlock()。这些技术通常生成和列举一组候选规则，并计算一些有趣的分数。然后将输出趣味性分数高于某个阈值的规则。在过去的研究中，通常使用支持度和信心这两个众所周知的度量来计算这些分数。然而，除了这两个，还有许多其他有趣的措施被提出。因此，对于规范挖掘来说，支持度和置信度是否是最有趣的度量是不清楚的。在这项工作中，我们进行了一项实证研究，调查了38种兴趣度量在从Java库中恢复正确的类规范中的效用。我们使用了一个由683条规则组成的真实数据集，并记录了运行DaCapo测试套件时产生的执行跟踪。我们应用38种不同的兴趣度量来从候选规则池中识别正确的规则。我们的研究强调，许多措施与支持和信心是同等重要的。有些措施甚至比支持或信心更好，至少其中一个措施在统计上明显优于这两个措施。我们还发现，支持的几个措施的组成在统计上显著优于支持和信心的组成。我们的研究结果强调，需要超越标准的支持和信心来寻找有趣的规则。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER)

自引率

0.00%

发文量