An Itemset-Driven Cluster-Oriented Approach to Extract Compact and Meaningful Sets of Association Rules

Sixth International Conference on Machine Learning and Applications (ICMLA 2007) Pub Date : 2007-12-13 DOI:10.1109/ICMLA.2007.45

C. H. Yamamoto, Maria Cristina Ferreira de Oliveira, M. L. Fujimoto, S. O. Rezende

引用次数: 3

Abstract

Extracting association rules from large datasets typically results in a huge amount of rules. An approach to tackle this problem is to filter the resulting rule set, which reduces the rules, at the cost of also eliminating potentially interesting ones. In exploring a new dataset in search of relevant associations, it may be more useful for miners to have an overview of the space of rules obtainable from the dataset, rather than getting an arbitrary set satisfying high values for given interest measures. We describe a rule extraction approach that favors rule diversity, allowing miners to gain an overview of the rule space while reducing semantic redundancy within the rule set. This approach adopts an itemset-driven rule generation coupled with a cluster-based filtering process. The set of rules so obtained provides a starting point for a user-driven exploration of it.

查看原文本刊更多论文

一种项集驱动的面向聚类的关联规则紧凑有意义集提取方法

从大型数据集中提取关联规则通常会产生大量的规则。解决这个问题的一种方法是过滤结果规则集，这样可以减少规则，但同时也要消除可能有趣的规则。在探索新的数据集以寻找相关关联时，对于矿工来说，从数据集中获得可获得的规则空间的概述可能更有用，而不是为给定的兴趣度量获得满足高值的任意集合。我们描述了一种有利于规则多样性的规则提取方法，允许矿工在减少规则集中的语义冗余的同时获得规则空间的概述。这种方法采用了项集驱动的规则生成和基于聚类的过滤过程。这样获得的规则集为用户驱动的探索提供了一个起点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Sixth International Conference on Machine Learning and Applications (ICMLA 2007)

自引率

0.00%

发文量