{"title":"CONCISE: An Algorithm for Mining Positive and Negative Non-Redundant Association Rules","authors":"Bemarisika Parfait, Totohasina André","doi":"10.52458/978-93-91842-08-6-2","DOIUrl":null,"url":null,"abstract":"One challenge problem in association rules mining is the huge size of the extracted rule set many of which are uninteresting and redundant. In this paper, we propose an efficient algorithm CONCISE for generating all non-redundant positive and negative association rules. We first introduce an algorithm CMG (Closed, Maximal and Generators) for mining all frequent closed, maximal and their generators itemsets from large transaction databases. We then define four new bases representing non-redundant association rules from these frequent itemsets. We prove that these bases significantly reduce the number of extracted rules. We show the efficiency of our algorithm by computational experiments compared with existing algorithms.","PeriodicalId":247665,"journal":{"name":"SCRS Conference Proceedings on Intelligent Systems","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SCRS Conference Proceedings on Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52458/978-93-91842-08-6-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
One challenge problem in association rules mining is the huge size of the extracted rule set many of which are uninteresting and redundant. In this paper, we propose an efficient algorithm CONCISE for generating all non-redundant positive and negative association rules. We first introduce an algorithm CMG (Closed, Maximal and Generators) for mining all frequent closed, maximal and their generators itemsets from large transaction databases. We then define four new bases representing non-redundant association rules from these frequent itemsets. We prove that these bases significantly reduce the number of extracted rules. We show the efficiency of our algorithm by computational experiments compared with existing algorithms.
关联规则挖掘中的一个挑战问题是所提取的规则集规模巨大,其中许多是无趣的和冗余的。本文提出了一种高效的生成所有非冗余正关联规则和负关联规则的简洁算法。首先介绍了一种CMG (Closed, maximum and Generators)算法,用于从大型事务数据库中挖掘所有频繁的Closed, maximum及其生成器项集。然后,我们定义了四个新的基,表示来自这些频繁项集的非冗余关联规则。我们证明了这些基显著地减少了提取规则的数量。通过计算实验与现有算法进行了比较,证明了算法的有效性。