An adaptive algorithm for incremental mining of association rules

Proceedings Ninth International Workshop on Database and Expert Systems Applications (Cat. No.98EX130) Pub Date : 1998-08-26 DOI:10.1109/DEXA.1998.707409

N. L. Sarda, N. Srinivas

{"title":"An adaptive algorithm for incremental mining of association rules","authors":"N. L. Sarda, N. Srinivas","doi":"10.1109/DEXA.1998.707409","DOIUrl":null,"url":null,"abstract":"The association rules represent an important class of knowledge that can be discovered from data warehouses. Current research efforts are focused on inventing efficient ways of discovering these rules from large databases. As databases grow, the discovered rules need to be verified and new rules need to be added to the knowledge base. Since mining afresh every time the database grows is inefficient, algorithms for incremental mining are being investigated. Their primary aim is to avoid or minimize scans of the older database by using the intermediate data constructed during the earlier mining. We present one such algorithm. We make use of large and candidate itemsets and their counts in the older database, and scan the increment to find which rules continue to prevail and which ones fail in the merged database. We are also able to find new rules for the incremental and updated database. The algorithm is adaptive in nature, as it infers the nature of the increment and avoids altogether if possible, multiple scans of the incremental database. Another salient feature is that it does not need multiple scans of the older database. We also indicate some results on its performance against synthetic data.","PeriodicalId":194923,"journal":{"name":"Proceedings Ninth International Workshop on Database and Expert Systems Applications (Cat. No.98EX130)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"90","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Ninth International Workshop on Database and Expert Systems Applications (Cat. No.98EX130)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEXA.1998.707409","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 90

Abstract

The association rules represent an important class of knowledge that can be discovered from data warehouses. Current research efforts are focused on inventing efficient ways of discovering these rules from large databases. As databases grow, the discovered rules need to be verified and new rules need to be added to the knowledge base. Since mining afresh every time the database grows is inefficient, algorithms for incremental mining are being investigated. Their primary aim is to avoid or minimize scans of the older database by using the intermediate data constructed during the earlier mining. We present one such algorithm. We make use of large and candidate itemsets and their counts in the older database, and scan the increment to find which rules continue to prevail and which ones fail in the merged database. We are also able to find new rules for the incremental and updated database. The algorithm is adaptive in nature, as it infers the nature of the increment and avoids altogether if possible, multiple scans of the incremental database. Another salient feature is that it does not need multiple scans of the older database. We also indicate some results on its performance against synthetic data.

查看原文本刊更多论文

关联规则增量挖掘的自适应算法

关联规则表示可以从数据仓库中发现的一类重要知识。目前的研究工作集中在发明从大型数据库中发现这些规则的有效方法上。随着数据库的增长，需要对发现的规则进行验证，并向知识库中添加新的规则。由于每次数据库增长时重新挖掘是低效的，因此正在研究增量挖掘的算法。它们的主要目的是通过使用在早期挖掘期间构造的中间数据来避免或最小化对旧数据库的扫描。我们提出了一个这样的算法。我们利用旧数据库中的大型和候选项目集及其计数，并扫描增量以查找合并数据库中哪些规则继续占上风，哪些规则失败。我们还能够为增量和更新的数据库找到新的规则。该算法本质上是自适应的，因为它推断增量的性质，并在可能的情况下避免对增量数据库进行多次扫描。另一个显著特性是它不需要对旧数据库进行多次扫描。我们还给出了一些针对合成数据的性能结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings Ninth International Workshop on Database and Expert Systems Applications (Cat. No.98EX130)

自引率

0.00%

发文量