{"title":"An effective Boolean algorithm for mining association rules in large databases","authors":"Suh-Ying Wur, Y. Leu","doi":"10.1109/DASFAA.1999.765750","DOIUrl":null,"url":null,"abstract":"In this paper, we present in effective Boolean algorithm for mining association rules in large databases of sales transactions. Like the a priori algorithm, the proposed Boolean algorithm mines association rules in two steps. In the first step, logic OR and AND operations are used to compute frequent itemsets. In the second step, logic AND and XOR operations are applied to derive all interesting association rules based on the computed frequent itemsets. By only scanning the database once and avoiding generating candidate itemsets in computing frequent itemsets, the Boolean algorithm gains a significant performance improvement over the a priori algorithm. We propose two efficient implementations of the Boolean algorithm, the bitstream approach and the sparse-matrix approach. Through comprehensive experiments, we show that both the bitstream approach and the sparse-matrix approach outperform the a priori algorithm in all database settings. The sparse-matrix approach in particular shows a very significant performance improvement over the a priori algorithm.","PeriodicalId":229416,"journal":{"name":"Proceedings. 6th International Conference on Advanced Systems for Advanced Applications","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"54","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. 6th International Conference on Advanced Systems for Advanced Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DASFAA.1999.765750","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 54
Abstract
In this paper, we present in effective Boolean algorithm for mining association rules in large databases of sales transactions. Like the a priori algorithm, the proposed Boolean algorithm mines association rules in two steps. In the first step, logic OR and AND operations are used to compute frequent itemsets. In the second step, logic AND and XOR operations are applied to derive all interesting association rules based on the computed frequent itemsets. By only scanning the database once and avoiding generating candidate itemsets in computing frequent itemsets, the Boolean algorithm gains a significant performance improvement over the a priori algorithm. We propose two efficient implementations of the Boolean algorithm, the bitstream approach and the sparse-matrix approach. Through comprehensive experiments, we show that both the bitstream approach and the sparse-matrix approach outperform the a priori algorithm in all database settings. The sparse-matrix approach in particular shows a very significant performance improvement over the a priori algorithm.