An Efficient Subset-Lattice Algorithm for Mining Closed Frequent Itemsets in Data Streams

Ye-In Chang, Chia-En Li, Wei-Hau Peng
{"title":"An Efficient Subset-Lattice Algorithm for Mining Closed Frequent Itemsets in Data Streams","authors":"Ye-In Chang, Chia-En Li, Wei-Hau Peng","doi":"10.1109/TAAI.2012.12","DOIUrl":null,"url":null,"abstract":"There are many applications of using association rules in data streams, such as market analysis, network security, sensor networks and web tracking. Mining closed frequent item sets is a further work of mining association rules, which aims to find the subsets of frequent item sets that could extract all frequent item sets. Formally, a closed frequent item set is a frequent item set which has no superset with the same support as it. One of well-known algorithms for mining closed frequent item sets based on the sliding window model is the New Moment algorithm. However, the New Moment algorithm could not efficiently mine closed frequent item sets in data streams, since they will generate closed frequent item sets and many unclosed frequent item sets. Moreover, when data in the sliding window is incrementally updated, the New Moment algorithm needs to reconstruct the whole tree structure. Therefore, we propose the Subset-Lattice algorithm which embeds the property of subsets into the lattice structure to efficiently mine closed frequent item sets over a data stream sliding window. Moreover, when data in the sliding window is incrementally updated, our Subset-Lattice algorithm will not reconstruct the whole lattice structure.","PeriodicalId":385063,"journal":{"name":"2012 Conference on Technologies and Applications of Artificial Intelligence","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Conference on Technologies and Applications of Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TAAI.2012.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

There are many applications of using association rules in data streams, such as market analysis, network security, sensor networks and web tracking. Mining closed frequent item sets is a further work of mining association rules, which aims to find the subsets of frequent item sets that could extract all frequent item sets. Formally, a closed frequent item set is a frequent item set which has no superset with the same support as it. One of well-known algorithms for mining closed frequent item sets based on the sliding window model is the New Moment algorithm. However, the New Moment algorithm could not efficiently mine closed frequent item sets in data streams, since they will generate closed frequent item sets and many unclosed frequent item sets. Moreover, when data in the sliding window is incrementally updated, the New Moment algorithm needs to reconstruct the whole tree structure. Therefore, we propose the Subset-Lattice algorithm which embeds the property of subsets into the lattice structure to efficiently mine closed frequent item sets over a data stream sliding window. Moreover, when data in the sliding window is incrementally updated, our Subset-Lattice algorithm will not reconstruct the whole lattice structure.
数据流中封闭频繁项集挖掘的高效子集-格算法
在数据流中使用关联规则有很多应用,如市场分析、网络安全、传感器网络和web跟踪等。挖掘封闭频繁项集是挖掘关联规则的进一步工作,其目的是找到能够提取所有频繁项集的频繁项集子集。正式地说,封闭频繁项集是没有与它具有相同支持的超集的频繁项集。基于滑动窗口模型的封闭频繁项集挖掘算法之一是新矩算法。然而,新矩算法不能有效地挖掘数据流中的封闭频繁项集,因为它们会生成封闭频繁项集和许多非封闭频繁项集。此外,当滑动窗口中的数据增量更新时,新矩算法需要重构整个树结构。因此,我们提出了子集-格算法,该算法将子集的性质嵌入到格结构中,以有效地挖掘数据流滑动窗口上的封闭频繁项集。此外,当滑动窗口中的数据增量更新时,我们的子集-晶格算法不会重建整个晶格结构。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信