A Fast Algorithm for Deriving Frequent Itemsets

Cheng-Wei Wu, Yun-Wei Lin, Ming Chen, Jiashu Cheng
{"title":"A Fast Algorithm for Deriving Frequent Itemsets","authors":"Cheng-Wei Wu, Yun-Wei Lin, Ming Chen, Jiashu Cheng","doi":"10.1109/taai54685.2021.00047","DOIUrl":null,"url":null,"abstract":"When mining frequent itemsets (abbr. FIs) from dense datasets, too many itemsets are generated and results in the mining task from a large amount of execution time and high memory consumption. Frequent closed itemset (abbr. FCI) is a lossless and concise representation of FIs. Mining FCIs can not only greatly reduce the execution time and memory consumption, but also retain the complete information all of FI. Although many studies have proposed different mining FCI algorithms, but they have less developed methods that can effectively derive all FIs from FCIs. Form this point of view, this study proposes a novel efficient algorithm named DFI-List for efficiently deriving FIS from FCIs. The algorithm adopts the methodology of depth-first-search and divide-and-conquer to derive all FIs from FCIs. DFI-List efficiently derives all the FIs with vertical index structure called Cid List and uses SC Table to quickly find the support count of the derived FI. Experimental results show that the execution speed and memory consumption of the proposed algorithm with the proposed strategy is better than of the state-of-art algorithm.","PeriodicalId":343821,"journal":{"name":"2021 International Conference on Technologies and Applications of Artificial Intelligence (TAAI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Technologies and Applications of Artificial Intelligence (TAAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/taai54685.2021.00047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

When mining frequent itemsets (abbr. FIs) from dense datasets, too many itemsets are generated and results in the mining task from a large amount of execution time and high memory consumption. Frequent closed itemset (abbr. FCI) is a lossless and concise representation of FIs. Mining FCIs can not only greatly reduce the execution time and memory consumption, but also retain the complete information all of FI. Although many studies have proposed different mining FCI algorithms, but they have less developed methods that can effectively derive all FIs from FCIs. Form this point of view, this study proposes a novel efficient algorithm named DFI-List for efficiently deriving FIS from FCIs. The algorithm adopts the methodology of depth-first-search and divide-and-conquer to derive all FIs from FCIs. DFI-List efficiently derives all the FIs with vertical index structure called Cid List and uses SC Table to quickly find the support count of the derived FI. Experimental results show that the execution speed and memory consumption of the proposed algorithm with the proposed strategy is better than of the state-of-art algorithm.
频繁项集的快速生成算法
在从密集数据集中挖掘频繁项集(简称fi)时,会产生过多的项集,导致挖掘任务执行时间长,内存消耗高。频繁闭项集(简称FCI)是FCI的一种无损而简洁的表示。挖掘fci不仅可以大大减少执行时间和内存消耗,而且可以保留所有fci的完整信息。虽然许多研究提出了不同的挖掘FCI算法,但能够有效地从FCI中提取所有FCI的方法尚不多见。基于此,本研究提出了一种新的高效算法DFI-List,用于从fci中高效地提取FIS。该算法采用深度优先搜索和分而治之的方法,从fci中导出所有的fci。DFI-List通过称为Cid List的垂直索引结构有效地派生出所有FI,并使用SC Table快速查找派生出的FI的支持计数。实验结果表明,采用该策略的算法在执行速度和内存消耗方面均优于现有算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信