Finding Frequent Items: A Novel Method for Improving the Apriori Algorithm

Noorollah Karimtabar, Mohammad Javad Shayegan Fard
DOI: 10.7494/csci.2022.23.2.3776 (https://doi.org/10.7494/csci.2022.23.2.3776)
Journal (as listed): Theor. Comput. Sci.
Publication date: 2022-07-06
Publication type: Journal Article
Citations: 2

Abstract
Finding Frequent Items: A Novel Method for Improving the Apriori Algorithm
In this paper, we use an intelligent method to improve the Apriori algorithm for extracting frequent itemsets. The proposed Apriori algorithm (PAA) rests on two ideas. First, it is not necessary to take only one data item at each step; all possible combinations of the items can be generated at each step. Second, only some of the transactions need to be scanned, rather than all of them, to obtain the frequent itemsets. For performance evaluation, we conducted three experiments against the traditional Apriori, BitTableFI, TDM-MFI, and MDC_Apriori algorithms. The results show that, because the number of transaction scans needed to obtain the itemsets is significantly reduced, the execution time of the algorithm is also significantly reduced. In the first experiment, the time spent generating frequent items was reduced by 52% compared to the algorithm in the first experiment. In the second experiment, the reported figure is 65%, and in the third experiment it is 46%.
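
The abstract does not spell out how PAA decides which transactions to scan, so the following is only a minimal sketch of one common way to avoid full scans, not the authors' implementation. It assumes that a single initial pass records, for each item, the set of transaction ids containing it; each candidate is then counted by visiting only the transactions that contain its rarest item. The function name `frequent_itemsets` and its parameters are hypothetical, and the candidate generation shown is the classic one-item-at-a-time join, not the combination-based generation the abstract mentions.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Apriori-style level-wise search with partial transaction scans.

    transactions: list of sets of items.
    min_support: absolute support threshold (a count, not a ratio).
    Returns a dict mapping each frequent itemset (frozenset) to its support.
    NOTE: illustrative sketch only; not the PAA algorithm from the paper.
    """
    # One full pass: record which transaction ids contain each item.
    tids = {}
    for tid, items in enumerate(transactions):
        for item in items:
            tids.setdefault(item, set()).add(tid)

    # Frequent 1-itemsets come directly from the sizes of the tid-sets.
    current = {
        frozenset([item]): len(ids)
        for item, ids in tids.items()
        if len(ids) >= min_support
    }
    result = dict(current)

    k = 2
    while current:
        # Join step: merge pairs of frequent (k-1)-itemsets into k-itemsets.
        candidates = {
            a | b for a, b in combinations(list(current), 2) if len(a | b) == k
        }
        # Prune step: every (k-1)-subset of a candidate must be frequent.
        candidates = {
            c
            for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }

        next_level = {}
        for cand in candidates:
            # Partial scan: visit only transactions holding the rarest item,
            # since no other transaction can contain the whole candidate.
            rarest = min(cand, key=lambda item: len(tids[item]))
            count = sum(1 for tid in tids[rarest] if cand <= transactions[tid])
            if count >= min_support:
                next_level[cand] = count

        result.update(next_level)
        current = next_level
        k += 1

    return result
```

For example, calling `frequent_itemsets([{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}], 3)` counts each pair by looking only at the four transactions that contain its rarest item rather than all five. The saving grows when items are sparse, which is the general effect the abstract attributes to scanning fewer transactions.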