查找频繁项的Apriori算法的扩展

Noorollah Karimtabar, Mohammad Javad Shayegan Fard
{"title":"查找频繁项的Apriori算法的扩展","authors":"Noorollah Karimtabar, Mohammad Javad Shayegan Fard","doi":"10.1109/ICWR49608.2020.9122282","DOIUrl":null,"url":null,"abstract":"The main purpose of data mining is to discover hidden and valuable knowledge from data. The Apriori algorithm is inefficient due to bulky deals of searching in a dataset. Bearing this in mind, this paper proposes an improved algorithm from Apriori using an intelligent method. Proposing an intelligent method in this study is to fulfill two purposes: First, we demonstrated that to create itemsets, instead of adding one item at each step, several items could be added. With this operation, the number of k-itemset steps will decline. Secondly, we have proved that by storing the transaction number of each itemset, there would be a diminishment in the time required for the dataset searches to find the frequent k-itemset in each step. To evaluate the performance, the Intelligent Apriori (lAP) algorithm has been compared with the MDC algorithm. The results of this experiment exhibit that since the transaction scans used to obtain the itemset momentously reduced in number, there was a considerable fall in the runtime needed to obtain a frequent itemset by the proposed algorithm. In this study, the time required to generate frequent items had a 46% reduction compared to that of the MDC_ Apriori algorithm.","PeriodicalId":231982,"journal":{"name":"2020 6th International Conference on Web Research (ICWR)","volume":"93 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"An Extension of the Apriori Algorithm for Finding Frequent Items\",\"authors\":\"Noorollah Karimtabar, Mohammad Javad Shayegan Fard\",\"doi\":\"10.1109/ICWR49608.2020.9122282\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The main purpose of data mining is to discover hidden and valuable knowledge from data. The Apriori algorithm is inefficient due to bulky deals of searching in a dataset. Bearing this in mind, this paper proposes an improved algorithm from Apriori using an intelligent method. Proposing an intelligent method in this study is to fulfill two purposes: First, we demonstrated that to create itemsets, instead of adding one item at each step, several items could be added. With this operation, the number of k-itemset steps will decline. Secondly, we have proved that by storing the transaction number of each itemset, there would be a diminishment in the time required for the dataset searches to find the frequent k-itemset in each step. To evaluate the performance, the Intelligent Apriori (lAP) algorithm has been compared with the MDC algorithm. The results of this experiment exhibit that since the transaction scans used to obtain the itemset momentously reduced in number, there was a considerable fall in the runtime needed to obtain a frequent itemset by the proposed algorithm. In this study, the time required to generate frequent items had a 46% reduction compared to that of the MDC_ Apriori algorithm.\",\"PeriodicalId\":231982,\"journal\":{\"name\":\"2020 6th International Conference on Web Research (ICWR)\",\"volume\":\"93 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 6th International Conference on Web Research (ICWR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICWR49608.2020.9122282\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 6th International Conference on Web Research (ICWR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWR49608.2020.9122282","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

数据挖掘的主要目的是从数据中发现隐藏的和有价值的知识。由于在数据集中进行大量搜索,Apriori算法效率低下。考虑到这一点,本文提出了一种基于Apriori的智能改进算法。在本研究中提出一种智能方法是为了实现两个目的:首先,我们演示了创建项集时,可以添加多个项,而不是每一步添加一个项。通过这种操作,k项集的步骤数将会减少。其次,我们已经证明,通过存储每个项目集的事务号,在每一步中查找频繁的k-项目集所需的数据集搜索时间将会减少。为了评估性能,将智能Apriori (lAP)算法与MDC算法进行了比较。实验结果表明,由于用于获取项目集的事务扫描的数量大大减少,因此所提出的算法获得频繁项目集所需的运行时间大大减少。在本研究中,与MDC_ Apriori算法相比,生成频繁项所需的时间减少了46%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Extension of the Apriori Algorithm for Finding Frequent Items
The main purpose of data mining is to discover hidden and valuable knowledge from data. The Apriori algorithm is inefficient due to bulky deals of searching in a dataset. Bearing this in mind, this paper proposes an improved algorithm from Apriori using an intelligent method. Proposing an intelligent method in this study is to fulfill two purposes: First, we demonstrated that to create itemsets, instead of adding one item at each step, several items could be added. With this operation, the number of k-itemset steps will decline. Secondly, we have proved that by storing the transaction number of each itemset, there would be a diminishment in the time required for the dataset searches to find the frequent k-itemset in each step. To evaluate the performance, the Intelligent Apriori (lAP) algorithm has been compared with the MDC algorithm. The results of this experiment exhibit that since the transaction scans used to obtain the itemset momentously reduced in number, there was a considerable fall in the runtime needed to obtain a frequent itemset by the proposed algorithm. In this study, the time required to generate frequent items had a 46% reduction compared to that of the MDC_ Apriori algorithm.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信