查找频繁项:一种改进Apriori算法的新方法

Theor. Comput. Sci. Pub Date : 2022-07-06 DOI:10.7494/csci.2022.23.2.3776

Noorollah Karimtabar, Mohammad Javad Shayegan Fard

{"title":"查找频繁项:一种改进Apriori算法的新方法","authors":"Noorollah Karimtabar, Mohammad Javad Shayegan Fard","doi":"10.7494/csci.2022.23.2.3776","DOIUrl":null,"url":null,"abstract":"In the current paper, we use an intelligent method for improved the Apriori algorithm in order to extract frequent itemsets. PAA (proposed Apriori algorithm) is twofold. First, it is not necessary to take only one data item at each step. In fact, all possible combinations of the items could be generated at each step. Secondly, we can scan only some transactions instead of scanning all the transactions to obtain frequent itemset. For performance evaluation, we conducted three experiments with the traditional Apriori, BitTableFI, TDM-MFI, and MDC_Apriori algorithms. The results exhibit that due to the significant reduction in the number of transaction scans to obtain the itemset, the algorithm execution time is significantly reduced; as in the first experiment, the time spent to generate frequent items underwent a reduction by 52% compared to the algorithm in the first experiment. In the second experiment, the amount of time spent is equal to 65%, while in the third experiment, it is equal to 46%.","PeriodicalId":23063,"journal":{"name":"Theor. Comput. Sci.","volume":"8 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Finding Frequent Items: A Novel Method for Improving the Apriori Algorithm\",\"authors\":\"Noorollah Karimtabar, Mohammad Javad Shayegan Fard\",\"doi\":\"10.7494/csci.2022.23.2.3776\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the current paper, we use an intelligent method for improved the Apriori algorithm in order to extract frequent itemsets. PAA (proposed Apriori algorithm) is twofold. First, it is not necessary to take only one data item at each step. In fact, all possible combinations of the items could be generated at each step. Secondly, we can scan only some transactions instead of scanning all the transactions to obtain frequent itemset. For performance evaluation, we conducted three experiments with the traditional Apriori, BitTableFI, TDM-MFI, and MDC_Apriori algorithms. The results exhibit that due to the significant reduction in the number of transaction scans to obtain the itemset, the algorithm execution time is significantly reduced; as in the first experiment, the time spent to generate frequent items underwent a reduction by 52% compared to the algorithm in the first experiment. In the second experiment, the amount of time spent is equal to 65%, while in the third experiment, it is equal to 46%.\",\"PeriodicalId\":23063,\"journal\":{\"name\":\"Theor. Comput. Sci.\",\"volume\":\"8 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Theor. Comput. Sci.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.7494/csci.2022.23.2.3776\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theor. Comput. Sci.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.7494/csci.2022.23.2.3776","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

本文采用一种改进Apriori算法的智能方法来提取频繁项集。PAA(提议的Apriori算法)是双重的。首先，没有必要在每个步骤中只获取一个数据项。事实上，每一步都可以生成所有可能的项目组合。其次，我们可以只扫描一些交易，而不是扫描所有的交易，以获得频繁的项目集。为了进行性能评估，我们使用传统Apriori、BitTableFI、TDM-MFI和MDC_Apriori算法进行了三次实验。结果表明，由于获取itemset的事务扫描次数显著减少，算法的执行时间显著缩短;与第一个实验一样，与第一个实验中的算法相比，生成频繁项目的时间减少了52%。在第二个实验中，花费的时间等于65%，而在第三个实验中，花费的时间等于46%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Finding Frequent Items: A Novel Method for Improving the Apriori Algorithm

In the current paper, we use an intelligent method for improved the Apriori algorithm in order to extract frequent itemsets. PAA (proposed Apriori algorithm) is twofold. First, it is not necessary to take only one data item at each step. In fact, all possible combinations of the items could be generated at each step. Secondly, we can scan only some transactions instead of scanning all the transactions to obtain frequent itemset. For performance evaluation, we conducted three experiments with the traditional Apriori, BitTableFI, TDM-MFI, and MDC_Apriori algorithms. The results exhibit that due to the significant reduction in the number of transaction scans to obtain the itemset, the algorithm execution time is significantly reduced; as in the first experiment, the time spent to generate frequent items underwent a reduction by 52% compared to the algorithm in the first experiment. In the second experiment, the amount of time spent is equal to 65%, while in the third experiment, it is equal to 46%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Theor. Comput. Sci.

自引率

0.00%

发文量