Mahdieh Dehghani, A. Kamandi, M. Shabankhah, A. Moeini
{"title":"Toward a Distinguishing Approach for Improving the Apriori Algorithm","authors":"Mahdieh Dehghani, A. Kamandi, M. Shabankhah, A. Moeini","doi":"10.1109/ICCKE48569.2019.8965206","DOIUrl":null,"url":null,"abstract":"Association rule mining, one of the most important branches of data mining, which focused on detecting frequent patterns of itemsets. Apriori is the first algorithm proposed for association rule mining. This algorithm has the best response and can detect all frequent itemsets from transaction databases. Apriori is of time complexity order two to the power n at worst case, n is the number of items in the database. At each step, the database is scanned to detect frequent itemsets. As a result, this algorithm has a very large response time for large databases. There are two ways to reduce the response time of this algorithm. First, prune the itemsets which candidate for checking. Second, reduce the dimension of the database. We used the second solution and reduce the dimension of the database considering that if a set is frequent, all of its subsets are frequent with more frequencies in the database. In the proposed algorithm, database scanned one time, and then frequent itemsets are detected by the reduced database. Our algorithm improved an apriori response time. To evaluate the algorithm, precision and recall measures have been used. According to the experimental in most cases, the algorithm can provide precision and recall above ninety percent.","PeriodicalId":6685,"journal":{"name":"2019 9th International Conference on Computer and Knowledge Engineering (ICCKE)","volume":"48 1","pages":"309-314"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 9th International Conference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE48569.2019.8965206","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Association rule mining, one of the most important branches of data mining, which focused on detecting frequent patterns of itemsets. Apriori is the first algorithm proposed for association rule mining. This algorithm has the best response and can detect all frequent itemsets from transaction databases. Apriori is of time complexity order two to the power n at worst case, n is the number of items in the database. At each step, the database is scanned to detect frequent itemsets. As a result, this algorithm has a very large response time for large databases. There are two ways to reduce the response time of this algorithm. First, prune the itemsets which candidate for checking. Second, reduce the dimension of the database. We used the second solution and reduce the dimension of the database considering that if a set is frequent, all of its subsets are frequent with more frequencies in the database. In the proposed algorithm, database scanned one time, and then frequent itemsets are detected by the reduced database. Our algorithm improved an apriori response time. To evaluate the algorithm, precision and recall measures have been used. According to the experimental in most cases, the algorithm can provide precision and recall above ninety percent.