{"title":"MR-Apriori count distribution algorithm for parallel Action Rules discovery","authors":"A. Tzacheva, Midhun M. Sunny, Pranava Mummoju","doi":"10.1109/ICKEA.2016.7803005","DOIUrl":null,"url":null,"abstract":"Data mining deals with the extraction of hidden predictive information from large databases. One of the central tasks associated with data mining is to discover profitable actions from the dataset for the decision maker. Discovering these actions can be accomplished through extracting Action Rules from the data, which has become an attractive research topic in data mining. Several methods have been developed for the discovery of Action Rules and variety of methods for association rules in the past few years. However, with the explosive recent growth of the amounts of data, there is a need for the development of scalable methods for Action Rules discovery to accommodate the massive datasets. We are not aware of any such methods existing at this time. In this paper, we propose a novel approach - a parallel Action Rule discovery algorithm based on MapReduce paradigm through count distribution. We use Hadoop, as a scalable and distributed framework for implementing this method. Experiment shows much faster computational time for Action Rules discovery in a distributed environment compared to the traditional single machine method.","PeriodicalId":241850,"journal":{"name":"2016 IEEE International Conference on Knowledge Engineering and Applications (ICKEA)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Knowledge Engineering and Applications (ICKEA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICKEA.2016.7803005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Data mining deals with the extraction of hidden predictive information from large databases. One of the central tasks associated with data mining is to discover profitable actions from the dataset for the decision maker. Discovering these actions can be accomplished through extracting Action Rules from the data, which has become an attractive research topic in data mining. Several methods have been developed for the discovery of Action Rules and variety of methods for association rules in the past few years. However, with the explosive recent growth of the amounts of data, there is a need for the development of scalable methods for Action Rules discovery to accommodate the massive datasets. We are not aware of any such methods existing at this time. In this paper, we propose a novel approach - a parallel Action Rule discovery algorithm based on MapReduce paradigm through count distribution. We use Hadoop, as a scalable and distributed framework for implementing this method. Experiment shows much faster computational time for Action Rules discovery in a distributed environment compared to the traditional single machine method.