{"title":"Discovery of Action Rules at Lowest Cost in Spark","authors":"A. Tzacheva, A. Bagavathi, Lavanya Ayila","doi":"10.1109/ICDMW.2017.173","DOIUrl":null,"url":null,"abstract":"Action Rules or Actionable patterns is a type of rule-based approach in data mining that recommends to a user specific actions, in order to achieve a desired result or goal. The amount of data in the world is growing at an exponential rate, doubling almost every two years. Distributed computing platforms like Hadoop and Spark, have eased the computation of this high velocity data. Leveraging these cutting-edge technologies in the field of Data Mining to process huge volumes of data can improve the performance and allow user to gain insights from large datasets with quick turnaround time. In this paper, we present an approach for discovering low cost actionable patterns, and provide actionable recommendations. We adapt this algorithm to distributed environment using Apache Spark framework. We evaluate the performance of the algorithm with two datasets in transportation and medical domain.","PeriodicalId":389183,"journal":{"name":"2017 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Data Mining Workshops (ICDMW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW.2017.173","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
Action Rules or Actionable patterns is a type of rule-based approach in data mining that recommends to a user specific actions, in order to achieve a desired result or goal. The amount of data in the world is growing at an exponential rate, doubling almost every two years. Distributed computing platforms like Hadoop and Spark, have eased the computation of this high velocity data. Leveraging these cutting-edge technologies in the field of Data Mining to process huge volumes of data can improve the performance and allow user to gain insights from large datasets with quick turnaround time. In this paper, we present an approach for discovering low cost actionable patterns, and provide actionable recommendations. We adapt this algorithm to distributed environment using Apache Spark framework. We evaluate the performance of the algorithm with two datasets in transportation and medical domain.