{"title":"rTuner: A Performance Enhancement of MapReduce Job","authors":"Ripon Patgiri, Rajdeep Das","doi":"10.1145/3177457.3191710","DOIUrl":null,"url":null,"abstract":"In this paper, we present a novel task scheduling algorithm, called rTuner. The key objective of the rTuner is to enhance the reduce task execution time in heterogeneous environments. Because, the reduce task is a very expensive process. The reduce tasks comprise of three phases, unlike to the map task, namely, copy phase, shuffle phase, and reduce phase. Therefore, the rescheduling a straggler reduce task can negatively affect the performance, if the scheduling algorithms does not analyze the underlying situation. The rTuner analyzes the reduce tasks' straggling reason, and tunes the reduce task. If a reduce task becomes straggler, then rTuner reschedules it in a suitable node depending on the situation. Our benchmark result shows that enhancement of reduce tasks boosts up the CPU elapsed time significantly. Moreover, we show the efficacy of the rTuner by extensive experiment in low-cost commodity hardware. The rTuner is able to improve the total job execution time of MapReduce significantly, either a heterogeneous environment or homogeneous environment. The rTuner is capable of reducing the execution time by 86.86 seconds and 100.67 seconds on an average over the Longest Approximate Time to End (LATE) in homogeneous and heterogeneous environment respectively. In addition, the rTuner is also able to improve the execution time by 142.44 seconds and 132.52 seconds over LATE in homogeneous and heterogeneous environment at the best situation respectively.","PeriodicalId":297531,"journal":{"name":"Proceedings of the 10th International Conference on Computer Modeling and Simulation","volume":"132 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 10th International Conference on Computer Modeling and Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3177457.3191710","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In this paper, we present a novel task scheduling algorithm, called rTuner. The key objective of the rTuner is to enhance the reduce task execution time in heterogeneous environments. Because, the reduce task is a very expensive process. The reduce tasks comprise of three phases, unlike to the map task, namely, copy phase, shuffle phase, and reduce phase. Therefore, the rescheduling a straggler reduce task can negatively affect the performance, if the scheduling algorithms does not analyze the underlying situation. The rTuner analyzes the reduce tasks' straggling reason, and tunes the reduce task. If a reduce task becomes straggler, then rTuner reschedules it in a suitable node depending on the situation. Our benchmark result shows that enhancement of reduce tasks boosts up the CPU elapsed time significantly. Moreover, we show the efficacy of the rTuner by extensive experiment in low-cost commodity hardware. The rTuner is able to improve the total job execution time of MapReduce significantly, either a heterogeneous environment or homogeneous environment. The rTuner is capable of reducing the execution time by 86.86 seconds and 100.67 seconds on an average over the Longest Approximate Time to End (LATE) in homogeneous and heterogeneous environment respectively. In addition, the rTuner is also able to improve the execution time by 142.44 seconds and 132.52 seconds over LATE in homogeneous and heterogeneous environment at the best situation respectively.