rTuner: A Performance Enhancement of MapReduce Job

Ripon Patgiri, Rajdeep Das
{"title":"rTuner: A Performance Enhancement of MapReduce Job","authors":"Ripon Patgiri, Rajdeep Das","doi":"10.1145/3177457.3191710","DOIUrl":null,"url":null,"abstract":"In this paper, we present a novel task scheduling algorithm, called rTuner. The key objective of the rTuner is to enhance the reduce task execution time in heterogeneous environments. Because, the reduce task is a very expensive process. The reduce tasks comprise of three phases, unlike to the map task, namely, copy phase, shuffle phase, and reduce phase. Therefore, the rescheduling a straggler reduce task can negatively affect the performance, if the scheduling algorithms does not analyze the underlying situation. The rTuner analyzes the reduce tasks' straggling reason, and tunes the reduce task. If a reduce task becomes straggler, then rTuner reschedules it in a suitable node depending on the situation. Our benchmark result shows that enhancement of reduce tasks boosts up the CPU elapsed time significantly. Moreover, we show the efficacy of the rTuner by extensive experiment in low-cost commodity hardware. The rTuner is able to improve the total job execution time of MapReduce significantly, either a heterogeneous environment or homogeneous environment. The rTuner is capable of reducing the execution time by 86.86 seconds and 100.67 seconds on an average over the Longest Approximate Time to End (LATE) in homogeneous and heterogeneous environment respectively. In addition, the rTuner is also able to improve the execution time by 142.44 seconds and 132.52 seconds over LATE in homogeneous and heterogeneous environment at the best situation respectively.","PeriodicalId":297531,"journal":{"name":"Proceedings of the 10th International Conference on Computer Modeling and Simulation","volume":"132 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 10th International Conference on Computer Modeling and Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3177457.3191710","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

In this paper, we present a novel task scheduling algorithm, called rTuner. The key objective of the rTuner is to enhance the reduce task execution time in heterogeneous environments. Because, the reduce task is a very expensive process. The reduce tasks comprise of three phases, unlike to the map task, namely, copy phase, shuffle phase, and reduce phase. Therefore, the rescheduling a straggler reduce task can negatively affect the performance, if the scheduling algorithms does not analyze the underlying situation. The rTuner analyzes the reduce tasks' straggling reason, and tunes the reduce task. If a reduce task becomes straggler, then rTuner reschedules it in a suitable node depending on the situation. Our benchmark result shows that enhancement of reduce tasks boosts up the CPU elapsed time significantly. Moreover, we show the efficacy of the rTuner by extensive experiment in low-cost commodity hardware. The rTuner is able to improve the total job execution time of MapReduce significantly, either a heterogeneous environment or homogeneous environment. The rTuner is capable of reducing the execution time by 86.86 seconds and 100.67 seconds on an average over the Longest Approximate Time to End (LATE) in homogeneous and heterogeneous environment respectively. In addition, the rTuner is also able to improve the execution time by 142.44 seconds and 132.52 seconds over LATE in homogeneous and heterogeneous environment at the best situation respectively.
rTuner: MapReduce作业的性能提升
本文提出了一种新的任务调度算法rTuner。rTuner的主要目标是提高异构环境中任务的执行时间。因为reduce任务是一个非常昂贵的过程。与map任务不同,reduce任务包括三个阶段,即复制阶段、洗牌阶段和reduce阶段。因此,如果调度算法没有对底层情况进行分析,那么重新调度一个straggler reduce任务可能会对性能产生负面影响。rTuner分析reduce任务的散列原因,并对reduce任务进行调优。如果一个reduce任务变得掉队,那么rTuner将根据情况在合适的节点中重新调度它。我们的基准测试结果表明,reduce任务的增强显著提高了CPU运行时间。此外,我们通过在低成本商品硬件上的大量实验证明了rTuner的有效性。rTuner能够显著提高MapReduce的总作业执行时间,无论是在异构环境还是同构环境中。在同质和异构环境中,rTuner能够在最长近似结束时间(LATE)上分别平均减少86.86秒和100.67秒的执行时间。此外,在最佳情况下,rTuner还能够在同构和异构环境中分别比LATE提高142.44秒和132.52秒的执行时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信