LBVP: A load balance algorithm based on Virtual Partition in Hadoop cluster

Yuanquan Fan, Weiguo Wu, Haijun Cao, Huo Zhu, Wei Wei, Pengfei Zheng
{"title":"LBVP: A load balance algorithm based on Virtual Partition in Hadoop cluster","authors":"Yuanquan Fan, Weiguo Wu, Haijun Cao, Huo Zhu, Wei Wei, Pengfei Zheng","doi":"10.1109/APCLOUDCC.2012.6486508","DOIUrl":null,"url":null,"abstract":"An approach based on Virtual Partition is proposed to improve the load balance in Reduce phase in MapReduce-based system in cloud computing. After each Map task finished, the output keys are partitioned to different virtual partitions according to Hash Function. And LBVP (a load balance algorithm based on continuous virtual partition) is designed to combine all virtual partitions to the same number of reduce tasks, and ensure each reduce task having balanced input data. The experimental results indicate that the load balance of the amount of Reduce function input is improved effectively and the performance is not degraded significantly by using virtual partition and load allocation algorithm.","PeriodicalId":331441,"journal":{"name":"2012 IEEE Asia Pacific Cloud Computing Congress (APCloudCC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE Asia Pacific Cloud Computing Congress (APCloudCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APCLOUDCC.2012.6486508","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

Abstract

An approach based on Virtual Partition is proposed to improve the load balance in Reduce phase in MapReduce-based system in cloud computing. After each Map task finished, the output keys are partitioned to different virtual partitions according to Hash Function. And LBVP (a load balance algorithm based on continuous virtual partition) is designed to combine all virtual partitions to the same number of reduce tasks, and ensure each reduce task having balanced input data. The experimental results indicate that the load balance of the amount of Reduce function input is improved effectively and the performance is not degraded significantly by using virtual partition and load allocation algorithm.
LBVP: Hadoop集群中基于虚拟分区的负载均衡算法
针对云计算中基于mapreduce的系统中Reduce阶段的负载平衡问题,提出了一种基于虚拟分区的方法。每个Map任务完成后,根据Hash Function将输出键划分到不同的虚拟分区。LBVP(一种基于连续虚拟分区的负载平衡算法)旨在将所有虚拟分区组合为相同数量的reduce任务,并确保每个reduce任务具有均衡的输入数据。实验结果表明,虚拟分区和负载分配算法有效地改善了Reduce函数输入量的负载平衡,性能没有明显下降。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信