{"title":"LBVP: A load balance algorithm based on Virtual Partition in Hadoop cluster","authors":"Yuanquan Fan, Weiguo Wu, Haijun Cao, Huo Zhu, Wei Wei, Pengfei Zheng","doi":"10.1109/APCLOUDCC.2012.6486508","DOIUrl":null,"url":null,"abstract":"An approach based on Virtual Partition is proposed to improve the load balance in Reduce phase in MapReduce-based system in cloud computing. After each Map task finished, the output keys are partitioned to different virtual partitions according to Hash Function. And LBVP (a load balance algorithm based on continuous virtual partition) is designed to combine all virtual partitions to the same number of reduce tasks, and ensure each reduce task having balanced input data. The experimental results indicate that the load balance of the amount of Reduce function input is improved effectively and the performance is not degraded significantly by using virtual partition and load allocation algorithm.","PeriodicalId":331441,"journal":{"name":"2012 IEEE Asia Pacific Cloud Computing Congress (APCloudCC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE Asia Pacific Cloud Computing Congress (APCloudCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APCLOUDCC.2012.6486508","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
An approach based on Virtual Partition is proposed to improve the load balance in Reduce phase in MapReduce-based system in cloud computing. After each Map task finished, the output keys are partitioned to different virtual partitions according to Hash Function. And LBVP (a load balance algorithm based on continuous virtual partition) is designed to combine all virtual partitions to the same number of reduce tasks, and ensure each reduce task having balanced input data. The experimental results indicate that the load balance of the amount of Reduce function input is improved effectively and the performance is not degraded significantly by using virtual partition and load allocation algorithm.