A-MapCG: An Adaptive MapReduce Framework for GPUs

2017 International Conference on Networking, Architecture, and Storage (NAS) Pub Date : 2017-08-01 DOI:10.1109/NAS.2017.8026842

Lifeng Liu, Yue Zhang, Meilin Liu, Chong-Jun Wang, Jun Wang

{"title":"A-MapCG: An Adaptive MapReduce Framework for GPUs","authors":"Lifeng Liu, Yue Zhang, Meilin Liu, Chong-Jun Wang, Jun Wang","doi":"10.1109/NAS.2017.8026842","DOIUrl":null,"url":null,"abstract":"The MapReduce framework proposed by Google to process large data sets is an efficient framework used in many areas, such as social network, scientific research, electronic business, etc. Hence, many MapReduce frameworks are proposed and implemented on different platforms. However, these MapReduce frameworks have limitations, and they cannot handle the collision problem in the map phase, and the unbalanced workload problem in the reduce phase. In this paper, an Adaptive MapReduce Framework (A-MapCG) is proposed based on the MapCG framework, to further improve the MapReduce performance on GPU platforms. Based on the experiments, we observed that for certain MapReduce applications emitting multiple Key/value (K/V) pairs for the same key, the atomic collision problem degrades the map phase performance of the MapReduce framework substantially. In addition, the workload unbalance problem wastes parallel computing resources and limits the overall reduction phase performance of the MapReduce framework on GPU platforms. A-MapCG uses segmentation table and intra-warp combination to reduce the number of collisions during the map phase. A-MapCG also adopts balanced workload assignment to improve the reduce phase performance. The proposed A-MapCG framework is evaluated on the Tesla K40 GPU hosted by Intel Core i7-4790. The case study shows that the map phase of A-MapCG achieves a speedup of 4.63 over MapCG for the test case, Word Count, with a 64MB workload. The average reduce phase speedup of A-MapCG over MapCG with parallel reductions of Word Count is 6.92. The average reduce phase speedup of A-MapCG over MapCG with serial reductions of Word Count is 4.11.","PeriodicalId":222161,"journal":{"name":"2017 International Conference on Networking, Architecture, and Storage (NAS)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Networking, Architecture, and Storage (NAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NAS.2017.8026842","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

The MapReduce framework proposed by Google to process large data sets is an efficient framework used in many areas, such as social network, scientific research, electronic business, etc. Hence, many MapReduce frameworks are proposed and implemented on different platforms. However, these MapReduce frameworks have limitations, and they cannot handle the collision problem in the map phase, and the unbalanced workload problem in the reduce phase. In this paper, an Adaptive MapReduce Framework (A-MapCG) is proposed based on the MapCG framework, to further improve the MapReduce performance on GPU platforms. Based on the experiments, we observed that for certain MapReduce applications emitting multiple Key/value (K/V) pairs for the same key, the atomic collision problem degrades the map phase performance of the MapReduce framework substantially. In addition, the workload unbalance problem wastes parallel computing resources and limits the overall reduction phase performance of the MapReduce framework on GPU platforms. A-MapCG uses segmentation table and intra-warp combination to reduce the number of collisions during the map phase. A-MapCG also adopts balanced workload assignment to improve the reduce phase performance. The proposed A-MapCG framework is evaluated on the Tesla K40 GPU hosted by Intel Core i7-4790. The case study shows that the map phase of A-MapCG achieves a speedup of 4.63 over MapCG for the test case, Word Count, with a 64MB workload. The average reduce phase speedup of A-MapCG over MapCG with parallel reductions of Word Count is 6.92. The average reduce phase speedup of A-MapCG over MapCG with serial reductions of Word Count is 4.11.

查看原文本刊更多论文

A-MapCG: gpu的自适应MapReduce框架

Google提出的用于处理大数据集的MapReduce框架是一个应用于社交网络、科研、电子商务等多个领域的高效框架。因此，许多MapReduce框架被提出并在不同的平台上实现。但是，这些MapReduce框架都有局限性，无法处理map阶段的冲突问题和reduce阶段的工作负载不平衡问题。为了进一步提高MapReduce在GPU平台上的性能，本文在MapCG框架的基础上提出了一种自适应MapReduce框架(A-MapCG)。在实验的基础上，我们观察到，对于某些MapReduce应用程序发出多个Key/value (K/V)对相同的键，原子碰撞问题大大降低了MapReduce框架的映射阶段性能。此外，负载不平衡问题浪费了并行计算资源，限制了MapReduce框架在GPU平台上的整体约简阶段性能。A-MapCG使用分割表和intra-warp组合来减少映射阶段的碰撞次数。A-MapCG还采用均衡的工作负载分配来提高reduce phase的性能。在Intel酷睿i7-4790托管的Tesla K40 GPU上对所提出的A-MapCG框架进行了评估。案例研究表明，对于测试用例Word Count，在64MB的工作负载下，a -MapCG的地图阶段比MapCG实现了4.63的加速。A-MapCG的平均约简相位加速比并行约简字数的MapCG快6.92。A-MapCG的平均约简相位加速比连续约简字数的MapCG的平均约简相位加速为4.11。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 International Conference on Networking, Architecture, and Storage (NAS)

自引率

0.00%

发文量