Local and Global Optimization of MapReduce Program Model

2011 IEEE World Congress on Services Pub Date : 2011-07-04 DOI:10.1109/SERVICES.2011.64

Congchong Liu, Shujia Zhou

引用次数: 4

Abstract

MapReduce, which was introduced by Google, provides two functional interfaces, Map and Reduce, for a user to write the user-specific code to process the large amount of data. It has been widely deployed in cloud computing systems. The parallel tasks, data partition, and data transit are automatically managed by its runtime system. This paper proposes a solution to optimize the MapReduce program model and demonstrate it with X10. We develop an adaptive load distribution scheme to balance the load on each node and consequently reduce across-node communication cost occurring in the Reduce function. In addition, we exploit shared-memory in each node to further reduce the communication cost with multi-core programming.

查看原文本刊更多论文

MapReduce程序模型的局部和全局优化

MapReduce是由Google推出的，它提供了Map和Reduce两个功能接口，供用户编写特定于用户的代码来处理大量数据。它已被广泛部署在云计算系统中。并行任务、数据分区和数据传输由运行时系统自动管理。本文提出了一种优化MapReduce程序模型的方案，并用X10进行了验证。我们开发了一种自适应负载分配方案来平衡每个节点上的负载，从而减少reduce函数中发生的跨节点通信开销。此外，我们利用每个节点的共享内存，进一步降低了多核编程的通信成本。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 IEEE World Congress on Services

自引率

0.00%

发文量