Accumulative Computation on MapReduce

Q4 Computer Science

Journal of Information Processing Pub Date : 2014-01-22 DOI:10.11185/IMT.9.73

Yu Liu, Kento Emoto, Kiminori Matsuzaki, Zhenjiang Hu

{"title":"Accumulative Computation on MapReduce","authors":"Yu Liu, Kento Emoto, Kiminori Matsuzaki, Zhenjiang Hu","doi":"10.11185/IMT.9.73","DOIUrl":null,"url":null,"abstract":"MapReduce programming model attracts a lot of enthusiasm among both industry and academia, largely because it simplifies the implementations of many data parallel applications. In spite of the simplicity of the program- ming model, there are many applications that are hard to be implemented by MapReduce, due to their innate characters of computational dependency. In this paper we propose a new approach of using the programming pattern accumulate over MapReduce, to handle a large class of problems that cannot be simply divided into independent sub-computations. Using this accumulate pattern, many problems that have computational dependency can be easily expressed, and then the programs will be transformed to MapReduce programs executed on large clusters. Users without much knowledge of MapReduce can also easily write programs in a sequential manner but finally obtain efficient and scalable MapRe- duce programs. We describe the programming interface of our accumulate framework and explain how to transform a user-specified accumulate computation to an efficient MapReduce program. Our experiments and evaluations illustrate the usefulness and efficiency of the framework.","PeriodicalId":16243,"journal":{"name":"Journal of Information Processing","volume":"9 1","pages":"73-82"},"PeriodicalIF":0.0000,"publicationDate":"2014-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11185/IMT.9.73","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 4

Abstract

MapReduce programming model attracts a lot of enthusiasm among both industry and academia, largely because it simplifies the implementations of many data parallel applications. In spite of the simplicity of the program- ming model, there are many applications that are hard to be implemented by MapReduce, due to their innate characters of computational dependency. In this paper we propose a new approach of using the programming pattern accumulate over MapReduce, to handle a large class of problems that cannot be simply divided into independent sub-computations. Using this accumulate pattern, many problems that have computational dependency can be easily expressed, and then the programs will be transformed to MapReduce programs executed on large clusters. Users without much knowledge of MapReduce can also easily write programs in a sequential manner but finally obtain efficient and scalable MapRe- duce programs. We describe the programming interface of our accumulate framework and explain how to transform a user-specified accumulate computation to an efficient MapReduce program. Our experiments and evaluations illustrate the usefulness and efficiency of the framework.

查看原文本刊更多论文

MapReduce的累计计算

MapReduce编程模型吸引了工业界和学术界的大量热情，主要是因为它简化了许多数据并行应用程序的实现。尽管MapReduce的编程模型很简单，但由于其固有的计算依赖性，许多应用程序很难被MapReduce实现。在本文中，我们提出了一种使用MapReduce上的编程模式累积的新方法，以处理不能简单地划分为独立子计算的大类问题。使用这种累积模式，可以很容易地表达许多具有计算依赖性的问题，然后将程序转换为在大型集群上执行的MapReduce程序。没有太多MapReduce知识的用户也可以轻松地以顺序方式编写程序，但最终获得高效且可扩展的MapReduce程序。我们描述了我们的累积框架的编程接口，并解释了如何将用户指定的累积计算转换为高效的MapReduce程序。我们的实验和评估说明了该框架的有效性和有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Information Processing Computer Science-Computer Science (all)

CiteScore

1.20

自引率

0.00%

发文量