Applying Map-Reduce Paradigm for Parallel Closed Cube Computation

2009 First International Confernce on Advances in Databases, Knowledge, and Data Applications Pub Date : 2009-03-01 DOI:10.1109/DBKDA.2009.32

Kuznecov Sergey, K. Yury

引用次数: 33

Abstract

After many years of studies, efficient data cube computation remains an open field of research due to ever-growing amounts of data. One of the most efficient algorithms (quotient cubes) is based on the notion of cube cells closure, condensing groups of cells into equivalence classes, which allows to losslessly decrease amount of data to be stored. Recently developed parallel computation framework Map-Reduce lead to a new wave of interest to large-scale algorithms for data analysis (and to so called cloud-computing paradigm). This paper is devoted to applying such approaches to data and computation intensive task of OLAP-cube computation. We show that there are two scales of Map-Reduce applicability (for local multicore or multiprocessor server and multi-server clusters), present cube construction and query processing algorithms used at the both levels. Experimental results demonstrate that algorithms are scalable.

查看原文本刊更多论文

Map-Reduce范式在并行闭立方计算中的应用

经过多年的研究，由于数据量的不断增长，高效的数据立方体计算仍然是一个开放的研究领域。最有效的算法之一(商立方体)是基于立方体单元闭包的概念，将单元组压缩成等价类，这允许无损地减少要存储的数据量。最近开发的并行计算框架Map-Reduce引起了对大规模数据分析算法(以及所谓的云计算范式)的新兴趣。本文致力于将这些方法应用于OLAP-cube计算中的数据密集型和计算密集型任务。我们展示了Map-Reduce的适用性有两种尺度(适用于本地多核或多处理器服务器和多服务器集群)，并给出了在这两个级别上使用的多维数据集构建和查询处理算法。实验结果表明，该算法具有可扩展性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 First International Confernce on Advances in Databases, Knowledge, and Data Applications

自引率

0.00%

发文量