Shuffles and Circuits (On Lower Bounds for Modern Parallel Computation)

Journal of the ACM (JACM) Pub Date : 2018-11-22 DOI:10.1145/3232536

T. Roughgarden, Sergei Vassilvitskii, Joshua R. Wang

{"title":"Shuffles and Circuits (On Lower Bounds for Modern Parallel Computation)","authors":"T. Roughgarden, Sergei Vassilvitskii, Joshua R. Wang","doi":"10.1145/3232536","DOIUrl":null,"url":null,"abstract":"The goal of this article is to identify fundamental limitations on how efficiently algorithms implemented on platforms such as MapReduce and Hadoop can compute the central problems in motivating application domains, such as graph connectivity problems. We introduce an abstract model of massively parallel computation, where essentially the only restrictions are that the “fan-in” of each machine is limited to s bits, where s is smaller than the input size n, and that computation proceeds in synchronized rounds, with no communication between different machines within a round. Lower bounds on the round complexity of a problem in this model apply to every computing platform that shares the most basic design principles of MapReduce-type systems. We prove that computations in our model that use few rounds can be represented as low-degree polynomials over the reals. This connection allows us to translate a lower bound on the (approximate) polynomial degree of a Boolean function to a lower bound on the round complexity of every (randomized) massively parallel computation of that function. These lower bounds apply even in the “unbounded width” version of our model, where the number of machines can be arbitrarily large. As one example of our general results, computing any nontrivial monotone graph property—such as connectivity—requires a super-constant number of rounds when every machine receives only a subpolynomial (in n) number of input bits s. Finally, we prove that, in two senses, our lower bounds are the best one could hope for. For the unbounded-width model, we prove a matching upper bound. Restricting to a polynomial number of machines, we show that asymptotically better lower bounds would separate P from NC1.","PeriodicalId":17199,"journal":{"name":"Journal of the ACM (JACM)","volume":"23 1","pages":"1 - 24"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the ACM (JACM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3232536","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 23

Abstract

The goal of this article is to identify fundamental limitations on how efficiently algorithms implemented on platforms such as MapReduce and Hadoop can compute the central problems in motivating application domains, such as graph connectivity problems. We introduce an abstract model of massively parallel computation, where essentially the only restrictions are that the “fan-in” of each machine is limited to s bits, where s is smaller than the input size n, and that computation proceeds in synchronized rounds, with no communication between different machines within a round. Lower bounds on the round complexity of a problem in this model apply to every computing platform that shares the most basic design principles of MapReduce-type systems. We prove that computations in our model that use few rounds can be represented as low-degree polynomials over the reals. This connection allows us to translate a lower bound on the (approximate) polynomial degree of a Boolean function to a lower bound on the round complexity of every (randomized) massively parallel computation of that function. These lower bounds apply even in the “unbounded width” version of our model, where the number of machines can be arbitrarily large. As one example of our general results, computing any nontrivial monotone graph property—such as connectivity—requires a super-constant number of rounds when every machine receives only a subpolynomial (in n) number of input bits s. Finally, we prove that, in two senses, our lower bounds are the best one could hope for. For the unbounded-width model, we prove a matching upper bound. Restricting to a polynomial number of machines, we show that asymptotically better lower bounds would separate P from NC1.

查看原文本刊更多论文

洗牌与电路(关于现代并行计算的下界)

本文的目标是确定在MapReduce和Hadoop等平台上实现的算法如何有效地计算激励应用程序领域中的核心问题(如图连接问题)的基本限制。我们引入了一个大规模并行计算的抽象模型，其本质上唯一的限制是每台机器的“扇入”被限制为s位，其中s小于输入大小n，并且计算以同步轮进行，在一轮内不同机器之间没有通信。该模型中问题的循环复杂度的下限适用于共享mapreduce类型系统最基本设计原则的每个计算平台。我们证明了在我们的模型中使用少量轮的计算可以表示为实数上的低次多项式。这种联系允许我们将布尔函数的(近似)多项式度的下界转换为该函数的每个(随机)大规模并行计算的循环复杂度的下界。这些下界甚至适用于我们模型的“无界宽度”版本，其中机器数量可以任意大。作为我们一般结果的一个例子，当每台机器只接收到一个次多项式(n)个数的输入比特s时，计算任何非平凡单调图属性(如连通性)需要一个超常数的轮数。最后，我们证明，在两种意义上，我们的下界是我们所能期望的最好的。对于无边界宽度模型，我们证明了一个匹配的上界。在多项式机器数量的限制下，我们证明了渐近更好的下界可以将P与NC1分开。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of the ACM (JACM)

自引率

0.00%

发文量