Balanced Allocations in Batches: Simplified and Generalized

Proceedings of the 34th ACM Symposium on Parallelism in Algorithms and Architectures Pub Date : 2022-03-25 DOI:10.1145/3490148.3538593

Dimitrios Los, Thomas Sauerwald

{"title":"Balanced Allocations in Batches: Simplified and Generalized","authors":"Dimitrios Los, Thomas Sauerwald","doi":"10.1145/3490148.3538593","DOIUrl":null,"url":null,"abstract":"We consider the allocation of m balls (jobs) into n bins (servers). In the Two-Choice process, for each of m sequentially arriving balls, two randomly chosen bins are sampled and the ball is placed in the least loaded bin. It is well-known that the maximum load is m/n + log2 logn + O(1) with high probability. Berenbrink, Czumaj, Englert, Friedetzky and Nagel [7] introduced a parallel version of this process, where m balls arrive in consecutive batches of size b = n each. Balls within the same batch are allocated in parallel, using the load information of the bins at the beginning of the batch. They proved that the gap of this process is O(logn) with high probability. In this work, we present a new analysis of this setting, which is based on exponential potential functions. This allows us to both simplify and generalize the analysis of [7] in different ways: (1) Our analysis covers a broad class of processes. This includes not only Two-Choice, but also processes with fewer bin samples like the (1 + β)-process, processes which can only receive one bit of information from each bin sample and graphical allocation, where bins correspond to vertices in a graph. (2) Balls may be of different weights, as long as their weights are independent samples from a distribution satisfying a technical condition on its moment generating function. (3) For any batch sizes b ≥ n, we prove a gap of is O (b/n·logn). For any b ∈ [n, n3], we improve this to is O (b/n + logn) and show that it is tight for a family of processes. This implies the unexpected result that for e.g. the (1 + β)-process with constant β ∈ (0, 1], the gap is Θ(logn) for all b ∈ [n, n logn]. We also conduct experiments which support our theoretical results, and even hint at a superiority of less powerful processes like (1+ β) for large batch sizes. Full version of the paper at: https://arxiv.org/abs/2203.13902.","PeriodicalId":112865,"journal":{"name":"Proceedings of the 34th ACM Symposium on Parallelism in Algorithms and Architectures","volume":"88 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 34th ACM Symposium on Parallelism in Algorithms and Architectures","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3490148.3538593","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

We consider the allocation of m balls (jobs) into n bins (servers). In the Two-Choice process, for each of m sequentially arriving balls, two randomly chosen bins are sampled and the ball is placed in the least loaded bin. It is well-known that the maximum load is m/n + log2 logn + O(1) with high probability. Berenbrink, Czumaj, Englert, Friedetzky and Nagel [7] introduced a parallel version of this process, where m balls arrive in consecutive batches of size b = n each. Balls within the same batch are allocated in parallel, using the load information of the bins at the beginning of the batch. They proved that the gap of this process is O(logn) with high probability. In this work, we present a new analysis of this setting, which is based on exponential potential functions. This allows us to both simplify and generalize the analysis of [7] in different ways: (1) Our analysis covers a broad class of processes. This includes not only Two-Choice, but also processes with fewer bin samples like the (1 + β)-process, processes which can only receive one bit of information from each bin sample and graphical allocation, where bins correspond to vertices in a graph. (2) Balls may be of different weights, as long as their weights are independent samples from a distribution satisfying a technical condition on its moment generating function. (3) For any batch sizes b ≥ n, we prove a gap of is O (b/n·logn). For any b ∈ [n, n3], we improve this to is O (b/n + logn) and show that it is tight for a family of processes. This implies the unexpected result that for e.g. the (1 + β)-process with constant β ∈ (0, 1], the gap is Θ(logn) for all b ∈ [n, n logn]. We also conduct experiments which support our theoretical results, and even hint at a superiority of less powerful processes like (1+ β) for large batch sizes. Full version of the paper at: https://arxiv.org/abs/2203.13902.

查看原文本刊更多论文

分批均衡分配:简化和一般化

我们考虑将m个球(作业)分配到n个箱(服务器)中。在two - choice过程中，对于m个依次到达的球，随机选择两个箱子进行采样，并将球放置在装载最少的箱子中。众所周知，最大负载是m/n + log2 logn + O(1)，且概率很大。Berenbrink, Czumaj, Englert, Friedetzky和Nagel引入了这个过程的并行版本，其中m个球以每个大小为b = n的连续批次到达。在同一批次中的球被并行分配，使用在批次开始时的桶的负载信息。他们高概率地证明了这一过程的间隙为O(logn)。在这项工作中，我们提出了一种基于指数势函数的新分析方法。这使我们能够以不同的方式简化和概括[7]的分析:(1)我们的分析涵盖了广泛的过程类别。这不仅包括Two-Choice，还包括较少的bin样本的处理，如(1 + β)-过程，只能从每个bin样本和图形分配中接收一位信息的过程，其中bin对应于图中的顶点。(2)球可以有不同的权值，只要它们的权值是一个分布中满足技术条件的独立样本。(3)对于任意批大小b≥n，我们证明了缺口为O (b/n·logn)。对于任意b∈[n, n3]，我们将其改进为O (b/n + logn)，并证明它对于一系列过程是紧的。这意味着意想不到的结果，例如，对于常数β∈(0,1)的(1 + β)-过程，对于所有b∈[n, n logn]，间隙为Θ(logn)。我们还进行了实验来支持我们的理论结果，甚至暗示了(1+ β)等不太强大的工艺在大批量生产中的优越性。全文见:https://arxiv.org/abs/2203.13902。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 34th ACM Symposium on Parallelism in Algorithms and Architectures

自引率

0.00%

发文量