Efficient load balancing in large-scale systems

2016 Annual Conference on Information Science and Systems (CISS) Pub Date : 2016-03-16 DOI:10.1109/CISS.2016.7460533

Debankur Mukherjee, S. Borst, J. V. Leeuwaarden, P. Whiting

{"title":"Efficient load balancing in large-scale systems","authors":"Debankur Mukherjee, S. Borst, J. V. Leeuwaarden, P. Whiting","doi":"10.1109/CISS.2016.7460533","DOIUrl":null,"url":null,"abstract":"We consider a system of N identical parallel server pools and a single dispatcher where tasks arrive as a Poisson process. Arriving tasks cannot be queued, and must immediately be assigned to one of the server pools to start execution. The execution times are assumed to be exponentially distributed, and do not depend on the number of tasks contending for service. However, the experienced performance (e.g. in terms of received throughput or packet-level delay) does degrade with an increasing number of concurrent tasks at the same server pool. In order to optimize the performance, the dispatcher therefore aims to evenly distribute the tasks across the various server pools, using either a power-of-d or a threshold-based load balancing scheme. In the power-of-d scheme, an arriving task is assigned to the server pool with the minimum number of active tasks among d(N) randomly selected server pools (1 ≤ d(N) ≤ N). In the threshold-based scheme, an incoming task is dispatched to an arbitrary server pool with fewer than L active tasks, if there is any, to an arbitrary server pool with fewer than H > L tasks otherwise, or to a randomly selected server pool if all server pools have H or more tasks. This scheme can be implemented in a server-driven manner, with O(1) communication overhead per task, as opposed to O(d(N)) in the power-of-d scheme. We derive the fluid-level dynamics for the power-of-d scheme with d(N) → ∞ as N → ∞ and the threshold-based scheme, along with the associated fixed points. As it turns out, the fluid limit for the power-of-d scheme does not depend on the exact growth rate of d(N). We also characterize the diffusion-level behavior of the power-of-d scheme with d(N) ≫ √N log(N), and show that it coincides with that of the threshold-based scheme with suitably selected parameters L and H. In particular, the threshold-based scheme can achieve similar performance as the power-of-d scheme with d(N) ≫ √N log(N), and thus diffusion-level optimality, with only O(1) rather than O(N) communication overhead per task.","PeriodicalId":346776,"journal":{"name":"2016 Annual Conference on Information Science and Systems (CISS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Annual Conference on Information Science and Systems (CISS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISS.2016.7460533","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

We consider a system of N identical parallel server pools and a single dispatcher where tasks arrive as a Poisson process. Arriving tasks cannot be queued, and must immediately be assigned to one of the server pools to start execution. The execution times are assumed to be exponentially distributed, and do not depend on the number of tasks contending for service. However, the experienced performance (e.g. in terms of received throughput or packet-level delay) does degrade with an increasing number of concurrent tasks at the same server pool. In order to optimize the performance, the dispatcher therefore aims to evenly distribute the tasks across the various server pools, using either a power-of-d or a threshold-based load balancing scheme. In the power-of-d scheme, an arriving task is assigned to the server pool with the minimum number of active tasks among d(N) randomly selected server pools (1 ≤ d(N) ≤ N). In the threshold-based scheme, an incoming task is dispatched to an arbitrary server pool with fewer than L active tasks, if there is any, to an arbitrary server pool with fewer than H > L tasks otherwise, or to a randomly selected server pool if all server pools have H or more tasks. This scheme can be implemented in a server-driven manner, with O(1) communication overhead per task, as opposed to O(d(N)) in the power-of-d scheme. We derive the fluid-level dynamics for the power-of-d scheme with d(N) → ∞ as N → ∞ and the threshold-based scheme, along with the associated fixed points. As it turns out, the fluid limit for the power-of-d scheme does not depend on the exact growth rate of d(N). We also characterize the diffusion-level behavior of the power-of-d scheme with d(N) ≫ √N log(N), and show that it coincides with that of the threshold-based scheme with suitably selected parameters L and H. In particular, the threshold-based scheme can achieve similar performance as the power-of-d scheme with d(N) ≫ √N log(N), and thus diffusion-level optimality, with only O(1) rather than O(N) communication overhead per task.

查看原文本刊更多论文

大规模系统中的高效负载均衡

我们考虑一个由N个相同的并行服务器池和单个调度程序组成的系统，其中任务作为泊松进程到达。到达的任务不能排队，必须立即分配给其中一个服务器池以开始执行。假设执行时间呈指数分布，并且不依赖于争用服务的任务数量。然而，体验性能(例如，接收吞吐量或包级延迟)确实会随着同一服务器池中并发任务数量的增加而降低。因此，为了优化性能，调度程序的目标是使用power-of-d或基于阈值的负载平衡方案，在不同的服务器池中均匀地分配任务。power-of-d计划,到达一个任务被分配给服务器池的最小数量的活动任务中d (N)随机选择服务器池(1 d≤(N)≤N)。基于阈值方案,传入的任务分派到一个任意服务器池少于L活动任务,如果有任何,任意服务器池少于H > L的任务,否则,或一个随机选择的服务器池如果所有服务器池有H或多个任务。此方案可以以服务器驱动的方式实现，每个任务的通信开销为0(1)，而在d的幂方案中为0 (d(N))。我们导出了当d(N)→∞为N→∞时的d的幂格式和基于阈值的格式的液位动力学，以及相关的不动点。事实证明，d的幂方案的流体极限并不取决于d(N)的确切增长率。我们还描述了d(N) >√N log(N)的d次方方案的扩散级行为，并表明它与适当选择参数L和h的基于阈值的方案的扩散级行为是一致的。特别是，基于阈值的方案可以达到与d(N) >√N log(N)的d次方方案相似的性能，因此具有扩散级最优性，每个任务只有O(1)而不是O(N)的通信开销。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 Annual Conference on Information Science and Systems (CISS)

自引率

0.00%

发文量