{"title":"Randomized load balancing with a helper","authors":"Chunpu Wang, Chen Feng, Julian Cheng","doi":"10.1109/ICCNC.2017.7876182","DOIUrl":null,"url":null,"abstract":"In a cloud environment, a scheduler assigns arriving tasks to one of many servers, with the goal of minimizing response times. There are two conventional approaches to cloud scheduling. The first is called the Join-the-Shortest-Queue (JSQ) algorithm, which directs an arriving task to the least loaded server. Despite its excellent delay performance, JSQ is throughput-limited, and thus doesn't scale well with the number of servers. The second is called the Power-of-d-choices (Pod) algorithm, which selects d servers at random and routes a task to the least loaded server of the d servers. Despite its scalability, Pod suffers from long tail response times. In this paper, a hybrid scheduling strategy is proposed, and it consists of a Pod scheduler and a throughput-limited helper. Hybrid scheduling takes the best of both worlds, enjoying scalability and low tail response times. In particular, hybrid scheduling has bounded maximum queue size in the large-system regime, which is in sharp contrast to the Pod scheduling whose maximum queue size is unbounded.","PeriodicalId":135028,"journal":{"name":"2017 International Conference on Computing, Networking and Communications (ICNC)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Computing, Networking and Communications (ICNC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCNC.2017.7876182","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In a cloud environment, a scheduler assigns arriving tasks to one of many servers, with the goal of minimizing response times. There are two conventional approaches to cloud scheduling. The first is called the Join-the-Shortest-Queue (JSQ) algorithm, which directs an arriving task to the least loaded server. Despite its excellent delay performance, JSQ is throughput-limited, and thus doesn't scale well with the number of servers. The second is called the Power-of-d-choices (Pod) algorithm, which selects d servers at random and routes a task to the least loaded server of the d servers. Despite its scalability, Pod suffers from long tail response times. In this paper, a hybrid scheduling strategy is proposed, and it consists of a Pod scheduler and a throughput-limited helper. Hybrid scheduling takes the best of both worlds, enjoying scalability and low tail response times. In particular, hybrid scheduling has bounded maximum queue size in the large-system regime, which is in sharp contrast to the Pod scheduling whose maximum queue size is unbounded.
在云环境中,调度器将到达的任务分配给众多服务器中的一个,其目标是最小化响应时间。有两种传统的云调度方法。第一种称为最短队列连接(join - The - short - queue, JSQ)算法,它将到达的任务定向到负载最少的服务器。尽管具有出色的延迟性能,但JSQ的吞吐量有限,因此不能很好地随服务器数量进行扩展。第二种算法称为Power-of-d-choices (Pod)算法,它随机选择d个服务器,并将任务路由到d个服务器中负载最少的服务器。尽管具有可扩展性,但Pod的响应时间长。本文提出了一种混合调度策略,该策略由Pod调度程序和吞吐量有限的助手组成。混合调度充分利用了两者的优点,享有可伸缩性和低尾响应时间。特别是在大系统环境下,混合调度的最大队列大小是有界的,这与Pod调度的最大队列大小是无界的形成了鲜明对比。