Load balancing guardrails: keeping your heavy traffic on the road to low response times (invited paper)

Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing Pub Date : 2021-06-15 DOI:10.1145/3406325.3465359

Isaac Grosof, Ziv Scully, Mor Harchol-Balter

{"title":"Load balancing guardrails: keeping your heavy traffic on the road to low response times (invited paper)","authors":"Isaac Grosof, Ziv Scully, Mor Harchol-Balter","doi":"10.1145/3406325.3465359","DOIUrl":null,"url":null,"abstract":"This talk is about scheduling and load balancing in a multi-server system, with the goal of minimizing mean response time in a general stochastic setting. We will specifically concentrate on the common case of a load balancing system, where a front-end load balancer (a.k.a. dispatcher) dispatches requests to multiple back-end servers, each with their own queue. Much is known about load balancing in the case where the scheduling at the servers is First-Come-First-Served (FCFS). However, to minimize mean response time, we need to use Shortest-Remaining-Processing-Time (SRPT) scheduling at the servers. Unfortunately, there is almost nothing known about optimal dispatching when SRPT scheduling is used at the servers. To make things worse, it turns out that the traditional dispatching policies that are used in practice with FCFS servers often have poor performance in systems with SRPT servers. In this talk, we devise a simple fix that can be applied to any dispatching policy. This fix, called \"guardrails\" ensures that the dispatching policy yields optimal mean response time under heavy traffic, when used in a system with SRPT servers. Any dispatching policy, when augmented with guardrails becomes heavy-traffic optimal. Our results also yield the first analytical bounds on mean response time for load balancing systems with SRPT scheduling at the servers. Load balancing and scheduling are highly studied both in the stochastic and the worst-case scheduling communities. One aim of this talk is to contrast some differences in the approaches of the two communities when tackling multi-server scheduling problems.","PeriodicalId":132752,"journal":{"name":"Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3406325.3465359","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

This talk is about scheduling and load balancing in a multi-server system, with the goal of minimizing mean response time in a general stochastic setting. We will specifically concentrate on the common case of a load balancing system, where a front-end load balancer (a.k.a. dispatcher) dispatches requests to multiple back-end servers, each with their own queue. Much is known about load balancing in the case where the scheduling at the servers is First-Come-First-Served (FCFS). However, to minimize mean response time, we need to use Shortest-Remaining-Processing-Time (SRPT) scheduling at the servers. Unfortunately, there is almost nothing known about optimal dispatching when SRPT scheduling is used at the servers. To make things worse, it turns out that the traditional dispatching policies that are used in practice with FCFS servers often have poor performance in systems with SRPT servers. In this talk, we devise a simple fix that can be applied to any dispatching policy. This fix, called "guardrails" ensures that the dispatching policy yields optimal mean response time under heavy traffic, when used in a system with SRPT servers. Any dispatching policy, when augmented with guardrails becomes heavy-traffic optimal. Our results also yield the first analytical bounds on mean response time for load balancing systems with SRPT scheduling at the servers. Load balancing and scheduling are highly studied both in the stochastic and the worst-case scheduling communities. One aim of this talk is to contrast some differences in the approaches of the two communities when tackling multi-server scheduling problems.

查看原文本刊更多论文

负载平衡护栏:让繁忙的交通保持低响应时间(特邀论文)

这次演讲是关于多服务器系统中的调度和负载平衡，目标是在一般随机设置下最小化平均响应时间。我们将特别关注负载平衡系统的常见情况，其中前端负载平衡器(又称调度器)将请求分发到多个后端服务器，每个后端服务器都有自己的队列。在服务器上的调度是先到先服务(FCFS)的情况下，对于负载平衡有很多了解。然而，为了最小化平均响应时间，我们需要在服务器上使用最短剩余处理时间(SRPT)调度。不幸的是，当在服务器上使用SRPT调度时，对于最优调度几乎一无所知。更糟糕的是，在实践中使用FCFS服务器的传统调度策略在使用SRPT服务器的系统中通常性能不佳。在本次演讲中，我们将设计一个可以应用于任何调度策略的简单修复方法。此修复称为“护栏”，可确保调度策略在具有SRPT服务器的系统中使用时，在繁忙的流量下产生最佳的平均响应时间。任何调度策略，当增加了护栏时，都是繁忙交通的最佳选择。我们的结果还产生了在服务器上使用SRPT调度的负载平衡系统的平均响应时间的第一个分析界限。负载均衡和调度在随机调度和最坏调度两方面都得到了高度的研究。这次演讲的目的之一是对比两个社区在处理多服务器调度问题时方法的一些差异。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing

自引率

0.00%

发文量