{"title":"Distributed Rate Scaling in Large-Scale Service Systems","authors":"Daan Rutten, Martin Zubeldia, Debankur Mukherjee","doi":"10.1145/3626570.3626579","DOIUrl":null,"url":null,"abstract":"We consider a large-scale parallel-server system, where each server dynamically chooses its processing speed in a completely distributed fashion. The goal is to minimize the global cost that is the sum of the average cost of maintaining the respective processing speeds of all servers and a certain non-decreasing function of the sojourn time of tasks. The key challenges arise from the facts that the arrival rate of tasks is unknown and that there is no centralized control or communication among the servers. Using insights from stochastic approximation, we develop a novel rate-scaling algorithm and prove that the cost of the processing rates under our algorithm converges to the globally optimum value as the system size becomes large. En route, we also analyze the performance of a fully heterogeneous parallel-server system (i.e, where each server has a different processing speed), which might be of independent interest.","PeriodicalId":35745,"journal":{"name":"Performance Evaluation Review","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Performance Evaluation Review","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3626570.3626579","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0
Abstract
We consider a large-scale parallel-server system, where each server dynamically chooses its processing speed in a completely distributed fashion. The goal is to minimize the global cost that is the sum of the average cost of maintaining the respective processing speeds of all servers and a certain non-decreasing function of the sojourn time of tasks. The key challenges arise from the facts that the arrival rate of tasks is unknown and that there is no centralized control or communication among the servers. Using insights from stochastic approximation, we develop a novel rate-scaling algorithm and prove that the cost of the processing rates under our algorithm converges to the globally optimum value as the system size becomes large. En route, we also analyze the performance of a fully heterogeneous parallel-server system (i.e, where each server has a different processing speed), which might be of independent interest.