{"title":"大规模服务系统中的分布式速率缩放","authors":"Daan Rutten, Martin Zubeldia, Debankur Mukherjee","doi":"10.1145/3626570.3626579","DOIUrl":null,"url":null,"abstract":"We consider a large-scale parallel-server system, where each server dynamically chooses its processing speed in a completely distributed fashion. The goal is to minimize the global cost that is the sum of the average cost of maintaining the respective processing speeds of all servers and a certain non-decreasing function of the sojourn time of tasks. The key challenges arise from the facts that the arrival rate of tasks is unknown and that there is no centralized control or communication among the servers. Using insights from stochastic approximation, we develop a novel rate-scaling algorithm and prove that the cost of the processing rates under our algorithm converges to the globally optimum value as the system size becomes large. En route, we also analyze the performance of a fully heterogeneous parallel-server system (i.e, where each server has a different processing speed), which might be of independent interest.","PeriodicalId":35745,"journal":{"name":"Performance Evaluation Review","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Distributed Rate Scaling in Large-Scale Service Systems\",\"authors\":\"Daan Rutten, Martin Zubeldia, Debankur Mukherjee\",\"doi\":\"10.1145/3626570.3626579\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider a large-scale parallel-server system, where each server dynamically chooses its processing speed in a completely distributed fashion. The goal is to minimize the global cost that is the sum of the average cost of maintaining the respective processing speeds of all servers and a certain non-decreasing function of the sojourn time of tasks. The key challenges arise from the facts that the arrival rate of tasks is unknown and that there is no centralized control or communication among the servers. Using insights from stochastic approximation, we develop a novel rate-scaling algorithm and prove that the cost of the processing rates under our algorithm converges to the globally optimum value as the system size becomes large. En route, we also analyze the performance of a fully heterogeneous parallel-server system (i.e, where each server has a different processing speed), which might be of independent interest.\",\"PeriodicalId\":35745,\"journal\":{\"name\":\"Performance Evaluation Review\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Performance Evaluation Review\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3626570.3626579\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Performance Evaluation Review","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3626570.3626579","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}
Distributed Rate Scaling in Large-Scale Service Systems
We consider a large-scale parallel-server system, where each server dynamically chooses its processing speed in a completely distributed fashion. The goal is to minimize the global cost that is the sum of the average cost of maintaining the respective processing speeds of all servers and a certain non-decreasing function of the sojourn time of tasks. The key challenges arise from the facts that the arrival rate of tasks is unknown and that there is no centralized control or communication among the servers. Using insights from stochastic approximation, we develop a novel rate-scaling algorithm and prove that the cost of the processing rates under our algorithm converges to the globally optimum value as the system size becomes large. En route, we also analyze the performance of a fully heterogeneous parallel-server system (i.e, where each server has a different processing speed), which might be of independent interest.