{"title":"CARL: Cost-Optimized Online Container Placement on VMs Using Adversarial Reinforcement Learning","authors":"Prathamesh Saraf Vinayak;Saswat Subhajyoti Mallick;Lakshmi Jagarlamudi;Anirban Chakraborty;Yogesh Simmhan","doi":"10.1109/TCC.2025.3528446","DOIUrl":null,"url":null,"abstract":"Containerization has become popular for the deployment of applications on public clouds. Large enterprises may host 100 s of applications on 1000 s containers that are placed onto Virtual Machines (VMs). Such placement decisions happen continuously as applications are updated by DevOps pipelines that deploy the containers. Managing the placement of container resource requests onto the available capacities of VMs needs to be cost-efficient. This is well-studied, and usually modelled as a multi-dimensional Vector Bin-packing Problem (VBP). Many heuristics, and recently machine learning approaches, have been developed to solve this NP-hard problem for real-time decisions. We propose CARL, a novel approach to solve VBP through Adversarial Reinforcement Learning (RL) for cost minimization. It mimics the placement behavior of an offline semi-optimal VBP solver (teacher), while automatically learning a reward function for reducing the VM costs which out-performs the teacher. It requires limited historical container workload traces to train, and is resilient to changes in the workload distribution during inferencing. We extensively evaluate CARL on workloads derived from realistic traces from Google and Alibaba for the placement of 5 k–10 k container requests onto 2 k–8 k VMs, and compare it with classic heuristics and state-of-the-art RL methods. (1) CARL is <i>fast</i>, e.g., making placement decisions at <inline-formula><tex-math>$\\approx 1900$</tex-math></inline-formula> requests/sec onto 8,900 candidate VMs. (2) It is <i>efficient</i>, achieving <inline-formula><tex-math>$\\approx 16\\%$</tex-math></inline-formula> lower VM costs than classic and contemporary RL methods. (3) It is <i>robust</i> to changes in the workload, offering competitive results even when the resource needs or inter-arrival time of the container requests skew from the training workload.","PeriodicalId":13202,"journal":{"name":"IEEE Transactions on Cloud Computing","volume":"13 1","pages":"321-335"},"PeriodicalIF":5.3000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cloud Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10839070/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Abstract
Containerization has become popular for deploying applications on public clouds. Large enterprises may host 100s of applications on 1,000s of containers that are placed onto Virtual Machines (VMs). Such placement decisions happen continuously as applications are updated by the DevOps pipelines that deploy the containers. Managing the placement of container resource requests onto the available capacities of VMs needs to be cost-efficient. This is a well-studied problem, usually modelled as a multi-dimensional Vector Bin-Packing Problem (VBP). Many heuristics, and more recently machine learning approaches, have been developed to solve this NP-hard problem for real-time decisions. We propose CARL, a novel approach that solves VBP through Adversarial Reinforcement Learning (RL) for cost minimization. It mimics the placement behavior of an offline semi-optimal VBP solver (the teacher), while automatically learning a reward function for reducing VM costs that allows it to outperform the teacher. It requires only limited historical container workload traces to train, and is resilient to changes in the workload distribution during inferencing. We extensively evaluate CARL on workloads derived from realistic Google and Alibaba traces for the placement of 5K–10K container requests onto 2K–8K VMs, and compare it with classic heuristics and state-of-the-art RL methods. (1) CARL is fast, making placement decisions at ≈1,900 requests/sec onto 8,900 candidate VMs. (2) It is efficient, achieving ≈16% lower VM costs than classic and contemporary RL methods. (3) It is robust to changes in the workload, offering competitive results even when the resource needs or inter-arrival times of the container requests skew from the training workload.
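To make the problem setting concrete, the sketch below illustrates the multi-dimensional Vector Bin-Packing formulation the abstract refers to: container resource requests (CPU, memory) arriving online must be placed onto VMs with finite residual capacities, and the cost depends on which VMs end up hosting containers. This is only a minimal first-fit baseline of the kind the paper compares against, not CARL's adversarial RL method; the class names and the per-VM hourly cost model are illustrative assumptions.

from dataclasses import dataclass
from typing import List, Optional

@dataclass
class VM:
    cpu: float          # remaining CPU capacity (cores)
    mem: float          # remaining memory capacity (GiB)
    hourly_cost: float  # cost incurred if the VM hosts at least one container
    used: bool = False

@dataclass
class ContainerRequest:
    cpu: float
    mem: float

def first_fit(request: ContainerRequest, vms: List[VM]) -> Optional[int]:
    """Place a request on the first VM with enough residual capacity in every dimension."""
    for i, vm in enumerate(vms):
        if vm.cpu >= request.cpu and vm.mem >= request.mem:
            vm.cpu -= request.cpu
            vm.mem -= request.mem
            vm.used = True
            return i
    return None  # no feasible VM; a real system would provision a new one

if __name__ == "__main__":
    vms = [VM(cpu=4, mem=16, hourly_cost=0.20), VM(cpu=8, mem=32, hourly_cost=0.40)]
    requests = [ContainerRequest(2, 4), ContainerRequest(3, 8), ContainerRequest(2, 8)]
    for r in requests:
        print("placed on VM", first_fit(r, vms))
    # Only VMs that actually host containers contribute to the hourly cost.
    print("hourly cost:", sum(vm.hourly_cost for vm in vms if vm.used))

A learned policy such as CARL replaces the fixed placement rule above with a decision that is trained to minimize the resulting VM cost while handling thousands of such requests online.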
About the Journal
The IEEE Transactions on Cloud Computing (TCC) is dedicated to the multidisciplinary field of cloud computing. It is committed to the publication of articles that present innovative research ideas, application results, and case studies in cloud computing, focusing on key technical issues related to theory, algorithms, systems, applications, and performance.