Distributed Resource Autoscaling in Kubernetes Edge Clusters

Dimitrios Spatharakis, Ioannis Dimolitsas, E. Vlahakis, Dimitrios Dechouniotis, N. Athanasopoulos, S. Papavassiliou
{"title":"Distributed Resource Autoscaling in Kubernetes Edge Clusters","authors":"Dimitrios Spatharakis, Ioannis Dimolitsas, E. Vlahakis, Dimitrios Dechouniotis, N. Athanasopoulos, S. Papavassiliou","doi":"10.23919/CNSM55787.2022.9965056","DOIUrl":null,"url":null,"abstract":"Maximizing the performance of modern applications requires timely resource management of the virtualized resources. However, proactively deploying resources for meeting specific application requirements subject to a dynamic workload profile of incoming requests is extremely challenging. To this end, the fundamental problems of task scheduling and resource autoscaling must be jointly addressed. This paper presents a scalable architecture compatible with the decentralized nature of Kubernetes [1], to solve both. Exploiting the stability guarantees of a novel AIMD-like task scheduling solution, we dynamically redirect the incoming requests towards the containerized application. To cope with dynamic workloads, a prediction mechanism allows us to estimate the number of incoming requests. Additionally, a Machine Learning-based (ML) Application Profiling Modeling is introduced to address the scaling, by co-designing the theoretically-computed service rates obtained from the AIMD algorithm with the current performance metrics. The proposed solution is compared with the state-of-the-art autoscaling techniques under a realistic dataset in a small edge infrastructure and the trade-off between resource utilization and QoS violations are analyzed. Our solution provides better resource utilization by reducing CPU cores by 8% with only an acceptable increase in QoS violations.","PeriodicalId":232521,"journal":{"name":"2022 18th International Conference on Network and Service Management (CNSM)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 18th International Conference on Network and Service Management (CNSM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/CNSM55787.2022.9965056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Maximizing the performance of modern applications requires timely management of virtualized resources. However, proactively deploying resources to meet specific application requirements under a dynamic workload profile of incoming requests is extremely challenging. To this end, the fundamental problems of task scheduling and resource autoscaling must be addressed jointly. This paper presents a scalable architecture, compatible with the decentralized nature of Kubernetes [1], that solves both. Exploiting the stability guarantees of a novel AIMD-like task scheduling solution, we dynamically redirect incoming requests towards the containerized application. To cope with dynamic workloads, a prediction mechanism estimates the number of incoming requests. Additionally, a Machine Learning (ML)-based application profiling model addresses the scaling decision by combining the theoretically computed service rates obtained from the AIMD algorithm with current performance metrics. The proposed solution is compared with state-of-the-art autoscaling techniques on a realistic dataset in a small edge infrastructure, and the trade-off between resource utilization and QoS violations is analyzed. Our solution achieves better resource utilization, reducing CPU core usage by 8% with only an acceptable increase in QoS violations.
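To make the AIMD-like adaptation concrete, the sketch below shows a generic additive-increase/multiplicative-decrease update of a per-replica admitted request rate. This is a minimal illustration of the general AIMD technique only; the class name, parameter values, and the congestion signal (a boolean overload flag) are assumptions for illustration, not the paper's exact formulation or stability conditions.

```python
class AIMDRateController:
    """Generic additive-increase/multiplicative-decrease rate update.

    Illustrative sketch only: parameter names and the congestion
    signal are assumptions, not the paper's exact formulation.
    """

    def __init__(self, alpha: float = 1.0, beta: float = 0.5,
                 rate_min: float = 1.0, rate_max: float = 100.0):
        self.alpha = alpha        # additive increase step (requests/s)
        self.beta = beta          # multiplicative decrease factor in (0, 1)
        self.rate_min = rate_min
        self.rate_max = rate_max
        self.rate = rate_min      # current admitted request rate

    def update(self, congested: bool) -> float:
        """Advance one control interval and return the new rate."""
        if congested:
            # Multiplicative decrease when the replica signals overload.
            self.rate = max(self.rate_min, self.beta * self.rate)
        else:
            # Additive increase while the replica keeps up.
            self.rate = min(self.rate_max, self.rate + self.alpha)
        return self.rate


# Example: drive the controller with a synthetic overload signal.
if __name__ == "__main__":
    ctrl = AIMDRateController()
    for step, congested in enumerate([False] * 5 + [True] + [False] * 3):
        print(f"step {step}: rate = {ctrl.update(congested):.1f}")
```

In the paper's architecture, rates of this kind would feed both the request redirection towards replicas and the ML-based profiling model that maps service rates to resource allocations; the mapping itself is not reproduced here.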