具有软资源隔离的容器编排平台的服务感知自动伸缩策略

公共管理高层论坛 Pub Date : 2023-06-06 DOI:10.1109/EuCNC/6GSummit58263.2023.10188268

F. Tonini, C. Natalino, D. Temesgene, Z. Ghebretensae, L. Wosinska, P. Monti

{"title":"具有软资源隔离的容器编排平台的服务感知自动伸缩策略","authors":"F. Tonini, C. Natalino, D. Temesgene, Z. Ghebretensae, L. Wosinska, P. Monti","doi":"10.1109/EuCNC/6GSummit58263.2023.10188268","DOIUrl":null,"url":null,"abstract":"Container orchestration platforms like Kubernetes (K8s) allow easy deployment and management of cloud native services. When deploying their services, service providers need to specify a proper amount of resources to K8s, so that the desired Quality of Service (QoS) to their users can be maintained. To cope with the varying traffic demand coming from users, they can rely on the K8s Horizontal Pod Autoscaling (HPA) mechanism. To ensure that enough resources are available when needed, the standard HPA mechanism relies on resource overprovisioning. In this way, the required QoS is achieved most of (or all) the time but at the expense of additional resources that are allocated (and charged for), while they may stay idle for significant periods of time. A way to reduce overprovisioning is provided by the soft resource isolation of K8s, which allows services to compensate for a temporary lack of resources with shared resources, i.e., idle resources of the machines where services are running. However, during traffic spikes, these idle resources may not be enough to serve the whole demand, degrading the QoS. The HPA, which is not aware of how much demand could not be served, is not always able to correctly estimate the required additional resources, further degrading the QoS. To overcome this, service providers need to leverage overprovisioning, limiting the use of shared resources. In this paper, we propose a novel mechanism for autoscaling resources in K8s that relies on service-related data to avoid the additional degradation introduced by the HPA. The proposed strategy also offers a way to tune overprovisioning and shared resources. Simulation results show that our approach can reduce idle resources by up to 60% compared with the HPA mechanism.","PeriodicalId":65870,"journal":{"name":"公共管理高层论坛","volume":"17 1","pages":"454-459"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Service-Aware Autoscaling Strategy for Container Orchestration Platforms with Soft Resource Isolation\",\"authors\":\"F. Tonini, C. Natalino, D. Temesgene, Z. Ghebretensae, L. Wosinska, P. Monti\",\"doi\":\"10.1109/EuCNC/6GSummit58263.2023.10188268\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Container orchestration platforms like Kubernetes (K8s) allow easy deployment and management of cloud native services. When deploying their services, service providers need to specify a proper amount of resources to K8s, so that the desired Quality of Service (QoS) to their users can be maintained. To cope with the varying traffic demand coming from users, they can rely on the K8s Horizontal Pod Autoscaling (HPA) mechanism. To ensure that enough resources are available when needed, the standard HPA mechanism relies on resource overprovisioning. In this way, the required QoS is achieved most of (or all) the time but at the expense of additional resources that are allocated (and charged for), while they may stay idle for significant periods of time. A way to reduce overprovisioning is provided by the soft resource isolation of K8s, which allows services to compensate for a temporary lack of resources with shared resources, i.e., idle resources of the machines where services are running. However, during traffic spikes, these idle resources may not be enough to serve the whole demand, degrading the QoS. The HPA, which is not aware of how much demand could not be served, is not always able to correctly estimate the required additional resources, further degrading the QoS. To overcome this, service providers need to leverage overprovisioning, limiting the use of shared resources. In this paper, we propose a novel mechanism for autoscaling resources in K8s that relies on service-related data to avoid the additional degradation introduced by the HPA. The proposed strategy also offers a way to tune overprovisioning and shared resources. Simulation results show that our approach can reduce idle resources by up to 60% compared with the HPA mechanism.\",\"PeriodicalId\":65870,\"journal\":{\"name\":\"公共管理高层论坛\",\"volume\":\"17 1\",\"pages\":\"454-459\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"公共管理高层论坛\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://doi.org/10.1109/EuCNC/6GSummit58263.2023.10188268\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"公共管理高层论坛","FirstCategoryId":"96","ListUrlMain":"https://doi.org/10.1109/EuCNC/6GSummit58263.2023.10188268","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

像Kubernetes (k8)这样的容器编排平台允许轻松部署和管理云原生服务。在部署服务时，服务提供商需要为k8指定适当数量的资源，以便为其用户维持所需的服务质量(QoS)。为了应对来自用户的不同流量需求，他们可以依靠K8s水平Pod自动缩放(HPA)机制。为了确保在需要时有足够的资源可用，标准HPA机制依赖于资源过度供应。通过这种方式，所需的QoS在大部分时间(或全部时间)都得到了实现，但代价是要分配(并为此收费)额外的资源，而这些资源可能在相当长的一段时间内处于空闲状态。k8的软资源隔离提供了一种减少过度配置的方法，它允许服务使用共享资源(即运行服务的机器的空闲资源)来补偿资源的暂时缺乏。但是，在流量高峰期间，这些空闲资源可能不足以满足全部需求，从而降低QoS。HPA不知道有多少需求不能被满足，因此并不总是能够正确地估计所需的额外资源，从而进一步降低了QoS。为了克服这个问题，服务提供者需要利用过度供应，限制共享资源的使用。在本文中，我们提出了一种新的基于服务相关数据的k8资源自动缩放机制，以避免HPA带来的额外退化。所建议的策略还提供了一种调优过度配置和共享资源的方法。仿真结果表明，与HPA机制相比，我们的方法可以减少多达60%的空闲资源。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Service-Aware Autoscaling Strategy for Container Orchestration Platforms with Soft Resource Isolation

Container orchestration platforms like Kubernetes (K8s) allow easy deployment and management of cloud native services. When deploying their services, service providers need to specify a proper amount of resources to K8s, so that the desired Quality of Service (QoS) to their users can be maintained. To cope with the varying traffic demand coming from users, they can rely on the K8s Horizontal Pod Autoscaling (HPA) mechanism. To ensure that enough resources are available when needed, the standard HPA mechanism relies on resource overprovisioning. In this way, the required QoS is achieved most of (or all) the time but at the expense of additional resources that are allocated (and charged for), while they may stay idle for significant periods of time. A way to reduce overprovisioning is provided by the soft resource isolation of K8s, which allows services to compensate for a temporary lack of resources with shared resources, i.e., idle resources of the machines where services are running. However, during traffic spikes, these idle resources may not be enough to serve the whole demand, degrading the QoS. The HPA, which is not aware of how much demand could not be served, is not always able to correctly estimate the required additional resources, further degrading the QoS. To overcome this, service providers need to leverage overprovisioning, limiting the use of shared resources. In this paper, we propose a novel mechanism for autoscaling resources in K8s that relies on service-related data to avoid the additional degradation introduced by the HPA. The proposed strategy also offers a way to tune overprovisioning and shared resources. Simulation results show that our approach can reduce idle resources by up to 60% compared with the HPA mechanism.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

公共管理高层论坛

自引率

0.00%

发文量

385