Application type awareness pod-level and system-level container scheduling

IF 6.2 2区计算机科学 Q1 COMPUTER SCIENCE, THEORY & METHODS

Future Generation Computer Systems-The International Journal of Escience Pub Date : 2025-05-16 DOI:10.1016/j.future.2025.107898

Zheqi Zhang, Yaling Xun, Haifeng Yang, Jianghui Cai

{"title":"Application type awareness pod-level and system-level container scheduling","authors":"Zheqi Zhang, Yaling Xun, Haifeng Yang, Jianghui Cai","doi":"10.1016/j.future.2025.107898","DOIUrl":null,"url":null,"abstract":"<div><div>Kubernetes, as a powerful tool for managing containerized applications, is considered a promising tool for supporting cloud computing platforms. The default scheduling scoring strategy only considers seeking an optimal node for the current pod and ignores the availability of subsequent nodes. Additionally, the node with the highest overall score may not necessarily be the most suitable node for the current task when the scoring process is performed. Therefore, a new Container scheduling strategies based on application type awareness at the pod and system levels (ATASL) is proposed. Firstly, in order to address the computational waste caused by the need to sequentially traverse and score all nodes in traditional node filtering methods, ATASL binds labels for Pods and nodes based on the required resources of Pods and the remaining resources of nodes, corresponding to “Compute” and “Memory”. So the subsequent scheduling of Pods is restricted to the corresponding groups of nodes only, avoiding the traversal of all nodes for scoring. Moreover, before scheduling each new task, ATASL adjusts the node roles to accommodate dynamic load changes based on the real-time resource status of the nodes. Secondly, when calculating the node score, not only the Pod-level score that matches the resource demand of the Pod is considered, but also the “system penalty score” mechanism is introduced to avoid the performance bottleneck caused by the over-utilization of a certain resource. This mechanism imposes a penalty on nodes where the utilization of a particular resource significantly exceeds the overall average utilization of the cluster, preventing resource imbalance and performance degradation (i.e., preventing overburdened nodes from being selected). Finally, a Kubernetes cluster was built using VMware to evaluate system performance. The experimental results show that ATASL can significantly improve the overall throughput and system resource utilization of the cluster, and also lead to a substantial improvement in node balance.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"173 ","pages":"Article 107898"},"PeriodicalIF":6.2000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25001931","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Kubernetes, as a powerful tool for managing containerized applications, is considered a promising tool for supporting cloud computing platforms. The default scheduling scoring strategy only considers seeking an optimal node for the current pod and ignores the availability of subsequent nodes. Additionally, the node with the highest overall score may not necessarily be the most suitable node for the current task when the scoring process is performed. Therefore, a new Container scheduling strategies based on application type awareness at the pod and system levels (ATASL) is proposed. Firstly, in order to address the computational waste caused by the need to sequentially traverse and score all nodes in traditional node filtering methods, ATASL binds labels for Pods and nodes based on the required resources of Pods and the remaining resources of nodes, corresponding to “Compute” and “Memory”. So the subsequent scheduling of Pods is restricted to the corresponding groups of nodes only, avoiding the traversal of all nodes for scoring. Moreover, before scheduling each new task, ATASL adjusts the node roles to accommodate dynamic load changes based on the real-time resource status of the nodes. Secondly, when calculating the node score, not only the Pod-level score that matches the resource demand of the Pod is considered, but also the “system penalty score” mechanism is introduced to avoid the performance bottleneck caused by the over-utilization of a certain resource. This mechanism imposes a penalty on nodes where the utilization of a particular resource significantly exceeds the overall average utilization of the cluster, preventing resource imbalance and performance degradation (i.e., preventing overburdened nodes from being selected). Finally, a Kubernetes cluster was built using VMware to evaluate system performance. The experimental results show that ATASL can significantly improve the overall throughput and system resource utilization of the cluster, and also lead to a substantial improvement in node balance.

查看原文本刊更多论文

应用程序类型感知pod级和系统级容器调度

Kubernetes作为管理容器化应用程序的强大工具，被认为是支持云计算平台的一个很有前途的工具。默认的调度评分策略只考虑为当前pod寻找最优节点，而忽略后续节点的可用性。此外，在执行评分过程时，总分最高的节点不一定是最适合当前任务的节点。为此，提出了一种基于pod和系统级应用类型感知的容器调度策略（ATASL）。首先，为了解决传统节点过滤方法中需要对所有节点进行顺序遍历和评分造成的计算浪费，ATASL根据Pods所需资源和节点剩余资源为Pods和节点绑定标签，分别对应“Compute”和“Memory”。因此，Pods的后续调度仅限于相应的节点组，避免了遍历所有节点进行评分。在调度新任务之前，ATASL会根据节点的实时资源状态，调整节点角色，以适应负载的动态变化。其次，在计算节点得分时，不仅考虑与Pod的资源需求相匹配的Pod级得分，还引入了“系统罚分”机制，避免了由于某一资源的过度使用而导致的性能瓶颈。当某个特定资源的利用率明显超过集群的总体平均利用率时，这种机制会对节点施加惩罚，防止资源不平衡和性能下降（即，防止选择负担过重的节点）。最后，使用VMware构建了一个Kubernetes集群来评估系统性能。实验结果表明，ATASL可以显著提高集群的整体吞吐量和系统资源利用率，并显著改善节点平衡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Future Generation Computer Systems-The International Journal of Escience 工程技术-计算机：理论方法

CiteScore

19.90

自引率

2.70%

发文量

376

审稿时长

10.6 months

期刊介绍： Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications. Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration. Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.