Zheqi Zhang, Yaling Xun, Haifeng Yang, Jianghui Cai
{"title":"Application type awareness pod-level and system-level container scheduling","authors":"Zheqi Zhang, Yaling Xun, Haifeng Yang, Jianghui Cai","doi":"10.1016/j.future.2025.107898","DOIUrl":null,"url":null,"abstract":"<div><div>Kubernetes, as a powerful tool for managing containerized applications, is considered a promising tool for supporting cloud computing platforms. The default scheduling scoring strategy only considers seeking an optimal node for the current pod and ignores the availability of subsequent nodes. Additionally, the node with the highest overall score may not necessarily be the most suitable node for the current task when the scoring process is performed. Therefore, a new Container scheduling strategies based on application type awareness at the pod and system levels (ATASL) is proposed. Firstly, in order to address the computational waste caused by the need to sequentially traverse and score all nodes in traditional node filtering methods, ATASL binds labels for Pods and nodes based on the required resources of Pods and the remaining resources of nodes, corresponding to “Compute” and “Memory”. So the subsequent scheduling of Pods is restricted to the corresponding groups of nodes only, avoiding the traversal of all nodes for scoring. Moreover, before scheduling each new task, ATASL adjusts the node roles to accommodate dynamic load changes based on the real-time resource status of the nodes. Secondly, when calculating the node score, not only the Pod-level score that matches the resource demand of the Pod is considered, but also the “system penalty score” mechanism is introduced to avoid the performance bottleneck caused by the over-utilization of a certain resource. This mechanism imposes a penalty on nodes where the utilization of a particular resource significantly exceeds the overall average utilization of the cluster, preventing resource imbalance and performance degradation (i.e., preventing overburdened nodes from being selected). Finally, a Kubernetes cluster was built using VMware to evaluate system performance. The experimental results show that ATASL can significantly improve the overall throughput and system resource utilization of the cluster, and also lead to a substantial improvement in node balance.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"173 ","pages":"Article 107898"},"PeriodicalIF":6.2000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X25001931","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Kubernetes, as a powerful tool for managing containerized applications, is considered a promising tool for supporting cloud computing platforms. The default scheduling scoring strategy only considers seeking an optimal node for the current pod and ignores the availability of subsequent nodes. Additionally, the node with the highest overall score may not necessarily be the most suitable node for the current task when the scoring process is performed. Therefore, a new Container scheduling strategies based on application type awareness at the pod and system levels (ATASL) is proposed. Firstly, in order to address the computational waste caused by the need to sequentially traverse and score all nodes in traditional node filtering methods, ATASL binds labels for Pods and nodes based on the required resources of Pods and the remaining resources of nodes, corresponding to “Compute” and “Memory”. So the subsequent scheduling of Pods is restricted to the corresponding groups of nodes only, avoiding the traversal of all nodes for scoring. Moreover, before scheduling each new task, ATASL adjusts the node roles to accommodate dynamic load changes based on the real-time resource status of the nodes. Secondly, when calculating the node score, not only the Pod-level score that matches the resource demand of the Pod is considered, but also the “system penalty score” mechanism is introduced to avoid the performance bottleneck caused by the over-utilization of a certain resource. This mechanism imposes a penalty on nodes where the utilization of a particular resource significantly exceeds the overall average utilization of the cluster, preventing resource imbalance and performance degradation (i.e., preventing overburdened nodes from being selected). Finally, a Kubernetes cluster was built using VMware to evaluate system performance. The experimental results show that ATASL can significantly improve the overall throughput and system resource utilization of the cluster, and also lead to a substantial improvement in node balance.
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.