Improving scalability, energy efficiency, and cost-effectiveness in Kubernetes clusters using a nonlinear regression-based predictive replica model and ORLE algorithm
IF 4.3 3区 计算机科学Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
{"title":"Improving scalability, energy efficiency, and cost-effectiveness in Kubernetes clusters using a nonlinear regression-based predictive replica model and ORLE algorithm","authors":"Indrani Vasireddy, Rajeev Wankar, Raghavendra Rao Chillarige","doi":"10.1016/j.eij.2025.100732","DOIUrl":null,"url":null,"abstract":"<div><div>Container-based deployments have transformed how modern applications are packaged, deployed, and scaled. They bring increasing benefits in terms of development agility, testing, and collaboration. Kubernetes is a famous container orchestration engine that manages the life cycle of containerized applications by automatically scaling the containers and load balancing among them. In this paper, we present an application-level leader election method known as ORLE(Optimal Replica Leader Election). After conducting a detailed performance analysis of ORLE, we also developed a nonlinear regression Predictive Replica Model that predicts the throughput in real time, which helps in the early identification of conditions that require replicas’ scaling up or down. We integrated the proposed ORLE with this nonlinear regression Predictive Replica Model to improve the performance of Kubernetes clusters. Our model autoscales replicas concerning real-time traffic measurements to maximize resource utilization and system scalability. ORLE selects the most suitable replica as the leader to optimize the load distribution and reduce the overall latency of the request. Our experimental results show that our proposed solution outperforms the original leader election mechanism (Default RAFT) of Kubernetes and an existing state-of-the-art algorithm Balanced Leader Distribution (BLD) by up to 25% throughput improvement, 40% latency reduction, optimization of energy consumed, and cost efficiency per working replica under real-time conditions. The proposed model is more beneficial for stateful applications and microservices architectures since consistent performance and fast leader election are crucial in ensuring the Kubernetes cluster reliability and performance and are highly suitable for Kubernetes clusters deployed in a cloud computing environment, which demands high scalability, low latency, and efficient resource management.</div></div>","PeriodicalId":56010,"journal":{"name":"Egyptian Informatics Journal","volume":"31 ","pages":"Article 100732"},"PeriodicalIF":4.3000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Informatics Journal","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110866525001252","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Container-based deployments have transformed how modern applications are packaged, deployed, and scaled. They bring increasing benefits in terms of development agility, testing, and collaboration. Kubernetes is a famous container orchestration engine that manages the life cycle of containerized applications by automatically scaling the containers and load balancing among them. In this paper, we present an application-level leader election method known as ORLE(Optimal Replica Leader Election). After conducting a detailed performance analysis of ORLE, we also developed a nonlinear regression Predictive Replica Model that predicts the throughput in real time, which helps in the early identification of conditions that require replicas’ scaling up or down. We integrated the proposed ORLE with this nonlinear regression Predictive Replica Model to improve the performance of Kubernetes clusters. Our model autoscales replicas concerning real-time traffic measurements to maximize resource utilization and system scalability. ORLE selects the most suitable replica as the leader to optimize the load distribution and reduce the overall latency of the request. Our experimental results show that our proposed solution outperforms the original leader election mechanism (Default RAFT) of Kubernetes and an existing state-of-the-art algorithm Balanced Leader Distribution (BLD) by up to 25% throughput improvement, 40% latency reduction, optimization of energy consumed, and cost efficiency per working replica under real-time conditions. The proposed model is more beneficial for stateful applications and microservices architectures since consistent performance and fast leader election are crucial in ensuring the Kubernetes cluster reliability and performance and are highly suitable for Kubernetes clusters deployed in a cloud computing environment, which demands high scalability, low latency, and efficient resource management.
期刊介绍:
The Egyptian Informatics Journal is published by the Faculty of Computers and Artificial Intelligence, Cairo University. This Journal provides a forum for the state-of-the-art research and development in the fields of computing, including computer sciences, information technologies, information systems, operations research and decision support. Innovative and not-previously-published work in subjects covered by the Journal is encouraged to be submitted, whether from academic, research or commercial sources.