A S-Threshold Method to Perform Horizontal Auto-Scaling in a Cloud Computing Environment

IF 1.5 4区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Concurrency and Computation-Practice & Experience Pub Date : 2025-05-26 DOI:10.1002/cpe.70135

Archana Archana, Narander Kumar

{"title":"A S-Threshold Method to Perform Horizontal Auto-Scaling in a Cloud Computing Environment","authors":"Archana Archana, Narander Kumar","doi":"10.1002/cpe.70135","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Cloud computing provides numerous benefits to individuals and any organization, making it a vital technology today. Users can communicate with the cloud computing systems and access the resources through the network. The need for resources can be increased, or at some time, it can be minimized according to the necessity of the resources. However, dynamic scaling is the pivotal property of cloud computing systems to dynamically devote the resources and manage the load while maintaining the quality of service requirement. This paper aims to provide a horizontal auto-scaling method that makes the system fault-tolerant, meets processing demand, optimizes the overall performance, meets user satisfaction, and ensures the quality of service requirement. The proposed S-Threshold method combines the State–action–reward–state–action (SARSA) method with the threshold model. It overcomes the drawbacks of the threshold method, automatically allocates the resources by adding or removing the machine according to the requisite basis, and ensures the aforementioned objectives. In this paper, the system initially relies on threshold-based rules for quick decisions. Over time, SARSA learns from experience and improves decisions, making scaling more adaptive and optimized. The proposed method juxtaposed with some existing methods and evaluated the five performance metrics to state the effectiveness of the proposed method. CloudSimPlus simulator simulates the proposed method into three significant categories: with various numbers of cloudlets, processing speeds, and VMs. While considering the increasing number of cloudlets, increasing processing speed, and increase in VMs, it noted that evaluated results of the proposed method give 50%, 80%, 85%, 60%, and 62% better results with respect to makespan, response time, waiting duration, average turnaround time, and throughput respectively.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 12-14","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70135","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Cloud computing provides numerous benefits to individuals and any organization, making it a vital technology today. Users can communicate with the cloud computing systems and access the resources through the network. The need for resources can be increased, or at some time, it can be minimized according to the necessity of the resources. However, dynamic scaling is the pivotal property of cloud computing systems to dynamically devote the resources and manage the load while maintaining the quality of service requirement. This paper aims to provide a horizontal auto-scaling method that makes the system fault-tolerant, meets processing demand, optimizes the overall performance, meets user satisfaction, and ensures the quality of service requirement. The proposed S-Threshold method combines the State–action–reward–state–action (SARSA) method with the threshold model. It overcomes the drawbacks of the threshold method, automatically allocates the resources by adding or removing the machine according to the requisite basis, and ensures the aforementioned objectives. In this paper, the system initially relies on threshold-based rules for quick decisions. Over time, SARSA learns from experience and improves decisions, making scaling more adaptive and optimized. The proposed method juxtaposed with some existing methods and evaluated the five performance metrics to state the effectiveness of the proposed method. CloudSimPlus simulator simulates the proposed method into three significant categories: with various numbers of cloudlets, processing speeds, and VMs. While considering the increasing number of cloudlets, increasing processing speed, and increase in VMs, it noted that evaluated results of the proposed method give 50%, 80%, 85%, 60%, and 62% better results with respect to makespan, response time, waiting duration, average turnaround time, and throughput respectively.

查看原文本刊更多论文

云计算环境下水平自动伸缩的s -阈值方法

云计算为个人和任何组织提供了许多好处，使其成为当今重要的技术。用户可以通过网络与云计算系统通信，访问云计算资源。对资源的需求可以增加，或者在某些时候，可以根据资源的必要性将其最小化。然而，动态扩展是云计算系统在保持服务质量要求的同时动态投入资源和管理负载的关键属性。本文旨在提供一种水平自扩展方法，使系统具有容错性，满足处理需求，优化整体性能，满足用户满意度，保证服务质量需求。提出的S-Threshold方法将状态-动作-奖励-状态-动作（SARSA）方法与阈值模型相结合。它克服了阈值法的缺点，根据必要的基数，通过增减机器自动分配资源，保证了上述目标的实现。在本文中，系统最初依赖于基于阈值的规则来进行快速决策。随着时间的推移，SARSA从经验中学习并改进决策，使扩展更具适应性和优化性。将所提出的方法与现有的一些方法并置，并对五个性能指标进行评估，以说明所提出方法的有效性。CloudSimPlus模拟器将提出的方法模拟为三个重要类别：具有不同数量的cloudlets、处理速度和vm。考虑到cloudlets数量的增加、处理速度的提高和vm的增加，该方法的评估结果在makespan、响应时间、等待时间、平均周转时间和吞吐量方面分别提高了50%、80%、85%、60%和62%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Concurrency and Computation-Practice & Experience 工程技术-计算机：理论方法

CiteScore

5.00

自引率

10.00%

发文量

664

审稿时长

9.6 months

期刊介绍： Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of: Parallel and distributed computing; High-performance computing; Computational and data science; Artificial intelligence and machine learning; Big data applications, algorithms, and systems; Network science; Ontologies and semantics; Security and privacy; Cloud/edge/fog computing; Green computing; and Quantum computing.