Cloud Workload Forecasting via Latency-Aware Time Series Clustering-Based Scheduling Technique

IF 1.5 4区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Concurrency and Computation-Practice & Experience Pub Date : 2025-06-19 DOI:10.1002/cpe.70151

P. Sridhar, R. R. Sathiya

{"title":"Cloud Workload Forecasting via Latency-Aware Time Series Clustering-Based Scheduling Technique","authors":"P. Sridhar, R. R. Sathiya","doi":"10.1002/cpe.70151","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Cloud computing is a fundamental paradigm for computing services based on the elasticity attribute, in which available resources are effectively adjusted for changing workloads over time. A critical challenge in such systems is the task scheduling problem, which aims to identify the optimal allocation of resources to maximize performance and minimize response times. To overcome these drawbacks, a novel latency-aware time series-based scheduling (LATS) algorithm has been proposed in this paper for predicting future server loads. The proposed method involves collecting workloads, preprocessing and clustering them, predicting time series, and post-processing the data. The workload data will be divided according to a historical time window during the preprocessing phase. Next, the time series data will be clustered based on the latency classes using the dynamic fuzzy c-means algorithm. The time series prediction phase utilizes the Gated Recurrent Unit (GRU), and post-processing is performed to retrieve the original data. An evaluation of the accuracy of future workload predictions was conducted based on actual requests to web servers, and the silhouette score was utilized as the metric for assessing cluster performance. The proposed model has been compared with previous approaches involving Crystal LP, SWDF, and GA-PSO approaches in terms of prediction accuracy by 31.9%, 18.74%, and 12.16%, respectively.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 15-17","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70151","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Cloud computing is a fundamental paradigm for computing services based on the elasticity attribute, in which available resources are effectively adjusted for changing workloads over time. A critical challenge in such systems is the task scheduling problem, which aims to identify the optimal allocation of resources to maximize performance and minimize response times. To overcome these drawbacks, a novel latency-aware time series-based scheduling (LATS) algorithm has been proposed in this paper for predicting future server loads. The proposed method involves collecting workloads, preprocessing and clustering them, predicting time series, and post-processing the data. The workload data will be divided according to a historical time window during the preprocessing phase. Next, the time series data will be clustered based on the latency classes using the dynamic fuzzy c-means algorithm. The time series prediction phase utilizes the Gated Recurrent Unit (GRU), and post-processing is performed to retrieve the original data. An evaluation of the accuracy of future workload predictions was conducted based on actual requests to web servers, and the silhouette score was utilized as the metric for assessing cluster performance. The proposed model has been compared with previous approaches involving Crystal LP, SWDF, and GA-PSO approaches in terms of prediction accuracy by 31.9%, 18.74%, and 12.16%, respectively.

查看原文本刊更多论文

基于延迟感知时间序列聚类调度技术的云工作负载预测

云计算是基于弹性属性计算服务的一种基本范式，在这种范式中，可用资源可以随着时间的推移有效地调整以适应不断变化的工作负载。在这样的系统中，一个关键的挑战是任务调度问题，其目的是确定资源的最佳分配，以最大限度地提高性能和最小化响应时间。为了克服这些缺点，本文提出了一种新的基于延迟感知的时间序列调度（LATS）算法来预测未来的服务器负载。该方法包括收集工作负载，对其进行预处理和聚类，预测时间序列，以及对数据进行后处理。在预处理阶段，将根据历史时间窗口划分工作负载数据。接下来，使用动态模糊c均值算法对时间序列数据进行基于延迟类的聚类。时间序列预测阶段使用门控循环单元（GRU），并进行后处理以检索原始数据。基于对web服务器的实际请求，对未来工作负载预测的准确性进行了评估，并利用剪影分数作为评估集群性能的指标。与Crystal LP、SWDF和GA-PSO方法相比，该模型的预测准确率分别提高了31.9%、18.74%和12.16%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Concurrency and Computation-Practice & Experience 工程技术-计算机：理论方法

CiteScore

5.00

自引率

10.00%

发文量

664

审稿时长

9.6 months

期刊介绍： Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of: Parallel and distributed computing; High-performance computing; Computational and data science; Artificial intelligence and machine learning; Big data applications, algorithms, and systems; Network science; Ontologies and semantics; Security and privacy; Cloud/edge/fog computing; Green computing; and Quantum computing.