{"title":"基于延迟感知时间序列聚类调度技术的云工作负载预测","authors":"P. Sridhar, R. R. Sathiya","doi":"10.1002/cpe.70151","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Cloud computing is a fundamental paradigm for computing services based on the elasticity attribute, in which available resources are effectively adjusted for changing workloads over time. A critical challenge in such systems is the task scheduling problem, which aims to identify the optimal allocation of resources to maximize performance and minimize response times. To overcome these drawbacks, a novel latency-aware time series-based scheduling (LATS) algorithm has been proposed in this paper for predicting future server loads. The proposed method involves collecting workloads, preprocessing and clustering them, predicting time series, and post-processing the data. The workload data will be divided according to a historical time window during the preprocessing phase. Next, the time series data will be clustered based on the latency classes using the dynamic fuzzy c-means algorithm. The time series prediction phase utilizes the Gated Recurrent Unit (GRU), and post-processing is performed to retrieve the original data. An evaluation of the accuracy of future workload predictions was conducted based on actual requests to web servers, and the silhouette score was utilized as the metric for assessing cluster performance. The proposed model has been compared with previous approaches involving Crystal LP, SWDF, and GA-PSO approaches in terms of prediction accuracy by 31.9%, 18.74%, and 12.16%, respectively.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 15-17","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cloud Workload Forecasting via Latency-Aware Time Series Clustering-Based Scheduling Technique\",\"authors\":\"P. Sridhar, R. R. Sathiya\",\"doi\":\"10.1002/cpe.70151\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Cloud computing is a fundamental paradigm for computing services based on the elasticity attribute, in which available resources are effectively adjusted for changing workloads over time. A critical challenge in such systems is the task scheduling problem, which aims to identify the optimal allocation of resources to maximize performance and minimize response times. To overcome these drawbacks, a novel latency-aware time series-based scheduling (LATS) algorithm has been proposed in this paper for predicting future server loads. The proposed method involves collecting workloads, preprocessing and clustering them, predicting time series, and post-processing the data. The workload data will be divided according to a historical time window during the preprocessing phase. Next, the time series data will be clustered based on the latency classes using the dynamic fuzzy c-means algorithm. The time series prediction phase utilizes the Gated Recurrent Unit (GRU), and post-processing is performed to retrieve the original data. An evaluation of the accuracy of future workload predictions was conducted based on actual requests to web servers, and the silhouette score was utilized as the metric for assessing cluster performance. The proposed model has been compared with previous approaches involving Crystal LP, SWDF, and GA-PSO approaches in terms of prediction accuracy by 31.9%, 18.74%, and 12.16%, respectively.</p>\\n </div>\",\"PeriodicalId\":55214,\"journal\":{\"name\":\"Concurrency and Computation-Practice & Experience\",\"volume\":\"37 15-17\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-06-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Concurrency and Computation-Practice & Experience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70151\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70151","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
Cloud Workload Forecasting via Latency-Aware Time Series Clustering-Based Scheduling Technique
Cloud computing is a fundamental paradigm for computing services based on the elasticity attribute, in which available resources are effectively adjusted for changing workloads over time. A critical challenge in such systems is the task scheduling problem, which aims to identify the optimal allocation of resources to maximize performance and minimize response times. To overcome these drawbacks, a novel latency-aware time series-based scheduling (LATS) algorithm has been proposed in this paper for predicting future server loads. The proposed method involves collecting workloads, preprocessing and clustering them, predicting time series, and post-processing the data. The workload data will be divided according to a historical time window during the preprocessing phase. Next, the time series data will be clustered based on the latency classes using the dynamic fuzzy c-means algorithm. The time series prediction phase utilizes the Gated Recurrent Unit (GRU), and post-processing is performed to retrieve the original data. An evaluation of the accuracy of future workload predictions was conducted based on actual requests to web servers, and the silhouette score was utilized as the metric for assessing cluster performance. The proposed model has been compared with previous approaches involving Crystal LP, SWDF, and GA-PSO approaches in terms of prediction accuracy by 31.9%, 18.74%, and 12.16%, respectively.
期刊介绍:
Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of:
Parallel and distributed computing;
High-performance computing;
Computational and data science;
Artificial intelligence and machine learning;
Big data applications, algorithms, and systems;
Network science;
Ontologies and semantics;
Security and privacy;
Cloud/edge/fog computing;
Green computing; and
Quantum computing.