Machine Learning Based Workload Prediction in Cloud Computing

2020 29th International Conference on Computer Communications and Networks (ICCCN) Pub Date : 2020-08-01 DOI:10.1109/ICCCN49398.2020.9209730

Jiechao Gao, Haoyu Wang, Haiying Shen

{"title":"Machine Learning Based Workload Prediction in Cloud Computing","authors":"Jiechao Gao, Haoyu Wang, Haiying Shen","doi":"10.1109/ICCCN49398.2020.9209730","DOIUrl":null,"url":null,"abstract":"As a widely used IT service, more and more companies shift their services to cloud datacenters. It is important for cloud service providers (CSPs) to provide cloud service resources with high elasticity and cost-effectiveness and then achieve good quality of service (QoS) for their clients. However, meeting QoS with cost-effective resource is a challenging problem for CSPs because the workloads of Virtual Machines (VMs) experience variation over time. It is highly necessary to provide an accurate VMs workload prediction method for resource provisioning to efficiently manage cloud resources. In this paper, we first compare the performance of representative state-of-the-art workload prediction methods. We suggest a method to conduct the prediction a certain time before the predicted time point in order to allow sufficient time for task scheduling based on predicted workload. To further improve the prediction accuracy, we introduce a clustering based workload prediction method, which first clusters all the tasks into several categories and then trains a prediction model for each category respectively. The trace-driven experiments based on Google cluster trace demonstrates that our clustering based workload prediction methods outperform other comparison methods and improve the prediction accuracy to around 90% both in CPU and memory.","PeriodicalId":137835,"journal":{"name":"2020 29th International Conference on Computer Communications and Networks (ICCCN)","volume":"44 12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"136","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 29th International Conference on Computer Communications and Networks (ICCCN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCN49398.2020.9209730","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 136

Abstract

As a widely used IT service, more and more companies shift their services to cloud datacenters. It is important for cloud service providers (CSPs) to provide cloud service resources with high elasticity and cost-effectiveness and then achieve good quality of service (QoS) for their clients. However, meeting QoS with cost-effective resource is a challenging problem for CSPs because the workloads of Virtual Machines (VMs) experience variation over time. It is highly necessary to provide an accurate VMs workload prediction method for resource provisioning to efficiently manage cloud resources. In this paper, we first compare the performance of representative state-of-the-art workload prediction methods. We suggest a method to conduct the prediction a certain time before the predicted time point in order to allow sufficient time for task scheduling based on predicted workload. To further improve the prediction accuracy, we introduce a clustering based workload prediction method, which first clusters all the tasks into several categories and then trains a prediction model for each category respectively. The trace-driven experiments based on Google cluster trace demonstrates that our clustering based workload prediction methods outperform other comparison methods and improve the prediction accuracy to around 90% both in CPU and memory.

查看原文本刊更多论文

云计算中基于机器学习的工作负荷预测

云数据中心作为一种应用广泛的IT服务，越来越多的企业将其业务转移到云数据中心。云服务提供商(csp)提供高弹性和高成本效益的云服务资源，从而为其客户实现良好的服务质量(QoS)是非常重要的。然而，由于虚拟机(vm)的工作负载会随着时间的推移而变化，因此对csp来说，用经济有效的资源满足QoS是一个具有挑战性的问题。为了高效地管理云资源，提供准确的虚拟机工作负载预测方法是非常必要的。在本文中，我们首先比较了具有代表性的最先进的工作负荷预测方法的性能。我们建议在预测时间点之前的某个时间点进行预测，以便有足够的时间根据预测的工作量进行任务调度。为了进一步提高预测精度，我们引入了一种基于聚类的工作负荷预测方法，该方法首先将所有任务聚类成几个类别，然后分别为每个类别训练预测模型。基于Google聚类跟踪的跟踪驱动实验表明，我们的基于聚类的工作负载预测方法优于其他比较方法，在CPU和内存方面的预测准确率都提高到90%左右。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 29th International Conference on Computer Communications and Networks (ICCCN)

自引率

0.00%

发文量