Ensemble Learning for Predicting Task Connectivity Over Time in Cloud Data Centers

Mustafa Daraghmeh, A. Agarwal, Y. Jararweh
{"title":"Ensemble Learning for Predicting Task Connectivity Over Time in Cloud Data Centers","authors":"Mustafa Daraghmeh, A. Agarwal, Y. Jararweh","doi":"10.1109/INFOCOMWKSHPS57453.2023.10225964","DOIUrl":null,"url":null,"abstract":"The rapid growth and interdependencies of cloud-hosted services and applications have made it imperative to optimize data center resource management while maintaining low operating costs. Inefficient resource use, rising energy consumption, and operating costs impact cloud provider's ability to provide high-quality services on an elastic basis. However, changing the selection and decision-making processes based on how the tasks are set up can improve scheduling and resource management in cloud data centers. In this paper, we develop a multivariate time series prediction model for task connectivity based on windowing characterization and ensemble learning methods. The high cardinality features are handled using a counter encoder, and the task trace data is transformed using a sliding window, from which features are extracted and used in conjunction with the task profile data to train and tune the candidate estimators. The best model outcomes are then used to construct an ensembled estimator. As part of the evaluation, a baseline comparison is performed in order to determine how well ensemble learning predicts task connectivity over time. The model outcomes are assisted using standard classification metrics such as accuracy, precision, and recall, including the F1 score, Kappa, and Matthews correlation coefficient. The results show that the proposed model outperformed the traditional models in most performance metrics, indicating the successful implementation of an ensemble learning approach for task connectivity predictions in large-scale cloud data centers.","PeriodicalId":354290,"journal":{"name":"IEEE INFOCOM 2023 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE INFOCOM 2023 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOCOMWKSHPS57453.2023.10225964","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The rapid growth and interdependencies of cloud-hosted services and applications have made it imperative to optimize data center resource management while maintaining low operating costs. Inefficient resource use, rising energy consumption, and operating costs impact cloud provider's ability to provide high-quality services on an elastic basis. However, changing the selection and decision-making processes based on how the tasks are set up can improve scheduling and resource management in cloud data centers. In this paper, we develop a multivariate time series prediction model for task connectivity based on windowing characterization and ensemble learning methods. The high cardinality features are handled using a counter encoder, and the task trace data is transformed using a sliding window, from which features are extracted and used in conjunction with the task profile data to train and tune the candidate estimators. The best model outcomes are then used to construct an ensembled estimator. As part of the evaluation, a baseline comparison is performed in order to determine how well ensemble learning predicts task connectivity over time. The model outcomes are assisted using standard classification metrics such as accuracy, precision, and recall, including the F1 score, Kappa, and Matthews correlation coefficient. The results show that the proposed model outperformed the traditional models in most performance metrics, indicating the successful implementation of an ensemble learning approach for task connectivity predictions in large-scale cloud data centers.
集成学习用于预测云数据中心中任务随时间的连通性
云托管服务和应用程序的快速增长和相互依赖性使得优化数据中心资源管理成为当务之急,同时保持较低的运营成本。低效的资源使用、不断上升的能源消耗和运营成本影响了云提供商在弹性基础上提供高质量服务的能力。然而,根据任务的设置方式改变选择和决策过程可以改善云数据中心的调度和资源管理。在本文中,我们建立了一个基于窗口表征和集成学习方法的任务连接的多变量时间序列预测模型。使用计数器编码器处理高基数特征,使用滑动窗口转换任务跟踪数据,从中提取特征并与任务概要数据一起使用,以训练和调整候选估计器。然后使用最佳模型结果构造一个集成估计器。作为评估的一部分,将执行基线比较,以确定集成学习如何很好地预测任务连接。模型结果使用标准分类指标,如准确性、精密度和召回率,包括F1分数、Kappa和Matthews相关系数来辅助。结果表明,该模型在大多数性能指标上优于传统模型,表明集成学习方法在大规模云数据中心任务连通性预测中的成功实现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信