{"title":"Ensemble Learning for Predicting Task Connectivity Over Time in Cloud Data Centers","authors":"Mustafa Daraghmeh, A. Agarwal, Y. Jararweh","doi":"10.1109/INFOCOMWKSHPS57453.2023.10225964","DOIUrl":null,"url":null,"abstract":"The rapid growth and interdependencies of cloud-hosted services and applications have made it imperative to optimize data center resource management while maintaining low operating costs. Inefficient resource use, rising energy consumption, and operating costs impact cloud provider's ability to provide high-quality services on an elastic basis. However, changing the selection and decision-making processes based on how the tasks are set up can improve scheduling and resource management in cloud data centers. In this paper, we develop a multivariate time series prediction model for task connectivity based on windowing characterization and ensemble learning methods. The high cardinality features are handled using a counter encoder, and the task trace data is transformed using a sliding window, from which features are extracted and used in conjunction with the task profile data to train and tune the candidate estimators. The best model outcomes are then used to construct an ensembled estimator. As part of the evaluation, a baseline comparison is performed in order to determine how well ensemble learning predicts task connectivity over time. The model outcomes are assisted using standard classification metrics such as accuracy, precision, and recall, including the F1 score, Kappa, and Matthews correlation coefficient. The results show that the proposed model outperformed the traditional models in most performance metrics, indicating the successful implementation of an ensemble learning approach for task connectivity predictions in large-scale cloud data centers.","PeriodicalId":354290,"journal":{"name":"IEEE INFOCOM 2023 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE INFOCOM 2023 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOCOMWKSHPS57453.2023.10225964","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The rapid growth and interdependencies of cloud-hosted services and applications have made it imperative to optimize data center resource management while maintaining low operating costs. Inefficient resource use, rising energy consumption, and operating costs impact cloud provider's ability to provide high-quality services on an elastic basis. However, changing the selection and decision-making processes based on how the tasks are set up can improve scheduling and resource management in cloud data centers. In this paper, we develop a multivariate time series prediction model for task connectivity based on windowing characterization and ensemble learning methods. The high cardinality features are handled using a counter encoder, and the task trace data is transformed using a sliding window, from which features are extracted and used in conjunction with the task profile data to train and tune the candidate estimators. The best model outcomes are then used to construct an ensembled estimator. As part of the evaluation, a baseline comparison is performed in order to determine how well ensemble learning predicts task connectivity over time. The model outcomes are assisted using standard classification metrics such as accuracy, precision, and recall, including the F1 score, Kappa, and Matthews correlation coefficient. The results show that the proposed model outperformed the traditional models in most performance metrics, indicating the successful implementation of an ensemble learning approach for task connectivity predictions in large-scale cloud data centers.