Ruichao Mo , Weiwei Lin , Guozhi Liu , Haolin Liu , Ligang He
{"title":"Learning from imbalance: Cross-server power prediction in large data centers via domain adaptation regression","authors":"Ruichao Mo , Weiwei Lin , Guozhi Liu , Haolin Liu , Ligang He","doi":"10.1016/j.eswa.2025.127845","DOIUrl":null,"url":null,"abstract":"<div><div>Machine learning (ML) models currently excel at predicting power consumption for servers in cloud data centers. The heterogeneity of server configurations violates the assumption of independent and identically distributed (i.i.d.) data, resulting in distribution shifts that pose significant challenges for cross-server power prediction. Additionally, the labels of power consumption data collected over a limited time show a natural imbalance, causing the power prediction performance to degrade when encountering missing labels. Therefore, learning meaningful knowledge from imbalanced power consumption data of real servers for cross-server power prediction remains challenging. To address this challenge, we consider imbalanced cross-server power prediction, formulated as a semi-supervised domain adaptation regression problem in scenarios where few labeled data points of target servers are available. Consequently, an <u>i</u>mbalanced <u>c</u>ross-<u>s</u>erver <u>p</u>ower prediction method, named <em><strong>ICSP</strong></em>, is proposed. To prevent learning biased knowledge from imbalanced data, unbalanced optimal transport is employed to align the joint probability distribution of the source and target servers. Moreover, by incorporating the few labels of target servers as a priori constraints, the performance of <em><strong>ICSP</strong></em> in coping with distribution shift is further improved. Extensive experiments on a real-world dataset demonstrate the superiority of <em><strong>ICSP</strong></em> over existing domain adaptation regression methods for imbalanced cross-server power prediction.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"287 ","pages":"Article 127845"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425014678","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Machine learning (ML) models currently excel at predicting power consumption for servers in cloud data centers. The heterogeneity of server configurations violates the assumption of independent and identically distributed (i.i.d.) data, resulting in distribution shifts that pose significant challenges for cross-server power prediction. Additionally, the labels of power consumption data collected over a limited time show a natural imbalance, causing the power prediction performance to degrade when encountering missing labels. Therefore, learning meaningful knowledge from imbalanced power consumption data of real servers for cross-server power prediction remains challenging. To address this challenge, we consider imbalanced cross-server power prediction, formulated as a semi-supervised domain adaptation regression problem in scenarios where few labeled data points of target servers are available. Consequently, an imbalanced cross-server power prediction method, named ICSP, is proposed. To prevent learning biased knowledge from imbalanced data, unbalanced optimal transport is employed to align the joint probability distribution of the source and target servers. Moreover, by incorporating the few labels of target servers as a priori constraints, the performance of ICSP in coping with distribution shift is further improved. Extensive experiments on a real-world dataset demonstrate the superiority of ICSP over existing domain adaptation regression methods for imbalanced cross-server power prediction.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.