{"title":"不确定环境下数据中心作业调度与能量管理","authors":"Zhaohao Ding;Shijie Chen;Yimeng Sun;Kun Shi;Jiaying Wang;Songsong Chen;Tao Xiao;Yehan Wang;Xuan Wei","doi":"10.1109/TIA.2025.3548576","DOIUrl":null,"url":null,"abstract":"Data centers have become crucial infrastructure in the digital age, leading to a significant increase in energy consumption. Job scheduling stands out as an effective method to regulate the data center energy consumption by delaying job execution within resource availability and quality of service constraints. However, aleatoric uncertainties associated with incoming job information and real-time electricity market prices, and epistemic uncertainties inherent in the learning environment jointly present unique challenges for efficient job scheduling schemes. To tackle multiple types of uncertainties, we propose an efficient risk-aware job scheduling method for data centers in uncertain environments. Firstly, we formulate the data center job scheduling problem within a Markov framework incorporating job heterogeneity. To capture epistemic and aleatoric uncertainties, the policy function is reconstructed by integrating state-action value distributions with efficient exploration based on enhanced distributional reinforcement learning. Furthermore, to account for the risk preferences in data center decision-making, we include consideration of Conditional Value at Risk in the model. Numerical simulation results demonstrate that the proposed strategy can rapidly adapt to uncertain environments and help data centers make risk-aware job scheduling decisions.","PeriodicalId":13337,"journal":{"name":"IEEE Transactions on Industry Applications","volume":"61 4","pages":"5489-5500"},"PeriodicalIF":4.5000,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data Center Job Scheduling and Energy Management Under Uncertain Environments\",\"authors\":\"Zhaohao Ding;Shijie Chen;Yimeng Sun;Kun Shi;Jiaying Wang;Songsong Chen;Tao Xiao;Yehan Wang;Xuan Wei\",\"doi\":\"10.1109/TIA.2025.3548576\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data centers have become crucial infrastructure in the digital age, leading to a significant increase in energy consumption. Job scheduling stands out as an effective method to regulate the data center energy consumption by delaying job execution within resource availability and quality of service constraints. However, aleatoric uncertainties associated with incoming job information and real-time electricity market prices, and epistemic uncertainties inherent in the learning environment jointly present unique challenges for efficient job scheduling schemes. To tackle multiple types of uncertainties, we propose an efficient risk-aware job scheduling method for data centers in uncertain environments. Firstly, we formulate the data center job scheduling problem within a Markov framework incorporating job heterogeneity. To capture epistemic and aleatoric uncertainties, the policy function is reconstructed by integrating state-action value distributions with efficient exploration based on enhanced distributional reinforcement learning. Furthermore, to account for the risk preferences in data center decision-making, we include consideration of Conditional Value at Risk in the model. Numerical simulation results demonstrate that the proposed strategy can rapidly adapt to uncertain environments and help data centers make risk-aware job scheduling decisions.\",\"PeriodicalId\":13337,\"journal\":{\"name\":\"IEEE Transactions on Industry Applications\",\"volume\":\"61 4\",\"pages\":\"5489-5500\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2025-03-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Industry Applications\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10916511/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Industry Applications","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10916511/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Data Center Job Scheduling and Energy Management Under Uncertain Environments
Data centers have become crucial infrastructure in the digital age, leading to a significant increase in energy consumption. Job scheduling stands out as an effective method to regulate the data center energy consumption by delaying job execution within resource availability and quality of service constraints. However, aleatoric uncertainties associated with incoming job information and real-time electricity market prices, and epistemic uncertainties inherent in the learning environment jointly present unique challenges for efficient job scheduling schemes. To tackle multiple types of uncertainties, we propose an efficient risk-aware job scheduling method for data centers in uncertain environments. Firstly, we formulate the data center job scheduling problem within a Markov framework incorporating job heterogeneity. To capture epistemic and aleatoric uncertainties, the policy function is reconstructed by integrating state-action value distributions with efficient exploration based on enhanced distributional reinforcement learning. Furthermore, to account for the risk preferences in data center decision-making, we include consideration of Conditional Value at Risk in the model. Numerical simulation results demonstrate that the proposed strategy can rapidly adapt to uncertain environments and help data centers make risk-aware job scheduling decisions.
期刊介绍:
The scope of the IEEE Transactions on Industry Applications includes all scope items of the IEEE Industry Applications Society, that is, the advancement of the theory and practice of electrical and electronic engineering in the development, design, manufacture, and application of electrical systems, apparatus, devices, and controls to the processes and equipment of industry and commerce; the promotion of safe, reliable, and economic installations; industry leadership in energy conservation and environmental, health, and safety issues; the creation of voluntary engineering standards and recommended practices; and the professional development of its membership.