{"title":"通过主动干扰抑制控制辅助强化学习优化暖通空调系统控制","authors":"Can Cui, Jiahui Xue, Lanjun Liu","doi":"10.1016/j.energy.2025.135824","DOIUrl":null,"url":null,"abstract":"<div><div>Optimal control of multi-zone HVAC systems may suffer from noise and disturbances that affect control accuracy and performance, and faces computational challenges caused by multiple control variables. To address these challenges, this paper proposes a novel method that incorporates reinforcement learning and active disturbance rejection control through a main-auxiliary controller configuration. A main controller is designed based on twin delayed deep deterministic policy gradient, which is responsible for controlling zone supply airflows. An auxiliary controller is configured based on active disturbance rejection control, which regulates the fresh air ratio and meanwhile handling the disturbances and uncertainties. The two controllers work in parallel with exchange information in real-time to optimize HVAC systems in dynamically uncertain environments. In the proposed method, the control variables are separated and handled by main and auxiliary controllers respectively, which reduces the action space of reinforcement learning algorithm and partly decouples the thermal loads and ventilation loads. An EnergyPlus-Python co-simulation platform has been developed using real-world data. Test results demonstrate that the proposed AD-RL method can enhance indoor comfort and IAQ. Furthermore, compared to the rule-based method and the classical TD3-based approach, it can reduce the daily HVAC energy consumption by up to 22.37 % and 13.53 %, respectively.</div></div>","PeriodicalId":11647,"journal":{"name":"Energy","volume":"323 ","pages":"Article 135824"},"PeriodicalIF":9.0000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimal control of HVAC systems through active disturbance rejection control-assisted reinforcement learning\",\"authors\":\"Can Cui, Jiahui Xue, Lanjun Liu\",\"doi\":\"10.1016/j.energy.2025.135824\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Optimal control of multi-zone HVAC systems may suffer from noise and disturbances that affect control accuracy and performance, and faces computational challenges caused by multiple control variables. To address these challenges, this paper proposes a novel method that incorporates reinforcement learning and active disturbance rejection control through a main-auxiliary controller configuration. A main controller is designed based on twin delayed deep deterministic policy gradient, which is responsible for controlling zone supply airflows. An auxiliary controller is configured based on active disturbance rejection control, which regulates the fresh air ratio and meanwhile handling the disturbances and uncertainties. The two controllers work in parallel with exchange information in real-time to optimize HVAC systems in dynamically uncertain environments. In the proposed method, the control variables are separated and handled by main and auxiliary controllers respectively, which reduces the action space of reinforcement learning algorithm and partly decouples the thermal loads and ventilation loads. An EnergyPlus-Python co-simulation platform has been developed using real-world data. Test results demonstrate that the proposed AD-RL method can enhance indoor comfort and IAQ. Furthermore, compared to the rule-based method and the classical TD3-based approach, it can reduce the daily HVAC energy consumption by up to 22.37 % and 13.53 %, respectively.</div></div>\",\"PeriodicalId\":11647,\"journal\":{\"name\":\"Energy\",\"volume\":\"323 \",\"pages\":\"Article 135824\"},\"PeriodicalIF\":9.0000,\"publicationDate\":\"2025-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Energy\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0360544225014665\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360544225014665","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
Optimal control of HVAC systems through active disturbance rejection control-assisted reinforcement learning
Optimal control of multi-zone HVAC systems may suffer from noise and disturbances that affect control accuracy and performance, and faces computational challenges caused by multiple control variables. To address these challenges, this paper proposes a novel method that incorporates reinforcement learning and active disturbance rejection control through a main-auxiliary controller configuration. A main controller is designed based on twin delayed deep deterministic policy gradient, which is responsible for controlling zone supply airflows. An auxiliary controller is configured based on active disturbance rejection control, which regulates the fresh air ratio and meanwhile handling the disturbances and uncertainties. The two controllers work in parallel with exchange information in real-time to optimize HVAC systems in dynamically uncertain environments. In the proposed method, the control variables are separated and handled by main and auxiliary controllers respectively, which reduces the action space of reinforcement learning algorithm and partly decouples the thermal loads and ventilation loads. An EnergyPlus-Python co-simulation platform has been developed using real-world data. Test results demonstrate that the proposed AD-RL method can enhance indoor comfort and IAQ. Furthermore, compared to the rule-based method and the classical TD3-based approach, it can reduce the daily HVAC energy consumption by up to 22.37 % and 13.53 %, respectively.
期刊介绍:
Energy is a multidisciplinary, international journal that publishes research and analysis in the field of energy engineering. Our aim is to become a leading peer-reviewed platform and a trusted source of information for energy-related topics.
The journal covers a range of areas including mechanical engineering, thermal sciences, and energy analysis. We are particularly interested in research on energy modelling, prediction, integrated energy systems, planning, and management.
Additionally, we welcome papers on energy conservation, efficiency, biomass and bioenergy, renewable energy, electricity supply and demand, energy storage, buildings, and economic and policy issues. These topics should align with our broader multidisciplinary focus.