通过主动干扰抑制控制辅助强化学习优化暖通空调系统控制

IF 9 1区工程技术 Q1 ENERGY & FUELS

Energy Pub Date : 2025-03-25 DOI:10.1016/j.energy.2025.135824

Can Cui, Jiahui Xue, Lanjun Liu

{"title":"通过主动干扰抑制控制辅助强化学习优化暖通空调系统控制","authors":"Can Cui, Jiahui Xue, Lanjun Liu","doi":"10.1016/j.energy.2025.135824","DOIUrl":null,"url":null,"abstract":"<div><div>Optimal control of multi-zone HVAC systems may suffer from noise and disturbances that affect control accuracy and performance, and faces computational challenges caused by multiple control variables. To address these challenges, this paper proposes a novel method that incorporates reinforcement learning and active disturbance rejection control through a main-auxiliary controller configuration. A main controller is designed based on twin delayed deep deterministic policy gradient, which is responsible for controlling zone supply airflows. An auxiliary controller is configured based on active disturbance rejection control, which regulates the fresh air ratio and meanwhile handling the disturbances and uncertainties. The two controllers work in parallel with exchange information in real-time to optimize HVAC systems in dynamically uncertain environments. In the proposed method, the control variables are separated and handled by main and auxiliary controllers respectively, which reduces the action space of reinforcement learning algorithm and partly decouples the thermal loads and ventilation loads. An EnergyPlus-Python co-simulation platform has been developed using real-world data. Test results demonstrate that the proposed AD-RL method can enhance indoor comfort and IAQ. Furthermore, compared to the rule-based method and the classical TD3-based approach, it can reduce the daily HVAC energy consumption by up to 22.37 % and 13.53 %, respectively.</div></div>","PeriodicalId":11647,"journal":{"name":"Energy","volume":"323 ","pages":"Article 135824"},"PeriodicalIF":9.0000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimal control of HVAC systems through active disturbance rejection control-assisted reinforcement learning\",\"authors\":\"Can Cui, Jiahui Xue, Lanjun Liu\",\"doi\":\"10.1016/j.energy.2025.135824\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Optimal control of multi-zone HVAC systems may suffer from noise and disturbances that affect control accuracy and performance, and faces computational challenges caused by multiple control variables. To address these challenges, this paper proposes a novel method that incorporates reinforcement learning and active disturbance rejection control through a main-auxiliary controller configuration. A main controller is designed based on twin delayed deep deterministic policy gradient, which is responsible for controlling zone supply airflows. An auxiliary controller is configured based on active disturbance rejection control, which regulates the fresh air ratio and meanwhile handling the disturbances and uncertainties. The two controllers work in parallel with exchange information in real-time to optimize HVAC systems in dynamically uncertain environments. In the proposed method, the control variables are separated and handled by main and auxiliary controllers respectively, which reduces the action space of reinforcement learning algorithm and partly decouples the thermal loads and ventilation loads. An EnergyPlus-Python co-simulation platform has been developed using real-world data. Test results demonstrate that the proposed AD-RL method can enhance indoor comfort and IAQ. Furthermore, compared to the rule-based method and the classical TD3-based approach, it can reduce the daily HVAC energy consumption by up to 22.37 % and 13.53 %, respectively.</div></div>\",\"PeriodicalId\":11647,\"journal\":{\"name\":\"Energy\",\"volume\":\"323 \",\"pages\":\"Article 135824\"},\"PeriodicalIF\":9.0000,\"publicationDate\":\"2025-03-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Energy\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0360544225014665\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360544225014665","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}

引用次数: 0

摘要

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Optimal control of HVAC systems through active disturbance rejection control-assisted reinforcement learning

Optimal control of multi-zone HVAC systems may suffer from noise and disturbances that affect control accuracy and performance, and faces computational challenges caused by multiple control variables. To address these challenges, this paper proposes a novel method that incorporates reinforcement learning and active disturbance rejection control through a main-auxiliary controller configuration. A main controller is designed based on twin delayed deep deterministic policy gradient, which is responsible for controlling zone supply airflows. An auxiliary controller is configured based on active disturbance rejection control, which regulates the fresh air ratio and meanwhile handling the disturbances and uncertainties. The two controllers work in parallel with exchange information in real-time to optimize HVAC systems in dynamically uncertain environments. In the proposed method, the control variables are separated and handled by main and auxiliary controllers respectively, which reduces the action space of reinforcement learning algorithm and partly decouples the thermal loads and ventilation loads. An EnergyPlus-Python co-simulation platform has been developed using real-world data. Test results demonstrate that the proposed AD-RL method can enhance indoor comfort and IAQ. Furthermore, compared to the rule-based method and the classical TD3-based approach, it can reduce the daily HVAC energy consumption by up to 22.37 % and 13.53 %, respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Energy 工程技术-能源与燃料

CiteScore

15.30

自引率

14.40%

发文量

审稿时长

14.2 weeks

期刊介绍： Energy is a multidisciplinary, international journal that publishes research and analysis in the field of energy engineering. Our aim is to become a leading peer-reviewed platform and a trusted source of information for energy-related topics. The journal covers a range of areas including mechanical engineering, thermal sciences, and energy analysis. We are particularly interested in research on energy modelling, prediction, integrated energy systems, planning, and management. Additionally, we welcome papers on energy conservation, efficiency, biomass and bioenergy, renewable energy, electricity supply and demand, energy storage, buildings, and economic and policy issues. These topics should align with our broader multidisciplinary focus.