{"title":"基于深度强化学习的暖通空调系统节能权衡决策","authors":"Suroor M. Dawood, A. Hatami, R. Homod","doi":"10.1080/19401493.2022.2099465","DOIUrl":null,"url":null,"abstract":"This paper presents Model-based Reinforcement Learning (MB-RL) techniques to control the indoor air temperature, and CO2 concentration level, and minimize the energy consumption of the heating, ventilating, and air conditioning (HVAC) systems, simultaneously. For this purpose, a trade-off is made between maintaining indoor comfort levels and minimizing energy consumption. The control of the HVAC system is performed using the Deterministic Policy RL (DP-RL) method. Moreover, the nonlinear autoregressive exogenous neural network (NARX-NN) is employed as an approximation function with DP-RL method to provide a hybrid DP-NARX-RL controller. By applying the DP-RL and DP-NARX-RL controllers to the HVAC system of a typical building, parameters such as the indoor comfort levels, the electrical power, and energy consumed, and the energy costs at various pricing schemes are evaluated for two case studies. In both cases, the results show the better performance of DP-NARX-RL compared to DP-RL, RL, and PID controllers.","PeriodicalId":49168,"journal":{"name":"Journal of Building Performance Simulation","volume":null,"pages":null},"PeriodicalIF":2.2000,"publicationDate":"2022-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Trade-off decisions in a novel deep reinforcement learning for energy savings in HVAC systems\",\"authors\":\"Suroor M. Dawood, A. Hatami, R. Homod\",\"doi\":\"10.1080/19401493.2022.2099465\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents Model-based Reinforcement Learning (MB-RL) techniques to control the indoor air temperature, and CO2 concentration level, and minimize the energy consumption of the heating, ventilating, and air conditioning (HVAC) systems, simultaneously. For this purpose, a trade-off is made between maintaining indoor comfort levels and minimizing energy consumption. The control of the HVAC system is performed using the Deterministic Policy RL (DP-RL) method. Moreover, the nonlinear autoregressive exogenous neural network (NARX-NN) is employed as an approximation function with DP-RL method to provide a hybrid DP-NARX-RL controller. By applying the DP-RL and DP-NARX-RL controllers to the HVAC system of a typical building, parameters such as the indoor comfort levels, the electrical power, and energy consumed, and the energy costs at various pricing schemes are evaluated for two case studies. In both cases, the results show the better performance of DP-NARX-RL compared to DP-RL, RL, and PID controllers.\",\"PeriodicalId\":49168,\"journal\":{\"name\":\"Journal of Building Performance Simulation\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2022-08-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Building Performance Simulation\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1080/19401493.2022.2099465\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Building Performance Simulation","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1080/19401493.2022.2099465","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
Trade-off decisions in a novel deep reinforcement learning for energy savings in HVAC systems
This paper presents Model-based Reinforcement Learning (MB-RL) techniques to control the indoor air temperature, and CO2 concentration level, and minimize the energy consumption of the heating, ventilating, and air conditioning (HVAC) systems, simultaneously. For this purpose, a trade-off is made between maintaining indoor comfort levels and minimizing energy consumption. The control of the HVAC system is performed using the Deterministic Policy RL (DP-RL) method. Moreover, the nonlinear autoregressive exogenous neural network (NARX-NN) is employed as an approximation function with DP-RL method to provide a hybrid DP-NARX-RL controller. By applying the DP-RL and DP-NARX-RL controllers to the HVAC system of a typical building, parameters such as the indoor comfort levels, the electrical power, and energy consumed, and the energy costs at various pricing schemes are evaluated for two case studies. In both cases, the results show the better performance of DP-NARX-RL compared to DP-RL, RL, and PID controllers.
期刊介绍:
The Journal of Building Performance Simulation (JBPS) aims to make a substantial and lasting contribution to the international building community by supporting our authors and the high-quality, original research they submit. The journal also offers a forum for original review papers and researched case studies
We welcome building performance simulation contributions that explore the following topics related to buildings and communities:
-Theoretical aspects related to modelling and simulating the physical processes (thermal, air flow, moisture, lighting, acoustics).
-Theoretical aspects related to modelling and simulating conventional and innovative energy conversion, storage, distribution, and control systems.
-Theoretical aspects related to occupants, weather data, and other boundary conditions.
-Methods and algorithms for optimizing the performance of buildings and communities and the systems which service them, including interaction with the electrical grid.
-Uncertainty, sensitivity analysis, and calibration.
-Methods and algorithms for validating models and for verifying solution methods and tools.
-Development and validation of controls-oriented models that are appropriate for model predictive control and/or automated fault detection and diagnostics.
-Techniques for educating and training tool users.
-Software development techniques and interoperability issues with direct applicability to building performance simulation.
-Case studies involving the application of building performance simulation for any stage of the design, construction, commissioning, operation, or management of buildings and the systems which service them are welcomed if they include validation or aspects that make a novel contribution to the knowledge base.