Xu Han , Ali Malkawi , Zhuorui Li , Runyu Zhang , Na Li
{"title":"Optimizing radiant floor heating control with night setbacks using model-free reinforcement learning and transfer learning","authors":"Xu Han , Ali Malkawi , Zhuorui Li , Runyu Zhang , Na Li","doi":"10.1016/j.buildenv.2025.113771","DOIUrl":null,"url":null,"abstract":"<div><div>Controlling Radiant Floor Heating (RFH) systems with night setbacks presents a challenge due to their slow-response dynamics. Model Predictive Control (MPC) has demonstrated effectiveness in controlling such systems, but the need for model development constrains its scalability. This study investigates the feasibility and approaches of using model-free Reinforcement Learning (RL) and transfer learning for optimal control of RFH systems with night setbacks. A physics-based model is developed and validated as a virtual testbed. Four distinct RL control (RLC) strategies are proposed and evaluated, alongside a conventional Rule-Based Control (RBC) strategy as a baseline, and an MPC as an upper-bound performance benchmark. Our findings reveal that the Deep Q-Network (DQN) with n-step Temporal Difference learning incorporating weather forecasts as <em>states</em> achieve the best performance. The heating demand is reduced by 15-23 % with RLC and 13.1-28.5 % with MPC against RBC. However, the unmet hours with RLC are higher than those of MPC, suggesting further research for constraint satisfaction improvement. The transferability of the RLC is also evaluated by applying the trained RL agent to a new building using transfer learning through weights initialization, layer freezing and fine tuning. The results show that the training time for a new agent with a target building is significantly reduced taking advantage of transfer learning from an existing agent trained with a source building. In conclusion, this study demonstrates the potential of model-free RL and transfer learning in optimizing RFH systems with night setbacks, fostering advancements in scalable optimal building control strategies.</div></div>","PeriodicalId":9273,"journal":{"name":"Building and Environment","volume":"287 ","pages":"Article 113771"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Building and Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360132325012417","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Controlling Radiant Floor Heating (RFH) systems with night setbacks presents a challenge due to their slow-response dynamics. Model Predictive Control (MPC) has demonstrated effectiveness in controlling such systems, but the need for model development constrains its scalability. This study investigates the feasibility and approaches of using model-free Reinforcement Learning (RL) and transfer learning for optimal control of RFH systems with night setbacks. A physics-based model is developed and validated as a virtual testbed. Four distinct RL control (RLC) strategies are proposed and evaluated, alongside a conventional Rule-Based Control (RBC) strategy as a baseline, and an MPC as an upper-bound performance benchmark. Our findings reveal that the Deep Q-Network (DQN) with n-step Temporal Difference learning incorporating weather forecasts as states achieve the best performance. The heating demand is reduced by 15-23 % with RLC and 13.1-28.5 % with MPC against RBC. However, the unmet hours with RLC are higher than those of MPC, suggesting further research for constraint satisfaction improvement. The transferability of the RLC is also evaluated by applying the trained RL agent to a new building using transfer learning through weights initialization, layer freezing and fine tuning. The results show that the training time for a new agent with a target building is significantly reduced taking advantage of transfer learning from an existing agent trained with a source building. In conclusion, this study demonstrates the potential of model-free RL and transfer learning in optimizing RFH systems with night setbacks, fostering advancements in scalable optimal building control strategies.
期刊介绍:
Building and Environment, an international journal, is dedicated to publishing original research papers, comprehensive review articles, editorials, and short communications in the fields of building science, urban physics, and human interaction with the indoor and outdoor built environment. The journal emphasizes innovative technologies and knowledge verified through measurement and analysis. It covers environmental performance across various spatial scales, from cities and communities to buildings and systems, fostering collaborative, multi-disciplinary research with broader significance.