{"title":"Improving HVAC control with transfer learning: Using padding techniques for cross-building pre-training and fine-tuning","authors":"Kevlyn Kadamala, Des Chambers, Enda Barrett","doi":"10.1016/j.egyai.2025.100531","DOIUrl":null,"url":null,"abstract":"<div><div>Recent advancements have shown that control strategies using Deep Reinforcement Learning (DRL) can significantly improve the management of HVAC control and energy systems in buildings, leading to significant energy savings and better comfort. Unlike conventional rule-based controllers, they demand considerable time and data to develop effective policies. Transfer learning using pre-trained models can help address this issue. In this work, we use imitation learning (IL) as a method of pre-training and reinforcement learning (RL) for fine-tuning. However, HVAC systems can vary depending on the location, building size, structure, construction materials and weather conditions. The diversity in HVAC control systems across different buildings complicates the use of IL and RL. Neural network weights trained on the source building cannot be directly transferred to the target building because of differences in input features and the number of control equipment. To overcome this problem, we propose a novel padding method to ensure that both the source and target buildings share the same state space dimensionality. Thus, the trained neural network weights are transferable, and only the output layer must be adjusted to fit the dimensionality of the target action space. Additionally, we evaluate the performance of an existing padding technique for comparison. Our experiments show that the novel padding technique outperforms zero padding by 1.37% and training from scratch by 4.59% on average.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"21 ","pages":"Article 100531"},"PeriodicalIF":9.6000,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and AI","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666546825000631","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Recent advancements have shown that control strategies using Deep Reinforcement Learning (DRL) can significantly improve the management of HVAC control and energy systems in buildings, leading to significant energy savings and better comfort. Unlike conventional rule-based controllers, they demand considerable time and data to develop effective policies. Transfer learning using pre-trained models can help address this issue. In this work, we use imitation learning (IL) as a method of pre-training and reinforcement learning (RL) for fine-tuning. However, HVAC systems can vary depending on the location, building size, structure, construction materials and weather conditions. The diversity in HVAC control systems across different buildings complicates the use of IL and RL. Neural network weights trained on the source building cannot be directly transferred to the target building because of differences in input features and the number of control equipment. To overcome this problem, we propose a novel padding method to ensure that both the source and target buildings share the same state space dimensionality. Thus, the trained neural network weights are transferable, and only the output layer must be adjusted to fit the dimensionality of the target action space. Additionally, we evaluate the performance of an existing padding technique for comparison. Our experiments show that the novel padding technique outperforms zero padding by 1.37% and training from scratch by 4.59% on average.