District heating optimization in residential buildings using reinforcement learning with adaptive context-aware predictive environment

IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Sai Sushanth Varma Kalidindi , Hadi Banaee , Hans Karlsson , Amy Loutfi
{"title":"District heating optimization in residential buildings using reinforcement learning with adaptive context-aware predictive environment","authors":"Sai Sushanth Varma Kalidindi ,&nbsp;Hadi Banaee ,&nbsp;Hans Karlsson ,&nbsp;Amy Loutfi","doi":"10.1016/j.egyai.2025.100603","DOIUrl":null,"url":null,"abstract":"<div><div>As district heating networks evolve to meet climate-neutral objectives, optimizing their control under heterogeneous building characteristics and dynamic environmental conditions remains a significant challenge. Traditional control strategies often lack the adaptability necessary to account for building-specific dynamics and to ensure real-time adherence to operational safety constraints. In this work, we present an integrated machine learning-based framework that combines an adaptive context-aware transformer model with deep reinforcement learning to address these limitations. The proposed approach introduces an adaptive context-aware transformer as a predictive environment within a Deep Q-Network (DQN) framework, enabling data-driven, building-specific control of district heating systems. Utilizing real-world data from 148 residential buildings across Sweden and Finland, the model incorporates contextual embeddings and temporal features to predict indoor temperature trajectories with high accuracy, achieving root mean square error values between 0.18–0.24 °C for Swedish buildings and 0.26–0.32 °C for Finnish buildings. The DQN agent leverages these predictions to optimize heating control while ensuring compliance with operational safety limits and preserving occupant comfort. Experimental results demonstrate significant energy savings, with mid-rise buildings achieving up to 14.85% reduction in energy consumption, and peak seasonal savings exceeding 20% during spring months. This integrated approach illustrates the potential for substantial energy optimization and reliable indoor climate management in future district heating networks.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"22 ","pages":"Article 100603"},"PeriodicalIF":9.6000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and AI","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666546825001351","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

As district heating networks evolve to meet climate-neutral objectives, optimizing their control under heterogeneous building characteristics and dynamic environmental conditions remains a significant challenge. Traditional control strategies often lack the adaptability necessary to account for building-specific dynamics and to ensure real-time adherence to operational safety constraints. In this work, we present an integrated machine learning-based framework that combines an adaptive context-aware transformer model with deep reinforcement learning to address these limitations. The proposed approach introduces an adaptive context-aware transformer as a predictive environment within a Deep Q-Network (DQN) framework, enabling data-driven, building-specific control of district heating systems. Utilizing real-world data from 148 residential buildings across Sweden and Finland, the model incorporates contextual embeddings and temporal features to predict indoor temperature trajectories with high accuracy, achieving root mean square error values between 0.18–0.24 °C for Swedish buildings and 0.26–0.32 °C for Finnish buildings. The DQN agent leverages these predictions to optimize heating control while ensuring compliance with operational safety limits and preserving occupant comfort. Experimental results demonstrate significant energy savings, with mid-rise buildings achieving up to 14.85% reduction in energy consumption, and peak seasonal savings exceeding 20% during spring months. This integrated approach illustrates the potential for substantial energy optimization and reliable indoor climate management in future district heating networks.

Abstract Image

基于自适应情景感知预测环境的强化学习的住宅区域供热优化
随着区域供热网络向气候中和目标的发展,在异质建筑特征和动态环境条件下优化其控制仍然是一个重大挑战。传统的控制策略通常缺乏必要的适应性,无法考虑建筑物特定的动态,也无法确保实时遵守操作安全约束。在这项工作中,我们提出了一个基于机器学习的集成框架,该框架将自适应上下文感知转换器模型与深度强化学习相结合,以解决这些限制。提出的方法引入了一个自适应的环境感知变压器,作为Deep Q-Network (DQN)框架内的预测环境,实现数据驱动的、特定于建筑的区域供热系统控制。该模型利用来自瑞典和芬兰148座住宅建筑的真实数据,结合上下文嵌入和时间特征,以高精度预测室内温度轨迹,瑞典建筑的均方根误差值为0.18-0.24°C,芬兰建筑的均方根误差值为0.26-0.32°C。DQN代理利用这些预测来优化加热控制,同时确保符合操作安全限制并保持乘员舒适度。实验结果显示节能效果显著,中层建筑能耗降低高达14.85%,春季季节节能峰值超过20%。这种综合方法说明了未来区域供热网络中大量能源优化和可靠的室内气候管理的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Energy and AI
Energy and AI Engineering-Engineering (miscellaneous)
CiteScore
16.50
自引率
0.00%
发文量
64
审稿时长
56 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信