Enhancing performance and generalization in dormitory optimization using deep reinforcement learning with embedded surrogate model

IF 7.1 1区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY
Zewei Shi , Chenyu Huang , Jinyu Wang , Zhongqi Yu , Jiayan Fu , Jiawei Yao
{"title":"Enhancing performance and generalization in dormitory optimization using deep reinforcement learning with embedded surrogate model","authors":"Zewei Shi ,&nbsp;Chenyu Huang ,&nbsp;Jinyu Wang ,&nbsp;Zhongqi Yu ,&nbsp;Jiayan Fu ,&nbsp;Jiawei Yao","doi":"10.1016/j.buildenv.2025.112864","DOIUrl":null,"url":null,"abstract":"<div><div>The natural ventilation and daylighting of dormitories are key factors affecting student comfort and health, with early-stage design greatly impacting indoor performance. Existing optimization methods, mainly evolutionary algorithms like genetic algorithms, excel in global search but struggle with adapting to dynamic environments, handling multi-dimensional tasks, and ensuring model generalization. This study proposes a multi-objective optimization framework integrating GAN and DRL for dormitory indoor airflow and daylighting enhancement. The GAN model provides real-time predictions of global wind speed and useful daylight illuminance (UDI). By combining a GAN-based surrogate model with a DRL approach based on Deep Deterministic Policy Gradient (DDPG), the framework iteratively refines dormitory unit layouts through continuous interaction between the environment and the agent. The effectiveness and generalization of the DRL-based method were evaluated across three dormitory typologies. Results indicate that, on the test dataset, the pix2pix model achieved R² values of 0.979 and 0.988 for predicting indoor wind and lighting conditions, respectively. Compared to traditional genetic algorithms, the DRL model demonstrated superior performance in optimizing indoor environmental conditions, achieving up to a 9.33 % improvement in wind environment optimization. The pre-trained models exhibited a certain degree of generalization across the three scenarios. This approach provides valuable support for environmentally driven indoor architectural optimization.</div></div>","PeriodicalId":9273,"journal":{"name":"Building and Environment","volume":"276 ","pages":"Article 112864"},"PeriodicalIF":7.1000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Building and Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360132325003464","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The natural ventilation and daylighting of dormitories are key factors affecting student comfort and health, with early-stage design greatly impacting indoor performance. Existing optimization methods, mainly evolutionary algorithms like genetic algorithms, excel in global search but struggle with adapting to dynamic environments, handling multi-dimensional tasks, and ensuring model generalization. This study proposes a multi-objective optimization framework integrating GAN and DRL for dormitory indoor airflow and daylighting enhancement. The GAN model provides real-time predictions of global wind speed and useful daylight illuminance (UDI). By combining a GAN-based surrogate model with a DRL approach based on Deep Deterministic Policy Gradient (DDPG), the framework iteratively refines dormitory unit layouts through continuous interaction between the environment and the agent. The effectiveness and generalization of the DRL-based method were evaluated across three dormitory typologies. Results indicate that, on the test dataset, the pix2pix model achieved R² values of 0.979 and 0.988 for predicting indoor wind and lighting conditions, respectively. Compared to traditional genetic algorithms, the DRL model demonstrated superior performance in optimizing indoor environmental conditions, achieving up to a 9.33 % improvement in wind environment optimization. The pre-trained models exhibited a certain degree of generalization across the three scenarios. This approach provides valuable support for environmentally driven indoor architectural optimization.
求助全文
约1分钟内获得全文 求助全文
来源期刊
Building and Environment
Building and Environment 工程技术-工程:环境
CiteScore
12.50
自引率
23.00%
发文量
1130
审稿时长
27 days
期刊介绍: Building and Environment, an international journal, is dedicated to publishing original research papers, comprehensive review articles, editorials, and short communications in the fields of building science, urban physics, and human interaction with the indoor and outdoor built environment. The journal emphasizes innovative technologies and knowledge verified through measurement and analysis. It covers environmental performance across various spatial scales, from cities and communities to buildings and systems, fostering collaborative, multi-disciplinary research with broader significance.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信