Enhancing performance and generalization in dormitory optimization using deep reinforcement learning with embedded surrogate model

IF 7.1 · Zone 1 (Engineering & Technology) · Q1 CONSTRUCTION & BUILDING TECHNOLOGY

Building and Environment Pub Date : 2025-03-14 DOI:10.1016/j.buildenv.2025.112864

Zewei Shi, Chenyu Huang, Jinyu Wang, Zhongqi Yu, Jiayan Fu, Jiawei Yao

Building and Environment, Volume 276, Article 112864. https://www.sciencedirect.com/science/article/pii/S0360132325003464

Citations: 0

Abstract

Natural ventilation and daylighting in dormitories are key factors in student comfort and health, and early-stage design decisions strongly affect indoor performance. Existing optimization methods, mainly evolutionary algorithms such as genetic algorithms, excel at global search but struggle to adapt to dynamic environments, handle multi-dimensional tasks, and generalize across models. This study proposes a multi-objective optimization framework that integrates a generative adversarial network (GAN) with deep reinforcement learning (DRL) to improve indoor airflow and daylighting in dormitories. The GAN model provides real-time predictions of global wind speed and useful daylight illuminance (UDI). By combining this GAN-based surrogate model with a DRL approach based on Deep Deterministic Policy Gradient (DDPG), the framework iteratively refines dormitory unit layouts through continuous interaction between the environment and the agent. The effectiveness and generalization of the DRL-based method were evaluated across three dormitory typologies. On the test dataset, the pix2pix model achieved R² values of 0.979 and 0.988 for predicting indoor wind and lighting conditions, respectively. Compared with traditional genetic algorithms, the DRL model performed better at optimizing indoor environmental conditions, achieving up to a 9.33% improvement in wind environment optimization. The pre-trained models exhibited a certain degree of generalization across the three scenarios. This approach provides valuable support for environmentally driven indoor architectural optimization.
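
The environment-agent loop described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. Every name in it is hypothetical: `toy_surrogate` is an arbitrary analytic function standing in for the trained GAN surrogate, `random_policy` is a placeholder for a DDPG actor, and the three layout parameters and the equal-weight reward are assumptions made only so the loop runs. The sketch shows only the structure: the agent proposes a continuous adjustment, the surrogate scores the new layout instantly, and the change in the weighted score becomes the reward.

```python
import random
from dataclasses import dataclass

# Hypothetical layout parameters, each normalized to [0, 1].
PARAM_NAMES = ("window_width", "window_height", "room_depth")


def toy_surrogate(layout):
    """Stand-in for the trained GAN surrogate (NOT pix2pix).

    Returns two scalar scores in [0, 1]: (wind_score, daylight_score).
    The formulas are arbitrary and exist only to make the loop runnable.
    """
    w, h, d = layout
    wind = max(0.0, 1.0 - abs(w - 0.7) - 0.5 * d)
    daylight = max(0.0, 1.0 - abs(w * h - 0.4) - 0.3 * d)
    return wind, daylight


@dataclass
class DormEnv:
    """Minimal environment: the state is the layout, the action nudges it."""
    wind_weight: float = 0.5  # assumed equal weighting of the two objectives
    max_step: float = 0.1     # assumed bound on each per-step adjustment

    def reset(self, rng):
        self.layout = [rng.random() for _ in PARAM_NAMES]
        self.score = self._score()
        return list(self.layout)

    def _score(self):
        wind, daylight = toy_surrogate(self.layout)
        return self.wind_weight * wind + (1.0 - self.wind_weight) * daylight

    def step(self, action):
        # Clip each continuous action, then keep parameters inside [0, 1].
        for i, a in enumerate(action):
            a = max(-self.max_step, min(self.max_step, a))
            self.layout[i] = max(0.0, min(1.0, self.layout[i] + a))
        new_score = self._score()
        reward = new_score - self.score  # improvement in the weighted score
        self.score = new_score
        return list(self.layout), reward


def random_policy(state, rng, max_step=0.1):
    """Placeholder for a DDPG actor network; it ignores the state."""
    return [rng.uniform(-max_step, max_step) for _ in state]


def run_episode(steps=50, seed=0):
    rng = random.Random(seed)
    env = DormEnv()
    state = env.reset(rng)
    initial = env.score
    total_reward = 0.0
    for _ in range(steps):
        action = random_policy(state, rng)
        state, reward = env.step(action)
        total_reward += reward
    return initial, env.score, total_reward, state


if __name__ == "__main__":
    initial, final, total, layout = run_episode()
    print(f"initial={initial:.3f} final={final:.3f} sum_of_rewards={total:.3f}")
```

Because the reward is defined as the step-to-step change in score, the rewards summed over an episode equal the final score minus the initial score. In a real setup the random policy would be replaced by a trained actor-critic pair, and the surrogate would return predicted wind-speed and UDI fields rather than two scalars.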


Source journal

Building and Environment (Engineering & Technology; Engineering, Environmental)

CiteScore: 12.50

Self-citation rate: 23.00%

Articles published: 1130

Review time: 27 days

Journal description: Building and Environment, an international journal, is dedicated to publishing original research papers, comprehensive review articles, editorials, and short communications in the fields of building science, urban physics, and human interaction with the indoor and outdoor built environment. The journal emphasizes innovative technologies and knowledge verified through measurement and analysis. It covers environmental performance across various spatial scales, from cities and communities to buildings and systems, fostering collaborative, multi-disciplinary research with broader significance.