Wenzhe Shang , Junjie Liu , Han Meng , Lizhi Jia , Xilei Dai
{"title":"基于rl的人类行为导向的最佳通风策略,以提高能源效率和室内空气质量","authors":"Wenzhe Shang , Junjie Liu , Han Meng , Lizhi Jia , Xilei Dai","doi":"10.1016/j.enbuild.2025.116072","DOIUrl":null,"url":null,"abstract":"<div><div>In biological cleanrooms for pharmaceutical and biosafety laboratories, escalating cleanliness standards have made the high energy consumption associated with increased air change rates in ventilation systems untenable. Moreover, stringent pressure differential requirements are crucial in biological cleanrooms to ensure the physical isolation of pathogenic microorganisms. This study developed a multi-zone ventilation system model for a biopharmaceutical cleanroom using the Modelica language. A deep reinforcement learning (DRL) model was implemented in Python based on the actor–critic and proximal policy optimization (PPO) algorithms, utilising the Modelica model as the training environment. To maintain particulate matter (PM) concentrations and conserve energy, the DRL control model was trained to adjust air damper positions by identifying patterns in occupancy changes and pollutant concentration dynamics across various times and workspaces within the cleanroom. Results indicated that, relative to the conventional baseline control strategy, the developed reinforcement learning control approach achieved a 14.7 % reduction in energy consumption while maintaining pollutant concentrations within regulatory limits for cleanrooms, culminating in annual energy savings of 11,212.8 kWh. Additionally, pressure fluctuation ranges in the three controlled work zones of the cleanroom were diminished by 59.16 %, 9.58 %, and 29.32 %, respectively. SHapley Additive explanation (SHAP) analysis was employed to elucidate the contributing factors influencing the outputs of the developed DRL control model. Furthermore, the generalisation of the DRL control model was discussed by altering the control period and inner source of the PM.</div></div>","PeriodicalId":11641,"journal":{"name":"Energy and Buildings","volume":"345 ","pages":"Article 116072"},"PeriodicalIF":6.6000,"publicationDate":"2025-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A RL-based human behavior oriented optimal ventilation strategy for better energy efficiency and indoor air quality\",\"authors\":\"Wenzhe Shang , Junjie Liu , Han Meng , Lizhi Jia , Xilei Dai\",\"doi\":\"10.1016/j.enbuild.2025.116072\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In biological cleanrooms for pharmaceutical and biosafety laboratories, escalating cleanliness standards have made the high energy consumption associated with increased air change rates in ventilation systems untenable. Moreover, stringent pressure differential requirements are crucial in biological cleanrooms to ensure the physical isolation of pathogenic microorganisms. This study developed a multi-zone ventilation system model for a biopharmaceutical cleanroom using the Modelica language. A deep reinforcement learning (DRL) model was implemented in Python based on the actor–critic and proximal policy optimization (PPO) algorithms, utilising the Modelica model as the training environment. To maintain particulate matter (PM) concentrations and conserve energy, the DRL control model was trained to adjust air damper positions by identifying patterns in occupancy changes and pollutant concentration dynamics across various times and workspaces within the cleanroom. Results indicated that, relative to the conventional baseline control strategy, the developed reinforcement learning control approach achieved a 14.7 % reduction in energy consumption while maintaining pollutant concentrations within regulatory limits for cleanrooms, culminating in annual energy savings of 11,212.8 kWh. Additionally, pressure fluctuation ranges in the three controlled work zones of the cleanroom were diminished by 59.16 %, 9.58 %, and 29.32 %, respectively. SHapley Additive explanation (SHAP) analysis was employed to elucidate the contributing factors influencing the outputs of the developed DRL control model. Furthermore, the generalisation of the DRL control model was discussed by altering the control period and inner source of the PM.</div></div>\",\"PeriodicalId\":11641,\"journal\":{\"name\":\"Energy and Buildings\",\"volume\":\"345 \",\"pages\":\"Article 116072\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2025-06-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Energy and Buildings\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0378778825008023\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and Buildings","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378778825008023","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
A RL-based human behavior oriented optimal ventilation strategy for better energy efficiency and indoor air quality
In biological cleanrooms for pharmaceutical and biosafety laboratories, escalating cleanliness standards have made the high energy consumption associated with increased air change rates in ventilation systems untenable. Moreover, stringent pressure differential requirements are crucial in biological cleanrooms to ensure the physical isolation of pathogenic microorganisms. This study developed a multi-zone ventilation system model for a biopharmaceutical cleanroom using the Modelica language. A deep reinforcement learning (DRL) model was implemented in Python based on the actor–critic and proximal policy optimization (PPO) algorithms, utilising the Modelica model as the training environment. To maintain particulate matter (PM) concentrations and conserve energy, the DRL control model was trained to adjust air damper positions by identifying patterns in occupancy changes and pollutant concentration dynamics across various times and workspaces within the cleanroom. Results indicated that, relative to the conventional baseline control strategy, the developed reinforcement learning control approach achieved a 14.7 % reduction in energy consumption while maintaining pollutant concentrations within regulatory limits for cleanrooms, culminating in annual energy savings of 11,212.8 kWh. Additionally, pressure fluctuation ranges in the three controlled work zones of the cleanroom were diminished by 59.16 %, 9.58 %, and 29.32 %, respectively. SHapley Additive explanation (SHAP) analysis was employed to elucidate the contributing factors influencing the outputs of the developed DRL control model. Furthermore, the generalisation of the DRL control model was discussed by altering the control period and inner source of the PM.
期刊介绍:
An international journal devoted to investigations of energy use and efficiency in buildings
Energy and Buildings is an international journal publishing articles with explicit links to energy use in buildings. The aim is to present new research results, and new proven practice aimed at reducing the energy needs of a building and improving indoor environment quality.