{"title":"An exploration-driven framework for path planning in complex buildings using improved MADDPG","authors":"Chong Zhang , Hong Liu , Wenhao Li","doi":"10.1016/j.jobe.2025.112626","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-agent deep reinforcement learning (MADRL) methods have been extensively applied to crowd evacuation in complex building environments. However, the intricate architecture of modern buildings and high population densities hinder agent exploration efficiency, while existing approaches struggle to overcome the challenges posed by sparse rewards. To tackle these issues, this study proposes the Intrinsic Curiosity Distillation Multi-Agent Deep Deterministic Policy Gradient (ICD-MADDPG) algorithm, an exploration-driven framework for path planning in complex buildings. First, the ICD-MADDPG algorithm introduces a curiosity mechanism by integrating the Intrinsic Curiosity Module (ICM) and Random Network Distillation (RND), thereby refining the reward mechanism and significantly enhancing exploration efficiency. Next, the feature extraction process is enhanced using a Long Short-Term Memory (LSTM) network, enabling the model to effectively capture temporal dependencies in dynamic environments. Finally, a two-layer evacuation mechanism is adopted, where the crowd is divided into groups consisting of leaders and followers. Leaders utilize the ICD-MADDPG algorithm for global evacuation path planning, while followers employ the Reciprocal Velocity Obstacle (RVO) algorithm to follow the leaders and avoid collisions efficiently. Experimental results demonstrate that the ICD-MADDPG algorithm achieves superior rewards, improves evacuation efficiency, and effectively mitigates congestion. This framework provides a robust theoretical basis for optimizing evacuation strategies and offers practical value for intelligent building systems and emergency response planning.</div></div>","PeriodicalId":15064,"journal":{"name":"Journal of building engineering","volume":"107 ","pages":"Article 112626"},"PeriodicalIF":6.7000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of building engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352710225008630","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Multi-agent deep reinforcement learning (MADRL) methods have been extensively applied to crowd evacuation in complex building environments. However, the intricate architecture of modern buildings and high population densities hinder agent exploration efficiency, while existing approaches struggle to overcome the challenges posed by sparse rewards. To tackle these issues, this study proposes the Intrinsic Curiosity Distillation Multi-Agent Deep Deterministic Policy Gradient (ICD-MADDPG) algorithm, an exploration-driven framework for path planning in complex buildings. First, the ICD-MADDPG algorithm introduces a curiosity mechanism by integrating the Intrinsic Curiosity Module (ICM) and Random Network Distillation (RND), thereby refining the reward mechanism and significantly enhancing exploration efficiency. Next, the feature extraction process is enhanced using a Long Short-Term Memory (LSTM) network, enabling the model to effectively capture temporal dependencies in dynamic environments. Finally, a two-layer evacuation mechanism is adopted, where the crowd is divided into groups consisting of leaders and followers. Leaders utilize the ICD-MADDPG algorithm for global evacuation path planning, while followers employ the Reciprocal Velocity Obstacle (RVO) algorithm to follow the leaders and avoid collisions efficiently. Experimental results demonstrate that the ICD-MADDPG algorithm achieves superior rewards, improves evacuation efficiency, and effectively mitigates congestion. This framework provides a robust theoretical basis for optimizing evacuation strategies and offers practical value for intelligent building systems and emergency response planning.
期刊介绍:
The Journal of Building Engineering is an interdisciplinary journal that covers all aspects of science and technology concerned with the whole life cycle of the built environment; from the design phase through to construction, operation, performance, maintenance and its deterioration.