基于机器学习的紧急疏散管理人员动态路径决策

IF 2.8 3区物理与天体物理 Q2 PHYSICS, MULTIDISCIPLINARY

Physica A: Statistical Mechanics and its Applications Pub Date : 2025-03-27 DOI:10.1016/j.physa.2025.130561

Peng Yang, Bozheng Zhang, Kai Shi, Yi Hui

{"title":"基于机器学习的紧急疏散管理人员动态路径决策","authors":"Peng Yang, Bozheng Zhang, Kai Shi, Yi Hui","doi":"10.1016/j.physa.2025.130561","DOIUrl":null,"url":null,"abstract":"<div><div>In emergency situations such as indoor fires, emergency stewards can significantly influence the evacuation behavior of trapped individuals, and the rationality of their own path decisions is crucial to the overall evacuation effectiveness. This paper introduces an improved social force model to reflect the impact of stewards on the behavior of those in distress, and designs a decision-making framework based on simulation models and Deep Reinforcement Learning (DRL) technology to optimize the path decisions of stewards in dynamic scenarios. The simulation model is used to simulate various scenarios to obtain sufficient sample data; the role of DRL is to interact with the environment and dynamically guide individuals towards optimal paths using learned effective strategies. Within this framework, the decision training for emergency stewards employs a Modified Priority Experience Deep Q-Network (MPE-DQN), avoiding areas with high personnel density to optimize evacuation path decisions. The safety metric during the evacuation process is measured by personnel density per unit area, and evacuation time is chosen as the efficiency metric. Simulation experiments conducted in AnyLogic show that compared to the standard DQN algorithm, our framework, using the MPE-DQN algorithm, increased safety by 58.77 % and improved efficiency by 14.2 %.</div></div>","PeriodicalId":20152,"journal":{"name":"Physica A: Statistical Mechanics and its Applications","volume":"667 ","pages":"Article 130561"},"PeriodicalIF":2.8000,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning-enhanced dynamic path decisions for emergency stewards in emergency evacuations\",\"authors\":\"Peng Yang, Bozheng Zhang, Kai Shi, Yi Hui\",\"doi\":\"10.1016/j.physa.2025.130561\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In emergency situations such as indoor fires, emergency stewards can significantly influence the evacuation behavior of trapped individuals, and the rationality of their own path decisions is crucial to the overall evacuation effectiveness. This paper introduces an improved social force model to reflect the impact of stewards on the behavior of those in distress, and designs a decision-making framework based on simulation models and Deep Reinforcement Learning (DRL) technology to optimize the path decisions of stewards in dynamic scenarios. The simulation model is used to simulate various scenarios to obtain sufficient sample data; the role of DRL is to interact with the environment and dynamically guide individuals towards optimal paths using learned effective strategies. Within this framework, the decision training for emergency stewards employs a Modified Priority Experience Deep Q-Network (MPE-DQN), avoiding areas with high personnel density to optimize evacuation path decisions. The safety metric during the evacuation process is measured by personnel density per unit area, and evacuation time is chosen as the efficiency metric. Simulation experiments conducted in AnyLogic show that compared to the standard DQN algorithm, our framework, using the MPE-DQN algorithm, increased safety by 58.77 % and improved efficiency by 14.2 %.</div></div>\",\"PeriodicalId\":20152,\"journal\":{\"name\":\"Physica A: Statistical Mechanics and its Applications\",\"volume\":\"667 \",\"pages\":\"Article 130561\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-03-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physica A: Statistical Mechanics and its Applications\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0378437125002134\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PHYSICS, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physica A: Statistical Mechanics and its Applications","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378437125002134","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

在室内火灾等紧急情况下，应急乘务员可以显著影响被困人员的疏散行为，其路径决策的合理性对整体疏散效果至关重要。本文引入了一种改进的社会力量模型来反映管理者对遇害者行为的影响，并设计了一个基于仿真模型和深度强化学习（DRL）技术的决策框架来优化动态场景下管理者的路径决策。仿真模型用于模拟各种场景，以获得足够的样本数据；DRL的作用是与环境相互作用，利用学习到的有效策略，动态地引导个体走向最优路径。在此框架下，应急管理人员的决策培训采用改进的优先级经验深度q网络（MPE-DQN），避开人员密度高的区域，优化疏散路径决策。疏散过程中的安全度量以单位面积人员密度度量，疏散时间作为效率度量。在AnyLogic中进行的仿真实验表明，与标准DQN算法相比，我们的框架使用MPE-DQN算法，安全性提高58.77 %，效率提高14.2 %。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Machine learning-enhanced dynamic path decisions for emergency stewards in emergency evacuations

In emergency situations such as indoor fires, emergency stewards can significantly influence the evacuation behavior of trapped individuals, and the rationality of their own path decisions is crucial to the overall evacuation effectiveness. This paper introduces an improved social force model to reflect the impact of stewards on the behavior of those in distress, and designs a decision-making framework based on simulation models and Deep Reinforcement Learning (DRL) technology to optimize the path decisions of stewards in dynamic scenarios. The simulation model is used to simulate various scenarios to obtain sufficient sample data; the role of DRL is to interact with the environment and dynamically guide individuals towards optimal paths using learned effective strategies. Within this framework, the decision training for emergency stewards employs a Modified Priority Experience Deep Q-Network (MPE-DQN), avoiding areas with high personnel density to optimize evacuation path decisions. The safety metric during the evacuation process is measured by personnel density per unit area, and evacuation time is chosen as the efficiency metric. Simulation experiments conducted in AnyLogic show that compared to the standard DQN algorithm, our framework, using the MPE-DQN algorithm, increased safety by 58.77 % and improved efficiency by 14.2 %.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Physica A: Statistical Mechanics and its Applications 物理-物理：综合

CiteScore

7.20

自引率

9.10%

发文量

852

审稿时长

6.6 months

期刊介绍： Physica A: Statistical Mechanics and its Applications Recognized by the European Physical Society Physica A publishes research in the field of statistical mechanics and its applications. Statistical mechanics sets out to explain the behaviour of macroscopic systems by studying the statistical properties of their microscopic constituents. Applications of the techniques of statistical mechanics are widespread, and include: applications to physical systems such as solids, liquids and gases; applications to chemical and biological systems (colloids, interfaces, complex fluids, polymers and biopolymers, cell physics); and other interdisciplinary applications to for instance biological, economical and sociological systems.