基于机器学习的紧急疏散管理人员动态路径决策

IF 2.8 3区 物理与天体物理 Q2 PHYSICS, MULTIDISCIPLINARY
Peng Yang, Bozheng Zhang, Kai Shi, Yi Hui
{"title":"基于机器学习的紧急疏散管理人员动态路径决策","authors":"Peng Yang,&nbsp;Bozheng Zhang,&nbsp;Kai Shi,&nbsp;Yi Hui","doi":"10.1016/j.physa.2025.130561","DOIUrl":null,"url":null,"abstract":"<div><div>In emergency situations such as indoor fires, emergency stewards can significantly influence the evacuation behavior of trapped individuals, and the rationality of their own path decisions is crucial to the overall evacuation effectiveness. This paper introduces an improved social force model to reflect the impact of stewards on the behavior of those in distress, and designs a decision-making framework based on simulation models and Deep Reinforcement Learning (DRL) technology to optimize the path decisions of stewards in dynamic scenarios. The simulation model is used to simulate various scenarios to obtain sufficient sample data; the role of DRL is to interact with the environment and dynamically guide individuals towards optimal paths using learned effective strategies. Within this framework, the decision training for emergency stewards employs a Modified Priority Experience Deep Q-Network (MPE-DQN), avoiding areas with high personnel density to optimize evacuation path decisions. The safety metric during the evacuation process is measured by personnel density per unit area, and evacuation time is chosen as the efficiency metric. Simulation experiments conducted in AnyLogic show that compared to the standard DQN algorithm, our framework, using the MPE-DQN algorithm, increased safety by 58.77 % and improved efficiency by 14.2 %.</div></div>","PeriodicalId":20152,"journal":{"name":"Physica A: Statistical Mechanics and its Applications","volume":"667 ","pages":"Article 130561"},"PeriodicalIF":2.8000,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning-enhanced dynamic path decisions for emergency stewards in emergency evacuations\",\"authors\":\"Peng Yang,&nbsp;Bozheng Zhang,&nbsp;Kai Shi,&nbsp;Yi Hui\",\"doi\":\"10.1016/j.physa.2025.130561\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In emergency situations such as indoor fires, emergency stewards can significantly influence the evacuation behavior of trapped individuals, and the rationality of their own path decisions is crucial to the overall evacuation effectiveness. This paper introduces an improved social force model to reflect the impact of stewards on the behavior of those in distress, and designs a decision-making framework based on simulation models and Deep Reinforcement Learning (DRL) technology to optimize the path decisions of stewards in dynamic scenarios. The simulation model is used to simulate various scenarios to obtain sufficient sample data; the role of DRL is to interact with the environment and dynamically guide individuals towards optimal paths using learned effective strategies. Within this framework, the decision training for emergency stewards employs a Modified Priority Experience Deep Q-Network (MPE-DQN), avoiding areas with high personnel density to optimize evacuation path decisions. The safety metric during the evacuation process is measured by personnel density per unit area, and evacuation time is chosen as the efficiency metric. Simulation experiments conducted in AnyLogic show that compared to the standard DQN algorithm, our framework, using the MPE-DQN algorithm, increased safety by 58.77 % and improved efficiency by 14.2 %.</div></div>\",\"PeriodicalId\":20152,\"journal\":{\"name\":\"Physica A: Statistical Mechanics and its Applications\",\"volume\":\"667 \",\"pages\":\"Article 130561\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-03-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physica A: Statistical Mechanics and its Applications\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0378437125002134\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PHYSICS, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physica A: Statistical Mechanics and its Applications","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378437125002134","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

在室内火灾等紧急情况下,应急乘务员可以显著影响被困人员的疏散行为,其路径决策的合理性对整体疏散效果至关重要。本文引入了一种改进的社会力量模型来反映管理者对遇害者行为的影响,并设计了一个基于仿真模型和深度强化学习(DRL)技术的决策框架来优化动态场景下管理者的路径决策。仿真模型用于模拟各种场景,以获得足够的样本数据;DRL的作用是与环境相互作用,利用学习到的有效策略,动态地引导个体走向最优路径。在此框架下,应急管理人员的决策培训采用改进的优先级经验深度q网络(MPE-DQN),避开人员密度高的区域,优化疏散路径决策。疏散过程中的安全度量以单位面积人员密度度量,疏散时间作为效率度量。在AnyLogic中进行的仿真实验表明,与标准DQN算法相比,我们的框架使用MPE-DQN算法,安全性提高58.77 %,效率提高14.2 %。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine learning-enhanced dynamic path decisions for emergency stewards in emergency evacuations
In emergency situations such as indoor fires, emergency stewards can significantly influence the evacuation behavior of trapped individuals, and the rationality of their own path decisions is crucial to the overall evacuation effectiveness. This paper introduces an improved social force model to reflect the impact of stewards on the behavior of those in distress, and designs a decision-making framework based on simulation models and Deep Reinforcement Learning (DRL) technology to optimize the path decisions of stewards in dynamic scenarios. The simulation model is used to simulate various scenarios to obtain sufficient sample data; the role of DRL is to interact with the environment and dynamically guide individuals towards optimal paths using learned effective strategies. Within this framework, the decision training for emergency stewards employs a Modified Priority Experience Deep Q-Network (MPE-DQN), avoiding areas with high personnel density to optimize evacuation path decisions. The safety metric during the evacuation process is measured by personnel density per unit area, and evacuation time is chosen as the efficiency metric. Simulation experiments conducted in AnyLogic show that compared to the standard DQN algorithm, our framework, using the MPE-DQN algorithm, increased safety by 58.77 % and improved efficiency by 14.2 %.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.20
自引率
9.10%
发文量
852
审稿时长
6.6 months
期刊介绍: Physica A: Statistical Mechanics and its Applications Recognized by the European Physical Society Physica A publishes research in the field of statistical mechanics and its applications. Statistical mechanics sets out to explain the behaviour of macroscopic systems by studying the statistical properties of their microscopic constituents. Applications of the techniques of statistical mechanics are widespread, and include: applications to physical systems such as solids, liquids and gases; applications to chemical and biological systems (colloids, interfaces, complex fluids, polymers and biopolymers, cell physics); and other interdisciplinary applications to for instance biological, economical and sociological systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信