整合人群模型和强化学习的路径优化

Yanyun Fu, Wenxi Shi, Hui Zhang, Xiaoxue Ma, Yang Gao, Danhuai Guo
{"title":"整合人群模型和强化学习的路径优化","authors":"Yanyun Fu, Wenxi Shi, Hui Zhang, Xiaoxue Ma, Yang Gao, Danhuai Guo","doi":"10.1145/3356998.3365765","DOIUrl":null,"url":null,"abstract":"Exit choice and path planning are critical in emergency decision-making. Traditional research focuses on the shortest path, which is not sensitive to environmental factors such as the crowd congestion, obstacles distribution, air pollution, etc. To solve the path optimization problem, a behavior agent model is developed and integrated in the large-scale crowd simulation. The Q-Learning algorithm is applied to adjust the agent behavior. Considering the architectural space key exits and doors as network nodes, the paper presents combining dynamic crowd model and reinforcement learning strategy. The strategy with high training efficiency considering obstacles setup, crowd movement, and exits environment, the learning agent interacts dynamically with surrounding environment, and learns the shortest time path to exit. Simulation utilizes social force model for occupant movement, avoiding collisions with other occupants and obstacles. The path optimization is verified with the pedestrian library of Anylogic.","PeriodicalId":133472,"journal":{"name":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Path optimization of integrating crowd model and reinforcement learning\",\"authors\":\"Yanyun Fu, Wenxi Shi, Hui Zhang, Xiaoxue Ma, Yang Gao, Danhuai Guo\",\"doi\":\"10.1145/3356998.3365765\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Exit choice and path planning are critical in emergency decision-making. Traditional research focuses on the shortest path, which is not sensitive to environmental factors such as the crowd congestion, obstacles distribution, air pollution, etc. To solve the path optimization problem, a behavior agent model is developed and integrated in the large-scale crowd simulation. The Q-Learning algorithm is applied to adjust the agent behavior. Considering the architectural space key exits and doors as network nodes, the paper presents combining dynamic crowd model and reinforcement learning strategy. The strategy with high training efficiency considering obstacles setup, crowd movement, and exits environment, the learning agent interacts dynamically with surrounding environment, and learns the shortest time path to exit. Simulation utilizes social force model for occupant movement, avoiding collisions with other occupants and obstacles. The path optimization is verified with the pedestrian library of Anylogic.\",\"PeriodicalId\":133472,\"journal\":{\"name\":\"Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3356998.3365765\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3356998.3365765","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

出口选择和路径规划是应急决策的关键。传统的研究侧重于最短路径,对人群拥挤、障碍物分布、空气污染等环境因素不敏感。为了解决路径优化问题,建立了行为智能体模型,并将其集成到大规模人群仿真中。采用Q-Learning算法调整agent的行为。将建筑空间关键出口和门作为网络节点,提出了动态人群模型与强化学习策略相结合的方法。该策略考虑了障碍物设置、人群运动和退出环境,使学习智能体与周围环境动态交互,并学习最短时间的退出路径,训练效率高。仿真利用社会力模型进行乘员运动,避免与其他乘员和障碍物发生碰撞。利用Anylogic的行人库对路径优化进行了验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Path optimization of integrating crowd model and reinforcement learning
Exit choice and path planning are critical in emergency decision-making. Traditional research focuses on the shortest path, which is not sensitive to environmental factors such as the crowd congestion, obstacles distribution, air pollution, etc. To solve the path optimization problem, a behavior agent model is developed and integrated in the large-scale crowd simulation. The Q-Learning algorithm is applied to adjust the agent behavior. Considering the architectural space key exits and doors as network nodes, the paper presents combining dynamic crowd model and reinforcement learning strategy. The strategy with high training efficiency considering obstacles setup, crowd movement, and exits environment, the learning agent interacts dynamically with surrounding environment, and learns the shortest time path to exit. Simulation utilizes social force model for occupant movement, avoiding collisions with other occupants and obstacles. The path optimization is verified with the pedestrian library of Anylogic.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信