整合人群模型和强化学习的路径优化

Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management Pub Date : 2019-11-05 DOI:10.1145/3356998.3365765

Yanyun Fu, Wenxi Shi, Hui Zhang, Xiaoxue Ma, Yang Gao, Danhuai Guo

{"title":"整合人群模型和强化学习的路径优化","authors":"Yanyun Fu, Wenxi Shi, Hui Zhang, Xiaoxue Ma, Yang Gao, Danhuai Guo","doi":"10.1145/3356998.3365765","DOIUrl":null,"url":null,"abstract":"Exit choice and path planning are critical in emergency decision-making. Traditional research focuses on the shortest path, which is not sensitive to environmental factors such as the crowd congestion, obstacles distribution, air pollution, etc. To solve the path optimization problem, a behavior agent model is developed and integrated in the large-scale crowd simulation. The Q-Learning algorithm is applied to adjust the agent behavior. Considering the architectural space key exits and doors as network nodes, the paper presents combining dynamic crowd model and reinforcement learning strategy. The strategy with high training efficiency considering obstacles setup, crowd movement, and exits environment, the learning agent interacts dynamically with surrounding environment, and learns the shortest time path to exit. Simulation utilizes social force model for occupant movement, avoiding collisions with other occupants and obstacles. The path optimization is verified with the pedestrian library of Anylogic.","PeriodicalId":133472,"journal":{"name":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Path optimization of integrating crowd model and reinforcement learning\",\"authors\":\"Yanyun Fu, Wenxi Shi, Hui Zhang, Xiaoxue Ma, Yang Gao, Danhuai Guo\",\"doi\":\"10.1145/3356998.3365765\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Exit choice and path planning are critical in emergency decision-making. Traditional research focuses on the shortest path, which is not sensitive to environmental factors such as the crowd congestion, obstacles distribution, air pollution, etc. To solve the path optimization problem, a behavior agent model is developed and integrated in the large-scale crowd simulation. The Q-Learning algorithm is applied to adjust the agent behavior. Considering the architectural space key exits and doors as network nodes, the paper presents combining dynamic crowd model and reinforcement learning strategy. The strategy with high training efficiency considering obstacles setup, crowd movement, and exits environment, the learning agent interacts dynamically with surrounding environment, and learns the shortest time path to exit. Simulation utilizes social force model for occupant movement, avoiding collisions with other occupants and obstacles. The path optimization is verified with the pedestrian library of Anylogic.\",\"PeriodicalId\":133472,\"journal\":{\"name\":\"Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3356998.3365765\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3356998.3365765","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

出口选择和路径规划是应急决策的关键。传统的研究侧重于最短路径，对人群拥挤、障碍物分布、空气污染等环境因素不敏感。为了解决路径优化问题，建立了行为智能体模型，并将其集成到大规模人群仿真中。采用Q-Learning算法调整agent的行为。将建筑空间关键出口和门作为网络节点，提出了动态人群模型与强化学习策略相结合的方法。该策略考虑了障碍物设置、人群运动和退出环境，使学习智能体与周围环境动态交互，并学习最短时间的退出路径，训练效率高。仿真利用社会力模型进行乘员运动，避免与其他乘员和障碍物发生碰撞。利用Anylogic的行人库对路径优化进行了验证。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Path optimization of integrating crowd model and reinforcement learning

Exit choice and path planning are critical in emergency decision-making. Traditional research focuses on the shortest path, which is not sensitive to environmental factors such as the crowd congestion, obstacles distribution, air pollution, etc. To solve the path optimization problem, a behavior agent model is developed and integrated in the large-scale crowd simulation. The Q-Learning algorithm is applied to adjust the agent behavior. Considering the architectural space key exits and doors as network nodes, the paper presents combining dynamic crowd model and reinforcement learning strategy. The strategy with high training efficiency considering obstacles setup, crowd movement, and exits environment, the learning agent interacts dynamically with surrounding environment, and learns the shortest time path to exit. Simulation utilizes social force model for occupant movement, avoiding collisions with other occupants and obstacles. The path optimization is verified with the pedestrian library of Anylogic.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management

自引率

0.00%

发文量