{"title":"Path optimization of integrating crowd model and reinforcement learning","authors":"Yanyun Fu, Wenxi Shi, Hui Zhang, Xiaoxue Ma, Yang Gao, Danhuai Guo","doi":"10.1145/3356998.3365765","DOIUrl":null,"url":null,"abstract":"Exit choice and path planning are critical in emergency decision-making. Traditional research focuses on the shortest path, which is not sensitive to environmental factors such as the crowd congestion, obstacles distribution, air pollution, etc. To solve the path optimization problem, a behavior agent model is developed and integrated in the large-scale crowd simulation. The Q-Learning algorithm is applied to adjust the agent behavior. Considering the architectural space key exits and doors as network nodes, the paper presents combining dynamic crowd model and reinforcement learning strategy. The strategy with high training efficiency considering obstacles setup, crowd movement, and exits environment, the learning agent interacts dynamically with surrounding environment, and learns the shortest time path to exit. Simulation utilizes social force model for occupant movement, avoiding collisions with other occupants and obstacles. The path optimization is verified with the pedestrian library of Anylogic.","PeriodicalId":133472,"journal":{"name":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th ACM SIGSPATIAL International Workshop on the Use of GIS in Emergency Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3356998.3365765","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Exit choice and path planning are critical in emergency decision-making. Traditional research focuses on the shortest path, which is not sensitive to environmental factors such as the crowd congestion, obstacles distribution, air pollution, etc. To solve the path optimization problem, a behavior agent model is developed and integrated in the large-scale crowd simulation. The Q-Learning algorithm is applied to adjust the agent behavior. Considering the architectural space key exits and doors as network nodes, the paper presents combining dynamic crowd model and reinforcement learning strategy. The strategy with high training efficiency considering obstacles setup, crowd movement, and exits environment, the learning agent interacts dynamically with surrounding environment, and learns the shortest time path to exit. Simulation utilizes social force model for occupant movement, avoiding collisions with other occupants and obstacles. The path optimization is verified with the pedestrian library of Anylogic.