{"title":"多智能体仿真的深度强化学习","authors":"Yasuki Iizuka","doi":"10.1109/IIAIAAI55812.2022.00137","DOIUrl":null,"url":null,"abstract":"The purpose of this study is to easily create agent behavior for disaster evacuation simulation. Multi-agent simulation is commonly used in disaster evacuation simulations, but it is difficult to program agents. Floor field models and reinforcement learning have been proposed as a way of solving this problem. However, there are issues with unnatural stagnation or learning times. In this paper, we report the reinforcement learning method for an evacuation simulation agent, using the local environment. As a result of the experiment, it was confirmed that efficient movement of the agent can be achieved in a short learning time.","PeriodicalId":156230,"journal":{"name":"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep Reinforcement Learning for Multi-agent Simulation using a partial floor field cutout\",\"authors\":\"Yasuki Iizuka\",\"doi\":\"10.1109/IIAIAAI55812.2022.00137\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The purpose of this study is to easily create agent behavior for disaster evacuation simulation. Multi-agent simulation is commonly used in disaster evacuation simulations, but it is difficult to program agents. Floor field models and reinforcement learning have been proposed as a way of solving this problem. However, there are issues with unnatural stagnation or learning times. In this paper, we report the reinforcement learning method for an evacuation simulation agent, using the local environment. As a result of the experiment, it was confirmed that efficient movement of the agent can be achieved in a short learning time.\",\"PeriodicalId\":156230,\"journal\":{\"name\":\"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)\",\"volume\":\"114 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IIAIAAI55812.2022.00137\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 12th International Congress on Advanced Applied Informatics (IIAI-AAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIAIAAI55812.2022.00137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Reinforcement Learning for Multi-agent Simulation using a partial floor field cutout
The purpose of this study is to easily create agent behavior for disaster evacuation simulation. Multi-agent simulation is commonly used in disaster evacuation simulations, but it is difficult to program agents. Floor field models and reinforcement learning have been proposed as a way of solving this problem. However, there are issues with unnatural stagnation or learning times. In this paper, we report the reinforcement learning method for an evacuation simulation agent, using the local environment. As a result of the experiment, it was confirmed that efficient movement of the agent can be achieved in a short learning time.