人群模拟中马尔可夫决策过程的并行求解器

Sergio Ruiz, Benjamín Hernández
{"title":"人群模拟中马尔可夫决策过程的并行求解器","authors":"Sergio Ruiz, Benjamín Hernández","doi":"10.1109/MICAI.2015.23","DOIUrl":null,"url":null,"abstract":"Classic path finding algorithms are not adequate in real world path planning, where environment information is incomplete or dynamic and Markov Decision Processes have been used as an alternative. The problem with the MDP formalism is that its state space grows exponentially with the number of domain variables, and its inference methods grow with the number of available actions. To overcome this issue, we formulate a MDP solver in terms of matrix multiplications, based on the Value Iteration algorithm, thus we can take advantage of the graphic processor units (GPUs) to produce interactively obstacle-free paths in the form of an Optimal Policy. We also propose a hexagonal grid navigation space, that reduces the cardinality of the MDP state set. We present a performance analysis of our technique using embedded systems, desktop CPU and GPUs and its application in crowd simulation. Our GPU algorithm presents 90x speed up in desktop platforms, and 30x speed up in embedded systems in contrast with its CPU multi-threaded version.","PeriodicalId":448255,"journal":{"name":"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"A Parallel Solver for Markov Decision Process in Crowd Simulations\",\"authors\":\"Sergio Ruiz, Benjamín Hernández\",\"doi\":\"10.1109/MICAI.2015.23\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Classic path finding algorithms are not adequate in real world path planning, where environment information is incomplete or dynamic and Markov Decision Processes have been used as an alternative. The problem with the MDP formalism is that its state space grows exponentially with the number of domain variables, and its inference methods grow with the number of available actions. To overcome this issue, we formulate a MDP solver in terms of matrix multiplications, based on the Value Iteration algorithm, thus we can take advantage of the graphic processor units (GPUs) to produce interactively obstacle-free paths in the form of an Optimal Policy. We also propose a hexagonal grid navigation space, that reduces the cardinality of the MDP state set. We present a performance analysis of our technique using embedded systems, desktop CPU and GPUs and its application in crowd simulation. Our GPU algorithm presents 90x speed up in desktop platforms, and 30x speed up in embedded systems in contrast with its CPU multi-threaded version.\",\"PeriodicalId\":448255,\"journal\":{\"name\":\"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MICAI.2015.23\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MICAI.2015.23","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 21

摘要

经典的寻径算法在现实世界的路径规划中是不充分的,因为环境信息是不完整的或动态的,而马尔可夫决策过程被用作替代方法。MDP形式化的问题在于,它的状态空间随着领域变量的数量呈指数增长,它的推理方法随着可用操作的数量增长。为了克服这一问题,我们在值迭代算法的基础上制定了矩阵乘法的MDP求解器,从而我们可以利用图形处理器单元(gpu)以最优策略的形式生成交互式无障碍路径。我们还提出了一个六边形网格导航空间,它减少了MDP状态集的基数。本文介绍了该技术在嵌入式系统、桌面CPU和gpu上的性能分析及其在人群模拟中的应用。与CPU多线程版本相比,我们的GPU算法在桌面平台上的速度提高了90倍,在嵌入式系统上的速度提高了30倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Parallel Solver for Markov Decision Process in Crowd Simulations
Classic path finding algorithms are not adequate in real world path planning, where environment information is incomplete or dynamic and Markov Decision Processes have been used as an alternative. The problem with the MDP formalism is that its state space grows exponentially with the number of domain variables, and its inference methods grow with the number of available actions. To overcome this issue, we formulate a MDP solver in terms of matrix multiplications, based on the Value Iteration algorithm, thus we can take advantage of the graphic processor units (GPUs) to produce interactively obstacle-free paths in the form of an Optimal Policy. We also propose a hexagonal grid navigation space, that reduces the cardinality of the MDP state set. We present a performance analysis of our technique using embedded systems, desktop CPU and GPUs and its application in crowd simulation. Our GPU algorithm presents 90x speed up in desktop platforms, and 30x speed up in embedded systems in contrast with its CPU multi-threaded version.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信