人群模拟中马尔可夫决策过程的并行求解器

2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI) Pub Date : 2015-10-25 DOI:10.1109/MICAI.2015.23

Sergio Ruiz, Benjamín Hernández

{"title":"人群模拟中马尔可夫决策过程的并行求解器","authors":"Sergio Ruiz, Benjamín Hernández","doi":"10.1109/MICAI.2015.23","DOIUrl":null,"url":null,"abstract":"Classic path finding algorithms are not adequate in real world path planning, where environment information is incomplete or dynamic and Markov Decision Processes have been used as an alternative. The problem with the MDP formalism is that its state space grows exponentially with the number of domain variables, and its inference methods grow with the number of available actions. To overcome this issue, we formulate a MDP solver in terms of matrix multiplications, based on the Value Iteration algorithm, thus we can take advantage of the graphic processor units (GPUs) to produce interactively obstacle-free paths in the form of an Optimal Policy. We also propose a hexagonal grid navigation space, that reduces the cardinality of the MDP state set. We present a performance analysis of our technique using embedded systems, desktop CPU and GPUs and its application in crowd simulation. Our GPU algorithm presents 90x speed up in desktop platforms, and 30x speed up in embedded systems in contrast with its CPU multi-threaded version.","PeriodicalId":448255,"journal":{"name":"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"A Parallel Solver for Markov Decision Process in Crowd Simulations\",\"authors\":\"Sergio Ruiz, Benjamín Hernández\",\"doi\":\"10.1109/MICAI.2015.23\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Classic path finding algorithms are not adequate in real world path planning, where environment information is incomplete or dynamic and Markov Decision Processes have been used as an alternative. The problem with the MDP formalism is that its state space grows exponentially with the number of domain variables, and its inference methods grow with the number of available actions. To overcome this issue, we formulate a MDP solver in terms of matrix multiplications, based on the Value Iteration algorithm, thus we can take advantage of the graphic processor units (GPUs) to produce interactively obstacle-free paths in the form of an Optimal Policy. We also propose a hexagonal grid navigation space, that reduces the cardinality of the MDP state set. We present a performance analysis of our technique using embedded systems, desktop CPU and GPUs and its application in crowd simulation. Our GPU algorithm presents 90x speed up in desktop platforms, and 30x speed up in embedded systems in contrast with its CPU multi-threaded version.\",\"PeriodicalId\":448255,\"journal\":{\"name\":\"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MICAI.2015.23\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MICAI.2015.23","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 21

摘要

经典的寻径算法在现实世界的路径规划中是不充分的，因为环境信息是不完整的或动态的，而马尔可夫决策过程被用作替代方法。MDP形式化的问题在于，它的状态空间随着领域变量的数量呈指数增长，它的推理方法随着可用操作的数量增长。为了克服这一问题，我们在值迭代算法的基础上制定了矩阵乘法的MDP求解器，从而我们可以利用图形处理器单元(gpu)以最优策略的形式生成交互式无障碍路径。我们还提出了一个六边形网格导航空间，它减少了MDP状态集的基数。本文介绍了该技术在嵌入式系统、桌面CPU和gpu上的性能分析及其在人群模拟中的应用。与CPU多线程版本相比，我们的GPU算法在桌面平台上的速度提高了90倍，在嵌入式系统上的速度提高了30倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Parallel Solver for Markov Decision Process in Crowd Simulations

Classic path finding algorithms are not adequate in real world path planning, where environment information is incomplete or dynamic and Markov Decision Processes have been used as an alternative. The problem with the MDP formalism is that its state space grows exponentially with the number of domain variables, and its inference methods grow with the number of available actions. To overcome this issue, we formulate a MDP solver in terms of matrix multiplications, based on the Value Iteration algorithm, thus we can take advantage of the graphic processor units (GPUs) to produce interactively obstacle-free paths in the form of an Optimal Policy. We also propose a hexagonal grid navigation space, that reduces the cardinality of the MDP state set. We present a performance analysis of our technique using embedded systems, desktop CPU and GPUs and its application in crowd simulation. Our GPU algorithm presents 90x speed up in desktop platforms, and 30x speed up in embedded systems in contrast with its CPU multi-threaded version.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI)

自引率

0.00%

发文量