部分可观察马尔可夫决策过程与机器人

Annu. Rev. Control. Robotics Auton. Syst. Pub Date : 2022-01-05 DOI:10.1146/annurev-control-042920-092451

H. Kurniawati

{"title":"部分可观察马尔可夫决策过程与机器人","authors":"H. Kurniawati","doi":"10.1146/annurev-control-042920-092451","DOIUrl":null,"url":null,"abstract":"Planning under uncertainty is critical to robotics. The partially observable Markov decision process (POMDP) is a mathematical framework for such planning problems. POMDPs are powerful because of their careful quantification of the nondeterministic effects of actions and the partial observability of the states. But for the same reason, they are notorious for their high computational complexity and have been deemed impractical for robotics. However, over the past two decades, the development of sampling-based approximate solvers has led to tremendous advances in POMDP-solving capabilities. Although these solvers do not generate the optimal solution, they can compute good POMDP solutions that significantly improve the robustness of robotics systems within reasonable computational resources, thereby making POMDPs practical for many realistic robotics problems. This article presents a review of POMDPs, emphasizing computational issues that have hindered their practicality in robotics and ideas in sampling-based solvers that have alleviated such difficulties, together with lessons learned from applying POMDPs to physical robots. Expected final online publication date for the Annual Review of Control, Robotics, and Autonomous Systems, Volume 5 is May 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.","PeriodicalId":147293,"journal":{"name":"Annu. Rev. Control. Robotics Auton. Syst.","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"38","resultStr":"{\"title\":\"Partially Observable Markov Decision Processes and Robotics\",\"authors\":\"H. Kurniawati\",\"doi\":\"10.1146/annurev-control-042920-092451\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Planning under uncertainty is critical to robotics. The partially observable Markov decision process (POMDP) is a mathematical framework for such planning problems. POMDPs are powerful because of their careful quantification of the nondeterministic effects of actions and the partial observability of the states. But for the same reason, they are notorious for their high computational complexity and have been deemed impractical for robotics. However, over the past two decades, the development of sampling-based approximate solvers has led to tremendous advances in POMDP-solving capabilities. Although these solvers do not generate the optimal solution, they can compute good POMDP solutions that significantly improve the robustness of robotics systems within reasonable computational resources, thereby making POMDPs practical for many realistic robotics problems. This article presents a review of POMDPs, emphasizing computational issues that have hindered their practicality in robotics and ideas in sampling-based solvers that have alleviated such difficulties, together with lessons learned from applying POMDPs to physical robots. Expected final online publication date for the Annual Review of Control, Robotics, and Autonomous Systems, Volume 5 is May 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.\",\"PeriodicalId\":147293,\"journal\":{\"name\":\"Annu. Rev. Control. Robotics Auton. Syst.\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"38\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annu. Rev. Control. Robotics Auton. Syst.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1146/annurev-control-042920-092451\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annu. Rev. Control. Robotics Auton. Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1146/annurev-control-042920-092451","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 38

摘要

不确定性下的规划是机器人技术的关键。部分可观察马尔可夫决策过程(POMDP)是解决这类规划问题的数学框架。pomdp是强大的，因为它们对行动的不确定性影响和状态的部分可观察性进行了仔细的量化。但出于同样的原因，它们因其高计算复杂性而臭名昭著，并且被认为对机器人来说不切实际。然而，在过去的二十年中，基于采样的近似求解器的发展导致了pomdp求解能力的巨大进步。尽管这些求解器不能生成最优解，但它们可以在合理的计算资源范围内计算出良好的POMDP解，从而显著提高机器人系统的鲁棒性，从而使POMDP在许多现实机器人问题中具有实用性。本文介绍了pomdp的回顾，强调了阻碍其在机器人技术中的实用性的计算问题，以及基于采样的求解器的想法，这些想法减轻了这些困难，以及将pomdp应用于物理机器人的经验教训。预计《控制、机器人和自主系统年度评论》第5卷的最终在线出版日期是2022年5月。修订后的估计数请参阅http://www.annualreviews.org/page/journal/pubdates。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Partially Observable Markov Decision Processes and Robotics

Planning under uncertainty is critical to robotics. The partially observable Markov decision process (POMDP) is a mathematical framework for such planning problems. POMDPs are powerful because of their careful quantification of the nondeterministic effects of actions and the partial observability of the states. But for the same reason, they are notorious for their high computational complexity and have been deemed impractical for robotics. However, over the past two decades, the development of sampling-based approximate solvers has led to tremendous advances in POMDP-solving capabilities. Although these solvers do not generate the optimal solution, they can compute good POMDP solutions that significantly improve the robustness of robotics systems within reasonable computational resources, thereby making POMDPs practical for many realistic robotics problems. This article presents a review of POMDPs, emphasizing computational issues that have hindered their practicality in robotics and ideas in sampling-based solvers that have alleviated such difficulties, together with lessons learned from applying POMDPs to physical robots. Expected final online publication date for the Annual Review of Control, Robotics, and Autonomous Systems, Volume 5 is May 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Annu. Rev. Control. Robotics Auton. Syst.

自引率

0.00%

发文量