{"title":"Mobile robotics planning using abstract Markov decision processes","authors":"Pierre Laroche, F. Charpillet, R. Schott","doi":"10.1109/TAI.1999.809804","DOIUrl":null,"url":null,"abstract":"Markov decision processes have been successfully used in robotics for indoor robot navigation problems. They allow the computation of optimal sequences of actions in order to achieve a given goal, accounting for actuator uncertainties. However, MDPs are unsatisfactory at avoiding unknown obstacles. On the other hand, reactive navigators are particularly adapted to that, and don't need any prior knowledge about the environment, but they are unable to plan the set of actions that will permit the realization of a given mission. We present a new state aggregation technique for Markov decision processes, such that part of the work usually dedicated to the planner is achieved by a reactive navigator. Thus some characteristics of our environments, such as the width of corridors, have not been considered, which allows to cluster states together, significantly reducing the state space. As a consequence, policies are computed faster and are shown to be at least as efficient as optimal ones.","PeriodicalId":194023,"journal":{"name":"Proceedings 11th International Conference on Tools with Artificial Intelligence","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 11th International Conference on Tools with Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TAI.1999.809804","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 13
Abstract
Markov decision processes have been used successfully in robotics for indoor robot navigation problems. They allow the computation of optimal sequences of actions to achieve a given goal while accounting for actuator uncertainties. However, MDPs perform poorly at avoiding unknown obstacles. Reactive navigators, on the other hand, are particularly well suited to obstacle avoidance and require no prior knowledge of the environment, but they are unable to plan the sequence of actions needed to accomplish a given mission. We present a new state aggregation technique for Markov decision processes in which part of the work usually dedicated to the planner is delegated to a reactive navigator. As a result, some characteristics of our environments, such as the width of corridors, need not be modeled, which allows states to be clustered together and significantly reduces the state space. As a consequence, policies are computed faster and are shown to be at least as efficient as optimal ones.
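To illustrate the idea, here is a minimal sketch (not the authors' implementation) of value iteration on an abstract MDP whose states are topological regions (rooms, corridors) rather than grid cells. The region names, transition model, and the 0.9/0.1 actuator-uncertainty split are all assumptions for illustration; in the paper's scheme, a reactive navigator would handle in-region motion and obstacle avoidance, so details like corridor width never enter the model.

```python
# Minimal value iteration over a hypothetical abstract (aggregated) state space.
# States are regions of an indoor environment, not individual grid cells.

STATES = ["room_A", "corridor_1", "corridor_2", "room_B"]
GOAL = "room_B"
ACTIONS = ["forward", "back"]

# P[s][a] -> list of (next_state, probability).
# The 0.9/0.1 split is an assumed actuator-uncertainty model.
P = {
    "room_A":     {"forward": [("corridor_1", 0.9), ("room_A", 0.1)],
                   "back":    [("room_A", 1.0)]},
    "corridor_1": {"forward": [("corridor_2", 0.9), ("corridor_1", 0.1)],
                   "back":    [("room_A", 0.9), ("corridor_1", 0.1)]},
    "corridor_2": {"forward": [("room_B", 0.9), ("corridor_2", 0.1)],
                   "back":    [("corridor_1", 0.9), ("corridor_2", 0.1)]},
    "room_B":     {"forward": [("room_B", 1.0)],
                   "back":    [("room_B", 1.0)]},
}

def reward(s):
    # Unit reward at the goal region, zero elsewhere (an assumption).
    return 1.0 if s == GOAL else 0.0

def value_iteration(gamma=0.95, eps=1e-6):
    V = {s: 0.0 for s in STATES}
    while True:
        delta = 0.0
        for s in STATES:
            # Bellman backup over the small abstract state space.
            best = max(sum(p * (reward(s2) + gamma * V[s2])
                           for s2, p in P[s][a]) for a in ACTIONS)
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < eps:
            break
    # Greedy policy over abstract states; a reactive navigator would
    # execute each abstract action while avoiding local obstacles.
    policy = {s: max(ACTIONS,
                     key=lambda a: sum(p * (reward(s2) + gamma * V[s2])
                                       for s2, p in P[s][a]))
              for s in STATES}
    return V, policy

if __name__ == "__main__":
    V, pi = value_iteration()
    for s in STATES:
        print(f"{s}: V={V[s]:.3f}, action={pi[s]}")
```

Because the backup runs over four abstract regions instead of every grid cell those regions contain, each sweep is far cheaper, which is the source of the speed-up the abstract claims.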