When to Commit to an Action in Online Planning and Search

Symposium on Combinatorial Search. Pub Date: 2022-07-17. DOI: 10.1609/socs.v15i1.21755

Tianyi Gu, Wheeler Ruml, Shahaf S. Shperberg, E. Shimony, E. Karpas

Citations: 1

Abstract

In online planning, search is concurrent with execution. Under the formulation of planning as heuristic search, when a planner commits to an action, it re-roots its search tree at the node representing the outcome of that action. For the system to remain controlled, the planner must commit to a new action (perhaps a no-op) before the previously chosen action completes. This time pressure results in a real-time search. In this time-bounded setting, it can be beneficial to commit early, in order to perform more lookahead search focused below an upcoming state. In this paper, we propose a principled method for making this commitment decision. Our experimental evaluation shows that our scheme can outperform previously proposed fixed strategies.
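
The re-rooting step described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's commitment method: the `Node` class, the `commit` function, and the state and action labels are all hypothetical names chosen for illustration, and the sketch only shows how committing to an action narrows the tree that later lookahead has to consider.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Node:
    """A search-tree node; `children` maps an action label to its outcome node."""
    state: str
    children: Dict[str, "Node"] = field(default_factory=dict)
    parent: Optional["Node"] = field(default=None, repr=False, compare=False)

    def add_child(self, action: str, state: str) -> "Node":
        child = Node(state=state, parent=self)
        self.children[action] = child
        return child


def count_nodes(root: Node) -> int:
    """Number of nodes in the subtree rooted at `root`."""
    return 1 + sum(count_nodes(c) for c in root.children.values())


def commit(root: Node, action: str) -> Node:
    """Commit to `action`: re-root the tree at that action's outcome node.

    Sibling subtrees become unreachable from the new root, so later
    lookahead is focused below the state the agent is about to reach.
    """
    new_root = root.children[action]
    new_root.parent = None  # detach from the old root
    return new_root


# Tiny hand-built tree (hypothetical states and actions).
root = Node("s0")
left = root.add_child("left", "s1")
right = root.add_child("right", "s2")
left.add_child("left", "s3")
left.add_child("right", "s4")
right.add_child("left", "s5")

size_before = count_nodes(root)  # whole tree: s0..s5
root = commit(root, "left")      # re-root at s1
size_after = count_nodes(root)   # only s1, s3, s4 remain reachable
```

Committing early discards the sibling subtree sooner, which frees the remaining search time for nodes below the committed state. The cost is that the discarded alternatives can no longer be chosen, and that is the trade-off the commitment decision has to weigh.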
