Avoiding Errors in Learned Heuristics in Bounded-Suboptimal Search

Symposium on Combinatorial Search Pub Date : 2022-07-17 DOI:10.1609/socs.v15i1.21804

M. Greco, Jorge A. Baier

引用次数: 0

Abstract

Despite being very effective, learned heuristics in bounded-suboptimal search can produce heuristic plateaus or move the search to zones of the state space that do not lead to a solution. In addition, it produces inadmissible cost-to-go estimates; therefore, it cannot be exploited with classical algorithms like WA* to produce w-optimal solutions. In this paper, we present two ways in which Focal Search can be modified to exploit a learned heuristic in a bounded suboptimal search: Focal Discrepancy Search, which, to evaluate each state, uses a discrepancy score based on the best-predicted heuristic value; and K-Focal Search, which expands more than one node from the FOCAL list in each expansion cycle. Both algorithms return w-optimal solutions and explore different zones of the state space than the ones that focal search, using the learned heuristic to sort the FOCAL list, would explore.

查看原文本刊更多论文

有界次优搜索中学习启发式算法的避免错误

尽管有界次优搜索中的学习启发式算法非常有效，但它可能会产生启发式平台，或者将搜索移动到状态空间中无法找到解决方案的区域。此外，它还会产生令人难以接受的预估成本;因此，它不能被像WA*这样的经典算法利用来产生w-最优解。在本文中，我们提出了两种方法，可以修改焦点搜索来利用有界次优搜索中的学习启发式:焦点差异搜索，它使用基于最佳预测启发式值的差异评分来评估每个状态;以及K-Focal Search，它在每个扩展周期中从FOCAL列表中扩展多个节点。这两种算法都返回w-最优解，并且探索不同于焦点搜索的状态空间区域，使用学习的启发式对焦点列表进行排序，将探索。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Symposium on Combinatorial Search

自引率

0.00%

发文量