Policy Rollout Action Selection with Knowledge Gradient for Sensor Path Planning

2021 IEEE 24th International Conference on Information Fusion (FUSION) Pub Date : 2021-11-01 DOI:10.23919/fusion49465.2021.9626874

Thore Gerlach, Folker Hoffmann, A. Charlish

引用次数: 3

Abstract

This paper considers the problem of finding the best action in a policy rollout algorithm. Policy rollout is an online computation method used in approximate dynamic programming. We applied two different versions of the knowledge gradient (KG) policy to a sensor path planning problem. The goal of this problem is to localize an emitter using only bearing measurements. To the authors’ knowledge, this was the first time the KG was applied in a policy rollout context. The performance of the KG policy was found to be comparable with methods used in prior work while also having a potentially wider applicability.

查看原文本刊更多论文

基于知识梯度的传感器路径规划策略展开动作选择

本文研究了策略展开算法中寻找最佳行动的问题。策略rollout是一种用于近似动态规划的在线计算方法。我们将两种不同版本的知识梯度(KG)策略应用于传感器路径规划问题。该问题的目标是仅使用方位测量来定位发射器。据作者所知，这是KG首次在政策推出上下文中应用。发现KG策略的性能与先前工作中使用的方法相当，同时也具有潜在的更广泛的适用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE 24th International Conference on Information Fusion (FUSION)

自引率

0.00%

发文量