Few-shot Subgoal Planning with Language Models

North American Chapter of the Association for Computational Linguistics Pub Date : 2022-05-28 DOI:10.48550/arXiv.2205.14288

Lajanugen Logeswaran, Yao Fu, Moontae Lee, Honglak Lee

引用次数: 16

Abstract

Pre-trained language models have shown successful progress in many text understanding benchmarks. This work explores the capability of these models to predict actionable plans in real-world environments. Given a text instruction, we show that language priors encoded in pre-trained models allow us to infer fine-grained subgoal sequences. In contrast to recent methods which make strong assumptions about subgoal supervision, our experiments show that language models can infer detailed subgoal sequences from few training sequences without any fine-tuning. We further propose a simple strategy to re-rank language model predictions based on interaction and feedback from the environment. Combined with pre-trained navigation and visual reasoning components, our approach demonstrates competitive performance on subgoal prediction and task completion in the ALFRED benchmark compared to prior methods that assume more subgoal supervision.

查看原文本刊更多论文

基于语言模型的次目标规划

预先训练的语言模型在许多文本理解基准中显示出成功的进展。这项工作探索了这些模型在现实环境中预测可操作计划的能力。给定文本指令，我们展示了在预训练模型中编码的语言先验，使我们能够推断出细粒度的子目标序列。与最近的方法对子目标监督做了很强的假设相反，我们的实验表明，语言模型可以从很少的训练序列中推断出详细的子目标序列，而无需进行任何微调。我们进一步提出了一种简单的策略，基于环境的交互和反馈对语言模型预测进行重新排序。结合预先训练的导航和视觉推理组件，我们的方法在ALFRED基准测试中展示了与先前假设更多子目标监督的方法相比，在子目标预测和任务完成方面的竞争力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

North American Chapter of the Association for Computational Linguistics

自引率

0.00%

发文量