Transformer-based Localization from Embodied Dialog with Large-scale Pre-training

Q3 Environmental Science

AACL Bioflux Pub Date : 2022-10-10 DOI:10.48550/arXiv.2210.04864

Meera Hahn, James M. Rehg

引用次数: 1

Abstract

We address the challenging task of Localization via Embodied Dialog (LED). Given a dialog from two agents, an Observer navigating through an unknown environment and a Locator who is attempting to identify the Observer’s location, the goal is to predict the Observer’s final location in a map. We develop a novel LED-Bert architecture and present an effective pretraining strategy. We show that a graph-based scene representation is more effective than the top-down 2D maps used in prior works. Our approach outperforms previous baselines.

查看原文本刊更多论文

基于变压器的大规模预训练嵌入对话定位

我们通过嵌入式对话(LED)解决了本地化的挑战性任务。给定来自两个代理的对话，一个是在未知环境中导航的观察者，另一个是试图识别观察者位置的定位器，目标是预测观察者在地图上的最终位置。我们开发了一种新的LED-Bert架构，并提出了一种有效的预训练策略。我们表明，基于图形的场景表示比先前工作中使用的自上而下的2D地图更有效。我们的方法优于以前的基线。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

AACL Bioflux Environmental Science-Management, Monitoring, Policy and Law

CiteScore

1.40

自引率

0.00%

发文量