Embodied Multi-Agent Task Planning from Ambiguous Instruction

Xinzhu Liu, Xinghang Li, Di Guo, Sinan Tan, Huaping Liu, F. Sun
{"title":"Embodied Multi-Agent Task Planning from Ambiguous Instruction","authors":"Xinzhu Liu, Xinghang Li, Di Guo, Sinan Tan, Huaping Liu, F. Sun","doi":"10.15607/rss.2022.xviii.032","DOIUrl":null,"url":null,"abstract":"—In human-robots collaboration scenarios, a human would give robots an instruction that is intuitive for the human himself to accomplish. However, the instruction given to robots is likely ambiguous for them to understand as some information is implicit in the instruction. Therefore, it is necessary for the robots to jointly reason the operation details and perform the embodied multi-agent task planning given the ambiguous instruction. This problem exhibits significant challenges in both language understanding and dynamic task planning with the perception information. In this work, an embodied multi-agent task planning framework is proposed to utilize external knowledge sources and dynamically perceived visual information to resolve the high-level instructions, and dynamically allocate the decomposed tasks to multiple agents. Furthermore, we utilize the semantic information to perform environment perception and generate sub-goals to achieve the navigation motion. This model effectively bridges the difference between the simulation environment and the physical environment, thus it can be simultaneously applied in both simulation and physical scenarios and avoid the notori- ous sim2real problem. Finally, we build a benchmark dataset to validate the embodied multi-agent task planning problem, which includes three types of high-level instructions in which some target objects are implicit in instructions. We perform the evaluation experiments on the simulation platform and in physical scenarios, demonstrating that the proposed model can achieve promising results for multi-agent collaborative tasks.","PeriodicalId":340265,"journal":{"name":"Robotics: Science and Systems XVIII","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics: Science and Systems XVIII","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15607/rss.2022.xviii.032","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

In human-robot collaboration scenarios, a human gives robots an instruction that would be intuitive for the human to accomplish themselves. However, such an instruction is often ambiguous for the robots because some of the required information is only implicit in the instruction. Therefore, the robots need to jointly reason about the operation details and perform embodied multi-agent task planning given the ambiguous instruction. This problem poses significant challenges in both language understanding and dynamic task planning with perceptual information. In this work, we propose an embodied multi-agent task planning framework that uses external knowledge sources and dynamically perceived visual information to resolve high-level instructions, and that dynamically allocates the decomposed sub-tasks to multiple agents. Furthermore, we use semantic information to perceive the environment and generate sub-goals for navigation. This design effectively bridges the gap between the simulation environment and the physical environment, so the model can be applied in both simulated and physical scenarios and avoids the notorious sim2real problem. Finally, we build a benchmark dataset for the embodied multi-agent task planning problem, containing three types of high-level instructions in which some target objects are left implicit. We conduct evaluation experiments on the simulation platform and in physical scenarios, demonstrating that the proposed model achieves promising results on multi-agent collaborative tasks.
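To make the two steps described above more concrete, the sketch below illustrates, in simplified form, how an ambiguous instruction slot might be resolved against an external knowledge source and the currently perceived objects, and how the resulting sub-tasks might be greedily allocated to multiple agents. This is a minimal, hypothetical illustration rather than the authors' implementation; all names (KNOWLEDGE_BASE, Agent, resolve_instruction, allocate_tasks) and the nearest-agent allocation rule are assumptions made for the example.

```python
# Hypothetical sketch of instruction resolution and task allocation.
# Not the paper's actual method; names and heuristics are illustrative.

from dataclasses import dataclass
from math import dist

# External knowledge: which concrete objects can satisfy an implicit category.
KNOWLEDGE_BASE = {
    "something to drink": ["cup", "bottle", "mug"],
    "something to cut with": ["knife", "scissors"],
}

@dataclass
class Agent:
    name: str
    position: tuple  # (x, y) in a shared map frame

def resolve_instruction(instruction_slots, perceived_objects):
    """Map each implicit slot in the instruction to a perceived object,
    using the knowledge base to decide which objects qualify."""
    sub_tasks = []
    for slot in instruction_slots:
        candidates = KNOWLEDGE_BASE.get(slot, [slot])
        for obj_name, obj_pos in perceived_objects.items():
            if obj_name in candidates:
                sub_tasks.append(("fetch", obj_name, obj_pos))
                break
    return sub_tasks

def allocate_tasks(sub_tasks, agents):
    """Greedy allocation: each sub-task goes to the currently closest agent."""
    assignment = {agent.name: [] for agent in agents}
    for task in sub_tasks:
        _, _, target_pos = task
        nearest = min(agents, key=lambda a: dist(a.position, target_pos))
        assignment[nearest.name].append(task)
        nearest.position = target_pos  # assume the agent ends at the target
    return assignment

if __name__ == "__main__":
    perceived = {"cup": (2.0, 1.0), "knife": (5.0, 4.0)}
    agents = [Agent("robot_1", (0.0, 0.0)), Agent("robot_2", (6.0, 4.0))]
    tasks = resolve_instruction(
        ["something to drink", "something to cut with"], perceived)
    print(allocate_tasks(tasks, agents))
```

In a full system, the greedy distance-based rule would be replaced by whatever dynamic allocation strategy the framework actually uses, and the knowledge base and perception would come from the external knowledge source and semantic perception modules described in the abstract.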