Toward Universal Embodied Planning in Scalable Heterogeneous Field Robots Collaboration and Control

IF 4.2 2区 计算机科学 Q2 ROBOTICS
Hanwen Wan, Yuhan Zhang, Junjie Wang, Donghao Wu, Mengkang Li, Xilun Chen, Yixuan Deng, Yuxuan Huang, Zhenglong Sun, Lin Zhang, Xiaoqiang Ji
{"title":"Toward Universal Embodied Planning in Scalable Heterogeneous Field Robots Collaboration and Control","authors":"Hanwen Wan,&nbsp;Yuhan Zhang,&nbsp;Junjie Wang,&nbsp;Donghao Wu,&nbsp;Mengkang Li,&nbsp;Xilun Chen,&nbsp;Yixuan Deng,&nbsp;Yuxuan Huang,&nbsp;Zhenglong Sun,&nbsp;Lin Zhang,&nbsp;Xiaoqiang Ji","doi":"10.1002/rob.22522","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Multi-robot systems offer substantial enhancements in efficiency, scalability, robustness, and flexibility for executing complex tasks through collaborative efforts. However, existing methodologies are constrained by their lack of generalizability, the need for extensive modeling, and most importantly, limitations in their applicability in complex scenarios. This paper presents a novel approach to multi-robot task planning and coordination, introducing a comprehensive pipeline encompassing data generation, supervised fine-tuning, and rigorous error analysis using the Multi-Robot collaboration Error Diagnostic (MRED) metrics. Bridging the gap between natural language commands and physical groundings in robot collaboration tasks, we present <i>MultiPlan</i>: the first data set specifically designed for LLM fine-tuning. The MultiPlan data set encompasses 100 distinct indoor and outdoor scenarios, ranging from office to garden. Experiments underscore the efficacy of the proposed methodology, including comparative analyses against state-of-the-art LLMs and generalization studies on previously unseen tasks. Results reveal that the fine-tuned model achieves a 24.8% relative improvement over the GPT-4 model in addressing complex multi-robot planning scenarios. We also conducted field evaluations in both office and urban settings to demonstrate the deployment performance of the proposed method. These results demonstrate the model's superior capabilities in task decomposition, error management, and adaptation to novel contexts.</p></div>","PeriodicalId":192,"journal":{"name":"Journal of Field Robotics","volume":"42 5","pages":"2318-2336"},"PeriodicalIF":4.2000,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Field Robotics","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rob.22522","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Multi-robot systems offer substantial enhancements in efficiency, scalability, robustness, and flexibility for executing complex tasks through collaborative efforts. However, existing methodologies are constrained by their lack of generalizability, the need for extensive modeling, and most importantly, limitations in their applicability in complex scenarios. This paper presents a novel approach to multi-robot task planning and coordination, introducing a comprehensive pipeline encompassing data generation, supervised fine-tuning, and rigorous error analysis using the Multi-Robot collaboration Error Diagnostic (MRED) metrics. Bridging the gap between natural language commands and physical groundings in robot collaboration tasks, we present MultiPlan: the first data set specifically designed for LLM fine-tuning. The MultiPlan data set encompasses 100 distinct indoor and outdoor scenarios, ranging from office to garden. Experiments underscore the efficacy of the proposed methodology, including comparative analyses against state-of-the-art LLMs and generalization studies on previously unseen tasks. Results reveal that the fine-tuned model achieves a 24.8% relative improvement over the GPT-4 model in addressing complex multi-robot planning scenarios. We also conducted field evaluations in both office and urban settings to demonstrate the deployment performance of the proposed method. These results demonstrate the model's superior capabilities in task decomposition, error management, and adaptation to novel contexts.

面向可扩展异构场机器人协同与控制的通用具体化规划
多机器人系统在效率、可扩展性、健壮性和灵活性方面提供了实质性的增强,可以通过协作来执行复杂的任务。然而,现有的方法由于缺乏通用性、需要广泛的建模以及最重要的是在复杂场景中的适用性方面的限制而受到限制。本文提出了一种多机器人任务规划和协调的新方法,引入了一个全面的管道,包括数据生成、监督微调和使用多机器人协作错误诊断(MRED)指标的严格错误分析。为了弥合机器人协作任务中自然语言命令和物理基础之间的差距,我们提出了MultiPlan:第一个专门为LLM微调设计的数据集。MultiPlan数据集包含100个不同的室内和室外场景,范围从办公室到花园。实验强调了所提出方法的有效性,包括与最先进的法学硕士的比较分析和对以前未见过的任务的概括研究。结果表明,与GPT-4模型相比,该模型在处理复杂的多机器人规划场景时实现了24.8%的相对改进。我们还在办公室和城市环境中进行了实地评估,以证明所建议方法的部署性能。这些结果证明了该模型在任务分解、错误管理和适应新环境方面的优越能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Field Robotics
Journal of Field Robotics 工程技术-机器人学
CiteScore
15.00
自引率
3.60%
发文量
80
审稿时长
6 months
期刊介绍: The Journal of Field Robotics seeks to promote scholarly publications dealing with the fundamentals of robotics in unstructured and dynamic environments. The Journal focuses on experimental robotics and encourages publication of work that has both theoretical and practical significance.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信