Learning to Allocate Time-Bound and Dynamic Tasks to Multiple Robots using Covariant Attention Neural Networks

Steve Paul, Souma Chowdhury
{"title":"Learning to Allocate Time-Bound and Dynamic Tasks to Multiple Robots using Covariant Attention Neural Networks","authors":"Steve Paul, Souma Chowdhury","doi":"10.1115/1.4065883","DOIUrl":null,"url":null,"abstract":"\n In various applications of multi-robotics in disaster response, warehouse management, and manufacturing, tasks that are known apriori and tasks added during runtime need to be assigned efficiently and without conflicts to robots in the team. This multi-robot task allocation (MRTA) process presents itself as a combinatorial optimization (CO) problem that is usually challenging to be solved in meaningful timescales using typical (mixed)integer (non)linear programming tools. Building on a growing body of work in using graph reinforcement learning to learn search heuristics for such complex CO problems, this paper presents a new graph neural network architecture called the Covariant Attention Mechanism (CAM). CAM can not only generalize but also scale to larger problems than that encountered in training, and handle dynamic tasks. This architecture combines the concept of Covariant Compositional Networks used here to embed the local structures in task graphs, with a context module that encodes the robots' states. The encoded information is passed onto a decoder designed using Multi-head Attention mechanism. When applied to a class of MRTA problems with time deadlines, robot ferry range constraints, and multi-trip settings, CAM surpasses a state-of-art graph learning approach based on the attention mechanism, as well as a feasible random-walk baseline across various generalizability and scalability tests. Performance of CAM is also found to be at par with a high-performing non-learning baseline called BiG-MRTA, while noting up to a 70-fold improvement in decision-making efficiency over this baseline.","PeriodicalId":502719,"journal":{"name":"Journal of Computing and Information Science in Engineering","volume":"2 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computing and Information Science in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/1.4065883","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In various applications of multi-robotics in disaster response, warehouse management, and manufacturing, tasks that are known apriori and tasks added during runtime need to be assigned efficiently and without conflicts to robots in the team. This multi-robot task allocation (MRTA) process presents itself as a combinatorial optimization (CO) problem that is usually challenging to be solved in meaningful timescales using typical (mixed)integer (non)linear programming tools. Building on a growing body of work in using graph reinforcement learning to learn search heuristics for such complex CO problems, this paper presents a new graph neural network architecture called the Covariant Attention Mechanism (CAM). CAM can not only generalize but also scale to larger problems than that encountered in training, and handle dynamic tasks. This architecture combines the concept of Covariant Compositional Networks used here to embed the local structures in task graphs, with a context module that encodes the robots' states. The encoded information is passed onto a decoder designed using Multi-head Attention mechanism. When applied to a class of MRTA problems with time deadlines, robot ferry range constraints, and multi-trip settings, CAM surpasses a state-of-art graph learning approach based on the attention mechanism, as well as a feasible random-walk baseline across various generalizability and scalability tests. Performance of CAM is also found to be at par with a high-performing non-learning baseline called BiG-MRTA, while noting up to a 70-fold improvement in decision-making efficiency over this baseline.
利用共变注意神经网络学习为多个机器人分配有时限的动态任务
在灾难响应、仓库管理和生产制造中的各种多机器人应用中,需要将事先已知的任务和运行时添加的任务高效、无冲突地分配给团队中的机器人。这种多机器人任务分配(MRTA)过程本身就是一个组合优化(CO)问题,通常很难使用典型的(混合)整数(非)线性编程工具在有意义的时间尺度内加以解决。利用图强化学习来学习搜索启发式方法以解决此类复杂的组合优化问题的工作越来越多,在此基础上,本文提出了一种新的图神经网络架构,称为 "变量注意机制"(Covariant Attention Mechanism,CAM)。CAM 不仅可以泛化,还可以扩展到比训练中遇到的更大的问题,并能处理动态任务。该架构将用于嵌入任务图中局部结构的共变构成网络概念与编码机器人状态的上下文模块相结合。编码后的信息被传递到利用多头注意力机制设计的解码器上。当应用于一类具有时间期限、机器人摆渡范围限制和多行程设置的 MRTA 问题时,CAM 在各种通用性和可扩展性测试中超越了基于注意力机制的最先进图学习方法,以及可行的随机行走基线。研究还发现,CAM 的性能与名为 BiG-MRTA 的高性能非学习基线相当,同时决策效率比该基线提高了 70 倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信