Learning to Allocate Time-Bound and Dynamic Tasks to Multiple Robots using Covariant Attention Neural Networks

Journal of Computing and Information Science in Engineering Pub Date : 2024-07-03 DOI:10.1115/1.4065883

Steve Paul, Souma Chowdhury

{"title":"Learning to Allocate Time-Bound and Dynamic Tasks to Multiple Robots using Covariant Attention Neural Networks","authors":"Steve Paul, Souma Chowdhury","doi":"10.1115/1.4065883","DOIUrl":null,"url":null,"abstract":"\n In various applications of multi-robotics in disaster response, warehouse management, and manufacturing, tasks that are known apriori and tasks added during runtime need to be assigned efficiently and without conflicts to robots in the team. This multi-robot task allocation (MRTA) process presents itself as a combinatorial optimization (CO) problem that is usually challenging to be solved in meaningful timescales using typical (mixed)integer (non)linear programming tools. Building on a growing body of work in using graph reinforcement learning to learn search heuristics for such complex CO problems, this paper presents a new graph neural network architecture called the Covariant Attention Mechanism (CAM). CAM can not only generalize but also scale to larger problems than that encountered in training, and handle dynamic tasks. This architecture combines the concept of Covariant Compositional Networks used here to embed the local structures in task graphs, with a context module that encodes the robots' states. The encoded information is passed onto a decoder designed using Multi-head Attention mechanism. When applied to a class of MRTA problems with time deadlines, robot ferry range constraints, and multi-trip settings, CAM surpasses a state-of-art graph learning approach based on the attention mechanism, as well as a feasible random-walk baseline across various generalizability and scalability tests. Performance of CAM is also found to be at par with a high-performing non-learning baseline called BiG-MRTA, while noting up to a 70-fold improvement in decision-making efficiency over this baseline.","PeriodicalId":502719,"journal":{"name":"Journal of Computing and Information Science in Engineering","volume":"2 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computing and Information Science in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/1.4065883","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In various applications of multi-robotics in disaster response, warehouse management, and manufacturing, tasks that are known apriori and tasks added during runtime need to be assigned efficiently and without conflicts to robots in the team. This multi-robot task allocation (MRTA) process presents itself as a combinatorial optimization (CO) problem that is usually challenging to be solved in meaningful timescales using typical (mixed)integer (non)linear programming tools. Building on a growing body of work in using graph reinforcement learning to learn search heuristics for such complex CO problems, this paper presents a new graph neural network architecture called the Covariant Attention Mechanism (CAM). CAM can not only generalize but also scale to larger problems than that encountered in training, and handle dynamic tasks. This architecture combines the concept of Covariant Compositional Networks used here to embed the local structures in task graphs, with a context module that encodes the robots' states. The encoded information is passed onto a decoder designed using Multi-head Attention mechanism. When applied to a class of MRTA problems with time deadlines, robot ferry range constraints, and multi-trip settings, CAM surpasses a state-of-art graph learning approach based on the attention mechanism, as well as a feasible random-walk baseline across various generalizability and scalability tests. Performance of CAM is also found to be at par with a high-performing non-learning baseline called BiG-MRTA, while noting up to a 70-fold improvement in decision-making efficiency over this baseline.

查看原文本刊更多论文

利用共变注意神经网络学习为多个机器人分配有时限的动态任务

在灾难响应、仓库管理和生产制造中的各种多机器人应用中，需要将事先已知的任务和运行时添加的任务高效、无冲突地分配给团队中的机器人。这种多机器人任务分配（MRTA）过程本身就是一个组合优化（CO）问题，通常很难使用典型的（混合）整数（非）线性编程工具在有意义的时间尺度内加以解决。利用图强化学习来学习搜索启发式方法以解决此类复杂的组合优化问题的工作越来越多，在此基础上，本文提出了一种新的图神经网络架构，称为 "变量注意机制"（Covariant Attention Mechanism，CAM）。CAM 不仅可以泛化，还可以扩展到比训练中遇到的更大的问题，并能处理动态任务。该架构将用于嵌入任务图中局部结构的共变构成网络概念与编码机器人状态的上下文模块相结合。编码后的信息被传递到利用多头注意力机制设计的解码器上。当应用于一类具有时间期限、机器人摆渡范围限制和多行程设置的 MRTA 问题时，CAM 在各种通用性和可扩展性测试中超越了基于注意力机制的最先进图学习方法，以及可行的随机行走基线。研究还发现，CAM 的性能与名为 BiG-MRTA 的高性能非学习基线相当，同时决策效率比该基线提高了 70 倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Computing and Information Science in Engineering

自引率

0.00%

发文量