A graph reinforcement learning framework for real-time distributed multi-robot task allocation

Q3 Earth and Planetary Sciences

Aerospace Systems Pub Date : 2025-01-23 DOI:10.1007/s42401-024-00334-w

Dian Zhang, Peng Dong, Pai Peng, Yubo Dong

{"title":"A graph reinforcement learning framework for real-time distributed multi-robot task allocation","authors":"Dian Zhang, Peng Dong, Pai Peng, Yubo Dong","doi":"10.1007/s42401-024-00334-w","DOIUrl":null,"url":null,"abstract":"<div><p>Dynamic multi-robot task allocation (MRTA) requires real-time responsiveness and adaptability to rapidly changing conditions. Existing methods, primarily based on static data and centralized architectures, often fail in dynamic environments that require decentralized, context-aware decisions. To address these challenges, this paper proposes a novel graph reinforcement learning (GRL) architecture, named Spatial-Temporal Fusing Reinforcement Learning (STFRL), to address real-time distributed target allocation problems in search and rescue scenarios. The proposed policy network includes an encoder, which employs a Temporal-Spatial Fusing Encoder (TSFE) to extract input features and a decoder uses multi-head attention (MHA) to perform distributed allocation based on the encoder’s output and context. The policy network is trained with the REINFORCE algorithm. Experimental comparisons with state-of-the-art baselines demonstrate that STFRL achieves superior performance in path cost, inference speed, and scalability, highlighting its robustness and efficiency in complex, dynamic environments.</p></div>","PeriodicalId":36309,"journal":{"name":"Aerospace Systems","volume":"8 1","pages":"105 - 116"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s42401-024-00334-w.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aerospace Systems","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s42401-024-00334-w","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}

引用次数: 0

Abstract

Dynamic multi-robot task allocation (MRTA) requires real-time responsiveness and adaptability to rapidly changing conditions. Existing methods, primarily based on static data and centralized architectures, often fail in dynamic environments that require decentralized, context-aware decisions. To address these challenges, this paper proposes a novel graph reinforcement learning (GRL) architecture, named Spatial-Temporal Fusing Reinforcement Learning (STFRL), to address real-time distributed target allocation problems in search and rescue scenarios. The proposed policy network includes an encoder, which employs a Temporal-Spatial Fusing Encoder (TSFE) to extract input features and a decoder uses multi-head attention (MHA) to perform distributed allocation based on the encoder’s output and context. The policy network is trained with the REINFORCE algorithm. Experimental comparisons with state-of-the-art baselines demonstrate that STFRL achieves superior performance in path cost, inference speed, and scalability, highlighting its robustness and efficiency in complex, dynamic environments.

查看原文本刊更多论文

面向实时分布式多机器人任务分配的图强化学习框架

动态多机器人任务分配（MRTA）需要实时响应和适应快速变化的条件。现有的方法主要基于静态数据和集中式架构，在需要分散的、上下文感知的决策的动态环境中往往会失败。为了解决这些挑战，本文提出了一种新的图强化学习（GRL）架构，称为时空融合强化学习（STFRL），以解决搜索和救援场景中的实时分布式目标分配问题。所提出的策略网络包括一个编码器，该编码器采用时空融合编码器（TSFE）提取输入特征，解码器使用多头注意（MHA）根据编码器的输出和上下文进行分布式分配。采用强化算法对策略网络进行训练。与最先进基线的实验比较表明，STFRL在路径成本、推理速度和可扩展性方面具有优越的性能，突出了其在复杂动态环境中的鲁棒性和效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Aerospace Systems Social Sciences-Social Sciences (miscellaneous)

CiteScore

1.80

自引率

0.00%

发文量

期刊介绍： Aerospace Systems provides an international, peer-reviewed forum which focuses on system-level research and development regarding aeronautics and astronautics. The journal emphasizes the unique role and increasing importance of informatics on aerospace. It fills a gap in current publishing coverage from outer space vehicles to atmospheric vehicles by highlighting interdisciplinary science, technology and engineering. Potential topics include, but are not limited to: Trans-space vehicle systems design and integration Air vehicle systems Space vehicle systems Near-space vehicle systems Aerospace robotics and unmanned system Communication, navigation and surveillance Aerodynamics and aircraft design Dynamics and control Aerospace propulsion Avionics system Opto-electronic system Air traffic management Earth observation Deep space exploration Bionic micro-aircraft/spacecraft Intelligent sensing and Information fusion