Reinforcement learning–based task allocation and path-finding in multi-robot systems under environment uncertainty

IF 9.1 1区工程技术 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Computer-Aided Civil and Infrastructure Engineering Pub Date : 2025-06-15 DOI:10.1111/mice.13535

Songjun Huang, Chuanneng Sun, Jie Gong, Dario Pompili

{"title":"Reinforcement learning–based task allocation and path-finding in multi-robot systems under environment uncertainty","authors":"Songjun Huang, Chuanneng Sun, Jie Gong, Dario Pompili","doi":"10.1111/mice.13535","DOIUrl":null,"url":null,"abstract":"Autonomous robots have the potential to significantly improve the operational efficiency of multi-robot systems (MRSs) under environment uncertainties. Achieving robust performance in these settings requires effective task allocation and adaptive path-finding. However, conventional model-based frameworks often rely on centralized control or global information, making them impractical when communication is intermittent or maps are unavailable. Although recent studies have shown that reinforcement learning (RL)-based frameworks offer improved performance, problems related to synchronization and adaptability in diverse environments remain unresolved. To address these problems, this study proposes the “RL-based Task-Allocation and Path-Finding under Uncertainty (RL-TAPU)” framework. This framework incorporates an Action-Selective Double-Q-Learning (ASDQ) algorithm for real-time task allocation and a Context-Aware Meta-Q-Learning (CA-MQL) algorithm for adaptive path-finding. Unlike previous RL-based frameworks, RL-TAPU is designed to operate without global maps, uses only local state information, and functions reliably under intermittent and low-bandwidth communication conditions. The task allocator communicates only minimal information, and the path-finding component adapts to new environments without the need for complete environmental data. Experimental results show that the RL-TAPU framework achieves better adaptability and works more efficiently with a shorter total execution time than competitors.","PeriodicalId":156,"journal":{"name":"Computer-Aided Civil and Infrastructure Engineering","volume":"40 22","pages":"3408-3429"},"PeriodicalIF":9.1000,"publicationDate":"2025-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/mice.13535","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer-Aided Civil and Infrastructure Engineering","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/mice.13535","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Autonomous robots have the potential to significantly improve the operational efficiency of multi-robot systems (MRSs) under environment uncertainties. Achieving robust performance in these settings requires effective task allocation and adaptive path-finding. However, conventional model-based frameworks often rely on centralized control or global information, making them impractical when communication is intermittent or maps are unavailable. Although recent studies have shown that reinforcement learning (RL)-based frameworks offer improved performance, problems related to synchronization and adaptability in diverse environments remain unresolved. To address these problems, this study proposes the “RL-based Task-Allocation and Path-Finding under Uncertainty (RL-TAPU)” framework. This framework incorporates an Action-Selective Double-Q-Learning (ASDQ) algorithm for real-time task allocation and a Context-Aware Meta-Q-Learning (CA-MQL) algorithm for adaptive path-finding. Unlike previous RL-based frameworks, RL-TAPU is designed to operate without global maps, uses only local state information, and functions reliably under intermittent and low-bandwidth communication conditions. The task allocator communicates only minimal information, and the path-finding component adapts to new environments without the need for complete environmental data. Experimental results show that the RL-TAPU framework achieves better adaptability and works more efficiently with a shorter total execution time than competitors.

Abstract Image

查看原文本刊更多论文

环境不确定性下基于强化学习的多机器人系统任务分配与寻径

自主机器人具有显著提高多机器人系统（MRSs）在环境不确定性下的运行效率的潜力。在这些设置中实现稳健的性能需要有效的任务分配和自适应寻径。然而，传统的基于模型的框架通常依赖于集中控制或全局信息，这使得它们在通信时断时续或无法获得地图时不切实际。尽管最近的研究表明，基于强化学习（RL）的框架提供了改进的性能，但与不同环境中的同步和适应性相关的问题仍未解决。为了解决这些问题，本研究提出了“基于RL的任务分配和不确定性下的路径查找（RL - TAPU）”框架。该框架结合了用于实时任务分配的动作选择性双Q学习（ASDQ）算法和用于自适应寻路的上下文感知元Q学习（CA - MQL）算法。与之前基于RL的框架不同，RL - TAPU的设计无需全局地图，仅使用本地状态信息，并且在间歇性和低带宽通信条件下可靠地运行。任务分配器只传递最少的信息，寻径组件无需完整的环境数据即可适应新环境。实验结果表明，RL - TAPU框架比竞争对手具有更好的适应性和更高的工作效率，总执行时间更短。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computer-Aided Civil and Infrastructure Engineering 工程技术-工程：土木

CiteScore

17.60

自引率

19.80%

发文量

146

审稿时长

1 months

期刊介绍： Computer-Aided Civil and Infrastructure Engineering stands as a scholarly, peer-reviewed archival journal, serving as a vital link between advancements in computer technology and civil and infrastructure engineering. The journal serves as a distinctive platform for the publication of original articles, spotlighting novel computational techniques and inventive applications of computers. Specifically, it concentrates on recent progress in computer and information technologies, fostering the development and application of emerging computing paradigms. Encompassing a broad scope, the journal addresses bridge, construction, environmental, highway, geotechnical, structural, transportation, and water resources engineering. It extends its reach to the management of infrastructure systems, covering domains such as highways, bridges, pavements, airports, and utilities. The journal delves into areas like artificial intelligence, cognitive modeling, concurrent engineering, database management, distributed computing, evolutionary computing, fuzzy logic, genetic algorithms, geometric modeling, internet-based technologies, knowledge discovery and engineering, machine learning, mobile computing, multimedia technologies, networking, neural network computing, optimization and search, parallel processing, robotics, smart structures, software engineering, virtual reality, and visualization techniques.