Songjun Huang, Chuanneng Sun, Jie Gong, Dario Pompili
{"title":"Reinforcement learning–based task allocation and path‐finding in multi‐robot systems under environment uncertainty","authors":"Songjun Huang, Chuanneng Sun, Jie Gong, Dario Pompili","doi":"10.1111/mice.13535","DOIUrl":null,"url":null,"abstract":"Autonomous robots have the potential to significantly improve the operational efficiency of multi‐robot systems (MRSs) under environment uncertainties. Achieving robust performance in these settings requires effective task allocation and adaptive path‐finding. However, conventional model‐based frameworks often rely on centralized control or global information, making them impractical when communication is intermittent or maps are unavailable. Although recent studies have shown that reinforcement learning (RL)‐based frameworks offer improved performance, problems related to synchronization and adaptability in diverse environments remain unresolved. To address these problems, this study proposes the “RL‐based Task‐Allocation and Path‐Finding under Uncertainty (RL‐TAPU)” framework. This framework incorporates an Action‐Selective Double‐Q‐Learning (ASDQ) algorithm for real‐time task allocation and a Context‐Aware Meta‐Q‐Learning (CA‐MQL) algorithm for adaptive path‐finding. Unlike previous RL‐based frameworks, RL‐TAPU is designed to operate without global maps, uses only local state information, and functions reliably under intermittent and low‐bandwidth communication conditions. The task allocator communicates only minimal information, and the path‐finding component adapts to new environments without the need for complete environmental data. Experimental results show that the RL‐TAPU framework achieves better adaptability and works more efficiently with a shorter total execution time than competitors.","PeriodicalId":156,"journal":{"name":"Computer-Aided Civil and Infrastructure Engineering","volume":"13 1","pages":""},"PeriodicalIF":8.5000,"publicationDate":"2025-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer-Aided Civil and Infrastructure Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1111/mice.13535","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Autonomous robots have the potential to significantly improve the operational efficiency of multi‐robot systems (MRSs) under environment uncertainties. Achieving robust performance in these settings requires effective task allocation and adaptive path‐finding. However, conventional model‐based frameworks often rely on centralized control or global information, making them impractical when communication is intermittent or maps are unavailable. Although recent studies have shown that reinforcement learning (RL)‐based frameworks offer improved performance, problems related to synchronization and adaptability in diverse environments remain unresolved. To address these problems, this study proposes the “RL‐based Task‐Allocation and Path‐Finding under Uncertainty (RL‐TAPU)” framework. This framework incorporates an Action‐Selective Double‐Q‐Learning (ASDQ) algorithm for real‐time task allocation and a Context‐Aware Meta‐Q‐Learning (CA‐MQL) algorithm for adaptive path‐finding. Unlike previous RL‐based frameworks, RL‐TAPU is designed to operate without global maps, uses only local state information, and functions reliably under intermittent and low‐bandwidth communication conditions. The task allocator communicates only minimal information, and the path‐finding component adapts to new environments without the need for complete environmental data. Experimental results show that the RL‐TAPU framework achieves better adaptability and works more efficiently with a shorter total execution time than competitors.
期刊介绍:
Computer-Aided Civil and Infrastructure Engineering stands as a scholarly, peer-reviewed archival journal, serving as a vital link between advancements in computer technology and civil and infrastructure engineering. The journal serves as a distinctive platform for the publication of original articles, spotlighting novel computational techniques and inventive applications of computers. Specifically, it concentrates on recent progress in computer and information technologies, fostering the development and application of emerging computing paradigms.
Encompassing a broad scope, the journal addresses bridge, construction, environmental, highway, geotechnical, structural, transportation, and water resources engineering. It extends its reach to the management of infrastructure systems, covering domains such as highways, bridges, pavements, airports, and utilities. The journal delves into areas like artificial intelligence, cognitive modeling, concurrent engineering, database management, distributed computing, evolutionary computing, fuzzy logic, genetic algorithms, geometric modeling, internet-based technologies, knowledge discovery and engineering, machine learning, mobile computing, multimedia technologies, networking, neural network computing, optimization and search, parallel processing, robotics, smart structures, software engineering, virtual reality, and visualization techniques.