Underwater Multiple AUV Cooperative Target Tracking Based on Minimal Reward Participation-Embedded MARL

IF 7.7 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Transactions on Mobile Computing Pub Date : 2024-12-20 DOI:10.1109/TMC.2024.3521028

Shengchao Zhu;Guangjie Han;Chuan Lin;Fan Zhang

{"title":"Underwater Multiple AUV Cooperative Target Tracking Based on Minimal Reward Participation-Embedded MARL","authors":"Shengchao Zhu;Guangjie Han;Chuan Lin;Fan Zhang","doi":"10.1109/TMC.2024.3521028","DOIUrl":null,"url":null,"abstract":"Recently, the rapid advancement of Multi-Agent Reinforcement Learning (MARL) has introduced a new paradigm for intelligent underwater target tracking within Autonomous Underwater Vehicle (AUV) cluster networks, enabling these networks to intelligently collaborate in target tracking. However, the limited scalability of MARL poses significant challenges to the performance of AUV cluster networks in tracking tasks. Specifically, MARL models trained on a fixed agents lose their effectiveness when the agent count changes, underscoring the critical need to enhance MARL’s scalability to accommodate an arbitrary number of agents. This paper addresses the pressing issue of MARL’s scalability in the context of AUV cluster network-based target tracking. Specifically, we propose an Elastic Software-Defined Multi-Agent Reinforcement Learning (ESD-MARL) architecture to enhance the scalability of AUV cluster networks. Moreover, we propose an Incremental Multi-Agent Reinforcement Learning algorithm based on Minimal Reward Participation (IMARL-MRP) that allows for the expansion of the agents without retraining. By integrating the ESD-MARL with the IMARL-MRP, we propose an elastic underwater target tracking scheme, achieving high-performance target tracking with enhanced scalability. Evaluation results demonstrate that the proposed approach effectively enhances the scalability of MARL, enabling the arbitrary expansion of the AUV cluster network, thus supporting scalable and efficient underwater target tracking.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"24 5","pages":"4169-4182"},"PeriodicalIF":7.7000,"publicationDate":"2024-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Mobile Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10811865/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Recently, the rapid advancement of Multi-Agent Reinforcement Learning (MARL) has introduced a new paradigm for intelligent underwater target tracking within Autonomous Underwater Vehicle (AUV) cluster networks, enabling these networks to intelligently collaborate in target tracking. However, the limited scalability of MARL poses significant challenges to the performance of AUV cluster networks in tracking tasks. Specifically, MARL models trained on a fixed agents lose their effectiveness when the agent count changes, underscoring the critical need to enhance MARL’s scalability to accommodate an arbitrary number of agents. This paper addresses the pressing issue of MARL’s scalability in the context of AUV cluster network-based target tracking. Specifically, we propose an Elastic Software-Defined Multi-Agent Reinforcement Learning (ESD-MARL) architecture to enhance the scalability of AUV cluster networks. Moreover, we propose an Incremental Multi-Agent Reinforcement Learning algorithm based on Minimal Reward Participation (IMARL-MRP) that allows for the expansion of the agents without retraining. By integrating the ESD-MARL with the IMARL-MRP, we propose an elastic underwater target tracking scheme, achieving high-performance target tracking with enhanced scalability. Evaluation results demonstrate that the proposed approach effectively enhances the scalability of MARL, enabling the arbitrary expansion of the AUV cluster network, thus supporting scalable and efficient underwater target tracking.

查看原文本刊更多论文

基于最小奖励参与-嵌入式MARL的水下多AUV协同目标跟踪

近年来，多智能体强化学习（MARL）的快速发展为自主水下航行器（AUV）集群网络中的智能水下目标跟踪提供了一种新的范式，使这些网络能够智能地协同进行目标跟踪。然而，MARL有限的可扩展性给AUV集群网络在跟踪任务中的性能带来了重大挑战。具体来说，当代理数量发生变化时，在固定代理上训练的MARL模型会失去其有效性，这强调了增强MARL的可伸缩性以适应任意数量代理的迫切需要。本文研究了基于AUV集群网络的目标跟踪中MARL的可扩展性问题。具体来说，我们提出了一种弹性软件定义的多智能体强化学习（ESD-MARL）架构来增强AUV集群网络的可扩展性。此外，我们提出了一种基于最小奖励参与（IMARL-MRP）的增量多智能体强化学习算法，该算法允许在不进行再训练的情况下扩展智能体。通过将ESD-MARL与IMARL-MRP相结合，提出了一种弹性水下目标跟踪方案，实现了具有增强可扩展性的高性能目标跟踪。评估结果表明，该方法有效增强了MARL的可扩展性，实现了AUV集群网络的任意扩展，从而支持可扩展的、高效的水下目标跟踪。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Mobile Computing 工程技术-电信学

CiteScore

12.90

自引率

2.50%

发文量

403

审稿时长

6.6 months

期刊介绍： IEEE Transactions on Mobile Computing addresses key technical issues related to various aspects of mobile computing. This includes (a) architectures, (b) support services, (c) algorithm/protocol design and analysis, (d) mobile environments, (e) mobile communication systems, (f) applications, and (g) emerging technologies. Topics of interest span a wide range, covering aspects like mobile networks and hosts, mobility management, multimedia, operating system support, power management, online and mobile environments, security, scalability, reliability, and emerging technologies such as wearable computers, body area networks, and wireless sensor networks. The journal serves as a comprehensive platform for advancements in mobile computing research.