Mengqi Wang , Rongshun Juan , Zezhong Li , Zhongke Gao
{"title":"Formation control and intention compensating of AUVs using multi-agent reinforcement learning and predict network","authors":"Mengqi Wang , Rongshun Juan , Zezhong Li , Zhongke Gao","doi":"10.1016/j.oceaneng.2025.122854","DOIUrl":null,"url":null,"abstract":"<div><div>Autonomous Underwater Vehicles (AUVs) have played an important role in numerous marine tasks, such as resource exploration, hydrological data acquisition, rescue operations, and military missions. In contrast to single AUV deployment, multi-AUV formations exhibit higher efficiency and improved task completion rates. Recently, multi-agent reinforcement learning (MARL) has emerged as a promising technique for AUV formation control. Nevertheless, conventional MARL approaches often suffer from instability in formation shapes, especially when managing a large number of AUVs. Additionally, communication delay and information dropout can further compromise formation performance. In this paper, we propose a novel method called Policy Compensate Multi-agent Twin Delayed Deep Deterministic Policy Gradient (PC-MATD3), which integrates imitation learning (IL) with MARL to improve formation stability. The proposed framework is designed to alleviate adverse effects caused by communication interruptions or information delays. We define distance and angular errors as key performance metrics and evaluate our method through two distinct simulation scenarios. Experimental results show that, under ideal communication conditions, our approach substantially reduces formation errors and improves overall stability. Additionally, in scenarios involving communication dropouts, the proposed method effectively predicts the positions of neighboring AUVs, enabling the restoration of the desired formation geometry.</div></div>","PeriodicalId":19403,"journal":{"name":"Ocean Engineering","volume":"342 ","pages":"Article 122854"},"PeriodicalIF":5.5000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ocean Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0029801825025375","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
Autonomous Underwater Vehicles (AUVs) have played an important role in numerous marine tasks, such as resource exploration, hydrological data acquisition, rescue operations, and military missions. In contrast to single AUV deployment, multi-AUV formations exhibit higher efficiency and improved task completion rates. Recently, multi-agent reinforcement learning (MARL) has emerged as a promising technique for AUV formation control. Nevertheless, conventional MARL approaches often suffer from instability in formation shapes, especially when managing a large number of AUVs. Additionally, communication delay and information dropout can further compromise formation performance. In this paper, we propose a novel method called Policy Compensate Multi-agent Twin Delayed Deep Deterministic Policy Gradient (PC-MATD3), which integrates imitation learning (IL) with MARL to improve formation stability. The proposed framework is designed to alleviate adverse effects caused by communication interruptions or information delays. We define distance and angular errors as key performance metrics and evaluate our method through two distinct simulation scenarios. Experimental results show that, under ideal communication conditions, our approach substantially reduces formation errors and improves overall stability. Additionally, in scenarios involving communication dropouts, the proposed method effectively predicts the positions of neighboring AUVs, enabling the restoration of the desired formation geometry.
期刊介绍:
Ocean Engineering provides a medium for the publication of original research and development work in the field of ocean engineering. Ocean Engineering seeks papers in the following topics.