{"title":"Formation control of multiple AUVs using decentralized self-attention based soft actor–critic model","authors":"Meiyan Zhang , Ziqiang Liu , Wenyu Cai","doi":"10.1016/j.robot.2025.105187","DOIUrl":null,"url":null,"abstract":"<div><div>One of most important topics studied in the field of multiple robots is cooperative formation control. Formation structure is a combination in which agents maintain the desired form and at the same time execute the assigned commands. This article deals with the formation control of multiple Autonomous Underwater Vehicles (AUVs) based on a distributed reinforcement learning algorithm. The proposed Decentralized Self-Attention based Soft Actor–Critic (DEC-ASAC in short) method uses an attention mechanism and maximum entropy reinforcement learning control, so as to enable AUVs to learn formation control independently. The corresponding environmental states, action space and reward schemes are designed for leader and follower AUVs. Simulations and lake test verify that the proposed DEC-ASAC algorithm can stably and effectively learn control policies during training process, achieving effective control of multiple AUVs to keep different formation shapes.</div></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":"195 ","pages":"Article 105187"},"PeriodicalIF":5.2000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Autonomous Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0921889025002842","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
One of most important topics studied in the field of multiple robots is cooperative formation control. Formation structure is a combination in which agents maintain the desired form and at the same time execute the assigned commands. This article deals with the formation control of multiple Autonomous Underwater Vehicles (AUVs) based on a distributed reinforcement learning algorithm. The proposed Decentralized Self-Attention based Soft Actor–Critic (DEC-ASAC in short) method uses an attention mechanism and maximum entropy reinforcement learning control, so as to enable AUVs to learn formation control independently. The corresponding environmental states, action space and reward schemes are designed for leader and follower AUVs. Simulations and lake test verify that the proposed DEC-ASAC algorithm can stably and effectively learn control policies during training process, achieving effective control of multiple AUVs to keep different formation shapes.
期刊介绍:
Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems.
Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.