Formation control of multiple AUVs using decentralized self-attention based soft actor–critic model

IF 5.2 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Robotics and Autonomous Systems Pub Date : 2025-10-08 DOI:10.1016/j.robot.2025.105187

Meiyan Zhang , Ziqiang Liu , Wenyu Cai

引用次数: 0

Abstract

One of most important topics studied in the field of multiple robots is cooperative formation control. Formation structure is a combination in which agents maintain the desired form and at the same time execute the assigned commands. This article deals with the formation control of multiple Autonomous Underwater Vehicles (AUVs) based on a distributed reinforcement learning algorithm. The proposed Decentralized Self-Attention based Soft Actor–Critic (DEC-ASAC in short) method uses an attention mechanism and maximum entropy reinforcement learning control, so as to enable AUVs to learn formation control independently. The corresponding environmental states, action space and reward schemes are designed for leader and follower AUVs. Simulations and lake test verify that the proposed DEC-ASAC algorithm can stably and effectively learn control policies during training process, achieving effective control of multiple AUVs to keep different formation shapes.

查看原文本刊更多论文

基于分散自关注的软actor-critic模型的多auv编队控制

协同编队控制是多机器人领域研究的重要课题之一。编队结构是一种组合，在这种组合中，agent保持所需的形式，同时执行分配的命令。本文研究了基于分布式强化学习算法的多自主水下航行器（auv）编队控制。本文提出的基于分散自注意的软行为者-批评家（decc - asac）方法采用注意机制和最大熵强化学习控制，使auv能够独立学习编队控制。设计了领导者和追随者auv相应的环境状态、行动空间和奖励方案。仿真和湖泊试验验证了所提出的DEC-ASAC算法在训练过程中能够稳定有效地学习控制策略，实现对多个auv的有效控制，保持不同的编队形状。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Robotics and Autonomous Systems 工程技术-机器人学

CiteScore

9.00

自引率

7.00%

发文量

164

审稿时长

4.5 months

期刊介绍： Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems. Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.