{"title":"Hierarchical RL-ESN control for autonomous underwater vehicle: 6-DOF large angle rotation maneuvering","authors":"Zihan Xia, Bing Huang, Cheng Zhu","doi":"10.1016/j.isatra.2025.07.036","DOIUrl":null,"url":null,"abstract":"<div><div>The attitude representation of the autonomous underwater vehicle (AUV) typically relies on Euler angles and unit quaternions. However, existing works have demonstrated that both approaches exhibit inherent limitations when solving large-angle rotation maneuvers, primarily characterized by singularities and unwinding phenomena. In light of these limitations, this article investigates a model-free tracking control scheme for AUV to perform arbitrary large-angle rotation maneuvers in six degrees of freedom (6-DOF). Specifically, a rotation matrix-based error dynamics is constructed to achieve a globally unique attitude representation. A key technical obstacle is that defining attitude error directly via the rotation matrix complicates controller design. To circumvent this problem, an alternative error metric is presented to transform the rotation matrix-based attitude error from the special orthogonal group in three-dimensional space (SO(3)) to Euclidean space. Distinguished from a single estimation network, a hierarchical echo state network (ESN) is established to estimate hydrodynamic coefficients and actuator faults, which comprises multiple independent subnetworks to conduct various estimation tasks. Moreover, a collaborative critic network (CCN) is utilized to generate reinforcement signals, endowing the hierarchical ESN with favorable learning capability. In this way, the system robustness is significantly enhanced while ensuring high tracking performance. Finally, rigorous theoretical analysis and numerical simulations are presented to show the effectiveness and superiority of the proposed control scheme.</div></div>","PeriodicalId":14660,"journal":{"name":"ISA transactions","volume":"166 ","pages":"Pages 393-404"},"PeriodicalIF":6.5000,"publicationDate":"2025-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISA transactions","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0019057825003854","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The attitude representation of the autonomous underwater vehicle (AUV) typically relies on Euler angles and unit quaternions. However, existing works have demonstrated that both approaches exhibit inherent limitations when solving large-angle rotation maneuvers, primarily characterized by singularities and unwinding phenomena. In light of these limitations, this article investigates a model-free tracking control scheme for AUV to perform arbitrary large-angle rotation maneuvers in six degrees of freedom (6-DOF). Specifically, a rotation matrix-based error dynamics is constructed to achieve a globally unique attitude representation. A key technical obstacle is that defining attitude error directly via the rotation matrix complicates controller design. To circumvent this problem, an alternative error metric is presented to transform the rotation matrix-based attitude error from the special orthogonal group in three-dimensional space (SO(3)) to Euclidean space. Distinguished from a single estimation network, a hierarchical echo state network (ESN) is established to estimate hydrodynamic coefficients and actuator faults, which comprises multiple independent subnetworks to conduct various estimation tasks. Moreover, a collaborative critic network (CCN) is utilized to generate reinforcement signals, endowing the hierarchical ESN with favorable learning capability. In this way, the system robustness is significantly enhanced while ensuring high tracking performance. Finally, rigorous theoretical analysis and numerical simulations are presented to show the effectiveness and superiority of the proposed control scheme.
期刊介绍:
ISA Transactions serves as a platform for showcasing advancements in measurement and automation, catering to both industrial practitioners and applied researchers. It covers a wide array of topics within measurement, including sensors, signal processing, data analysis, and fault detection, supported by techniques such as artificial intelligence and communication systems. Automation topics encompass control strategies, modelling, system reliability, and maintenance, alongside optimization and human-machine interaction. The journal targets research and development professionals in control systems, process instrumentation, and automation from academia and industry.