{"title":"Lifelong Safe Optimal Adaptive Tracking Control of Nonlinear Strict-Feedback Discrete-Time Systems","authors":"Behzad Farzanegan, S. Jagannathan","doi":"10.1002/acs.3950","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This paper presents a comprehensive approach for achieving multi-task safe optimal adaptive tracking (MSOAT) for a class of nonlinear discrete-time systems, particularly those in strict-feedback form, utilizing a multi-layer neural network (MNN)-based framework. To begin, a cost function with a novel Barrier function (BF) term is introduced for each subsystem to address the weak safely reachable problem, serving as a crucial tool for guiding the system's trajectory toward the safe set while avoiding unwanted sets. To deal with the tracking problem, the Hamilton-Jacobi-Bellman (HJB) framework is used through the actor-critic MNN-based backstepping technique to estimate the solution of the value functions and obtain both virtual and actual optimal control policies for each subsystem, effectively circumventing non-causality issues. Further, to mitigate catastrophic forgetting in multi-tasking scenarios, a regularizer term, which is derived from the online version of the Elastic Weight Consolidation (EWC) method, is included in the critic and actor MNN update laws without directly computing the Fisher information matrix. To enhance the convergence rate, the critic MNN is tuned with a hybrid learning technique involving weight adjustments both at specific sampling instants and iteratively within those intervals. A control barrier function (CBF) with a time-varying BF is also integrated into the actor update law, collaborating with the BF to keep the trajectory in the safe set with a smaller trade-off factor, simultaneously validating the safety condition in real-time. Finally, the overall stability is established. An example of a 6-DOF autonomous underwater vehicle (AUV) is used to assess the effectiveness of the proposed approach.</p>\n </div>","PeriodicalId":50347,"journal":{"name":"International Journal of Adaptive Control and Signal Processing","volume":"39 3","pages":"451-470"},"PeriodicalIF":3.9000,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Adaptive Control and Signal Processing","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/acs.3950","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper presents a comprehensive approach for achieving multi-task safe optimal adaptive tracking (MSOAT) for a class of nonlinear discrete-time systems, particularly those in strict-feedback form, utilizing a multi-layer neural network (MNN)-based framework. To begin, a cost function with a novel Barrier function (BF) term is introduced for each subsystem to address the weak safely reachable problem, serving as a crucial tool for guiding the system's trajectory toward the safe set while avoiding unwanted sets. To deal with the tracking problem, the Hamilton-Jacobi-Bellman (HJB) framework is used through the actor-critic MNN-based backstepping technique to estimate the solution of the value functions and obtain both virtual and actual optimal control policies for each subsystem, effectively circumventing non-causality issues. Further, to mitigate catastrophic forgetting in multi-tasking scenarios, a regularizer term, which is derived from the online version of the Elastic Weight Consolidation (EWC) method, is included in the critic and actor MNN update laws without directly computing the Fisher information matrix. To enhance the convergence rate, the critic MNN is tuned with a hybrid learning technique involving weight adjustments both at specific sampling instants and iteratively within those intervals. A control barrier function (CBF) with a time-varying BF is also integrated into the actor update law, collaborating with the BF to keep the trajectory in the safe set with a smaller trade-off factor, simultaneously validating the safety condition in real-time. Finally, the overall stability is established. An example of a 6-DOF autonomous underwater vehicle (AUV) is used to assess the effectiveness of the proposed approach.
期刊介绍:
The International Journal of Adaptive Control and Signal Processing is concerned with the design, synthesis and application of estimators or controllers where adaptive features are needed to cope with uncertainties.Papers on signal processing should also have some relevance to adaptive systems. The journal focus is on model based control design approaches rather than heuristic or rule based control design methods. All papers will be expected to include significant novel material.
Both the theory and application of adaptive systems and system identification are areas of interest. Papers on applications can include problems in the implementation of algorithms for real time signal processing and control. The stability, convergence, robustness and numerical aspects of adaptive algorithms are also suitable topics. The related subjects of controller tuning, filtering, networks and switching theory are also of interest. Principal areas to be addressed include:
Auto-Tuning, Self-Tuning and Model Reference Adaptive Controllers
Nonlinear, Robust and Intelligent Adaptive Controllers
Linear and Nonlinear Multivariable System Identification and Estimation
Identification of Linear Parameter Varying, Distributed and Hybrid Systems
Multiple Model Adaptive Control
Adaptive Signal processing Theory and Algorithms
Adaptation in Multi-Agent Systems
Condition Monitoring Systems
Fault Detection and Isolation Methods
Fault Detection and Isolation Methods
Fault-Tolerant Control (system supervision and diagnosis)
Learning Systems and Adaptive Modelling
Real Time Algorithms for Adaptive Signal Processing and Control
Adaptive Signal Processing and Control Applications
Adaptive Cloud Architectures and Networking
Adaptive Mechanisms for Internet of Things
Adaptive Sliding Mode Control.