Lifelong Safe Optimal Adaptive Tracking Control of Nonlinear Strict-Feedback Discrete-Time Systems

IF 3.9 4区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS

International Journal of Adaptive Control and Signal Processing Pub Date : 2024-12-12 DOI:10.1002/acs.3950

Behzad Farzanegan, S. Jagannathan

{"title":"Lifelong Safe Optimal Adaptive Tracking Control of Nonlinear Strict-Feedback Discrete-Time Systems","authors":"Behzad Farzanegan, S. Jagannathan","doi":"10.1002/acs.3950","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This paper presents a comprehensive approach for achieving multi-task safe optimal adaptive tracking (MSOAT) for a class of nonlinear discrete-time systems, particularly those in strict-feedback form, utilizing a multi-layer neural network (MNN)-based framework. To begin, a cost function with a novel Barrier function (BF) term is introduced for each subsystem to address the weak safely reachable problem, serving as a crucial tool for guiding the system's trajectory toward the safe set while avoiding unwanted sets. To deal with the tracking problem, the Hamilton-Jacobi-Bellman (HJB) framework is used through the actor-critic MNN-based backstepping technique to estimate the solution of the value functions and obtain both virtual and actual optimal control policies for each subsystem, effectively circumventing non-causality issues. Further, to mitigate catastrophic forgetting in multi-tasking scenarios, a regularizer term, which is derived from the online version of the Elastic Weight Consolidation (EWC) method, is included in the critic and actor MNN update laws without directly computing the Fisher information matrix. To enhance the convergence rate, the critic MNN is tuned with a hybrid learning technique involving weight adjustments both at specific sampling instants and iteratively within those intervals. A control barrier function (CBF) with a time-varying BF is also integrated into the actor update law, collaborating with the BF to keep the trajectory in the safe set with a smaller trade-off factor, simultaneously validating the safety condition in real-time. Finally, the overall stability is established. An example of a 6-DOF autonomous underwater vehicle (AUV) is used to assess the effectiveness of the proposed approach.</p>\n </div>","PeriodicalId":50347,"journal":{"name":"International Journal of Adaptive Control and Signal Processing","volume":"39 3","pages":"451-470"},"PeriodicalIF":3.9000,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Adaptive Control and Signal Processing","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/acs.3950","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

This paper presents a comprehensive approach for achieving multi-task safe optimal adaptive tracking (MSOAT) for a class of nonlinear discrete-time systems, particularly those in strict-feedback form, utilizing a multi-layer neural network (MNN)-based framework. To begin, a cost function with a novel Barrier function (BF) term is introduced for each subsystem to address the weak safely reachable problem, serving as a crucial tool for guiding the system's trajectory toward the safe set while avoiding unwanted sets. To deal with the tracking problem, the Hamilton-Jacobi-Bellman (HJB) framework is used through the actor-critic MNN-based backstepping technique to estimate the solution of the value functions and obtain both virtual and actual optimal control policies for each subsystem, effectively circumventing non-causality issues. Further, to mitigate catastrophic forgetting in multi-tasking scenarios, a regularizer term, which is derived from the online version of the Elastic Weight Consolidation (EWC) method, is included in the critic and actor MNN update laws without directly computing the Fisher information matrix. To enhance the convergence rate, the critic MNN is tuned with a hybrid learning technique involving weight adjustments both at specific sampling instants and iteratively within those intervals. A control barrier function (CBF) with a time-varying BF is also integrated into the actor update law, collaborating with the BF to keep the trajectory in the safe set with a smaller trade-off factor, simultaneously validating the safety condition in real-time. Finally, the overall stability is established. An example of a 6-DOF autonomous underwater vehicle (AUV) is used to assess the effectiveness of the proposed approach.

查看原文本刊更多论文

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Adaptive Control and Signal Processing 工程技术-工程：电子与电气

CiteScore

5.30

自引率

16.10%

发文量

163

审稿时长

5 months

期刊介绍： The International Journal of Adaptive Control and Signal Processing is concerned with the design, synthesis and application of estimators or controllers where adaptive features are needed to cope with uncertainties.Papers on signal processing should also have some relevance to adaptive systems. The journal focus is on model based control design approaches rather than heuristic or rule based control design methods. All papers will be expected to include significant novel material. Both the theory and application of adaptive systems and system identification are areas of interest. Papers on applications can include problems in the implementation of algorithms for real time signal processing and control. The stability, convergence, robustness and numerical aspects of adaptive algorithms are also suitable topics. The related subjects of controller tuning, filtering, networks and switching theory are also of interest. Principal areas to be addressed include: Auto-Tuning, Self-Tuning and Model Reference Adaptive Controllers Nonlinear, Robust and Intelligent Adaptive Controllers Linear and Nonlinear Multivariable System Identification and Estimation Identification of Linear Parameter Varying, Distributed and Hybrid Systems Multiple Model Adaptive Control Adaptive Signal processing Theory and Algorithms Adaptation in Multi-Agent Systems Condition Monitoring Systems Fault Detection and Isolation Methods Fault Detection and Isolation Methods Fault-Tolerant Control (system supervision and diagnosis) Learning Systems and Adaptive Modelling Real Time Algorithms for Adaptive Signal Processing and Control Adaptive Signal Processing and Control Applications Adaptive Cloud Architectures and Networking Adaptive Mechanisms for Internet of Things Adaptive Sliding Mode Control.