执行器失效情况下非线性典型多代理系统的容错优化共识控制的自适应强化学习

IF 4 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Boyan Zhu;Liang Zhang;Ben Niu;Ning Zhao
{"title":"执行器失效情况下非线性典型多代理系统的容错优化共识控制的自适应强化学习","authors":"Boyan Zhu;Liang Zhang;Ben Niu;Ning Zhao","doi":"10.1109/JSYST.2024.3433023","DOIUrl":null,"url":null,"abstract":"This article addresses the adaptive optimized consensus tracking control problem of nonlinear multiagent systems (MASs) via a reinforcement learning (RL) algorithm. Specifically, the nonlinear high-order MASs are formulated in a canonical form, with considerations for both actuator effectiveness loss and time-varying bias faults. First, neural networks (NNs) are utilized to approximate unknown nonlinear dynamics, and a state identifier and a fault estimator based on NNs are established, both of which are essential for evaluating state information and bias faults, respectively. Second, to achieve a high-order canonical dynamic consensus and enhance the efficiency of the consensus control strategy, a sliding-mode mechanism is employed to regulate tracking errors. Moreover, we develop an adaptive NN-based fault-tolerant optimal control method by integrating the sliding-mode mechanism with an actor–critic structured RL algorithm. It is proved that the outputs of the MASs precisely align with the desired reference signals, while ensuring the boundedness of all closed-loop signals. Finally, the proposed control methodology's effectiveness is validated through a simulation example.","PeriodicalId":55017,"journal":{"name":"IEEE Systems Journal","volume":"18 3","pages":"1681-1692"},"PeriodicalIF":4.0000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adaptive Reinforcement Learning for Fault-Tolerant Optimal Consensus Control of Nonlinear Canonical Multiagent Systems With Actuator Loss of Effectiveness\",\"authors\":\"Boyan Zhu;Liang Zhang;Ben Niu;Ning Zhao\",\"doi\":\"10.1109/JSYST.2024.3433023\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article addresses the adaptive optimized consensus tracking control problem of nonlinear multiagent systems (MASs) via a reinforcement learning (RL) algorithm. Specifically, the nonlinear high-order MASs are formulated in a canonical form, with considerations for both actuator effectiveness loss and time-varying bias faults. First, neural networks (NNs) are utilized to approximate unknown nonlinear dynamics, and a state identifier and a fault estimator based on NNs are established, both of which are essential for evaluating state information and bias faults, respectively. Second, to achieve a high-order canonical dynamic consensus and enhance the efficiency of the consensus control strategy, a sliding-mode mechanism is employed to regulate tracking errors. Moreover, we develop an adaptive NN-based fault-tolerant optimal control method by integrating the sliding-mode mechanism with an actor–critic structured RL algorithm. It is proved that the outputs of the MASs precisely align with the desired reference signals, while ensuring the boundedness of all closed-loop signals. Finally, the proposed control methodology's effectiveness is validated through a simulation example.\",\"PeriodicalId\":55017,\"journal\":{\"name\":\"IEEE Systems Journal\",\"volume\":\"18 3\",\"pages\":\"1681-1692\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2024-08-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Systems Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10634586/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Systems Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10634586/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

本文通过强化学习(RL)算法解决了非线性多代理系统(MAS)的自适应优化共识跟踪控制问题。具体来说,非线性高阶 MAS 采用典型形式,同时考虑了执行器效力损失和时变偏差故障。首先,利用神经网络(NN)来逼近未知的非线性动力学,并建立了基于 NN 的状态识别器和故障估计器,这两者分别对评估状态信息和偏差故障至关重要。其次,为了实现高阶典型动态共识并提高共识控制策略的效率,我们采用了滑模机制来调节跟踪误差。此外,我们还将滑模机制与行为批判结构化 RL 算法相结合,开发了一种基于 NN 的自适应容错优化控制方法。事实证明,MAS 的输出与所需的参考信号精确一致,同时确保所有闭环信号的有界性。最后,通过一个仿真实例验证了所提出的控制方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Adaptive Reinforcement Learning for Fault-Tolerant Optimal Consensus Control of Nonlinear Canonical Multiagent Systems With Actuator Loss of Effectiveness
This article addresses the adaptive optimized consensus tracking control problem of nonlinear multiagent systems (MASs) via a reinforcement learning (RL) algorithm. Specifically, the nonlinear high-order MASs are formulated in a canonical form, with considerations for both actuator effectiveness loss and time-varying bias faults. First, neural networks (NNs) are utilized to approximate unknown nonlinear dynamics, and a state identifier and a fault estimator based on NNs are established, both of which are essential for evaluating state information and bias faults, respectively. Second, to achieve a high-order canonical dynamic consensus and enhance the efficiency of the consensus control strategy, a sliding-mode mechanism is employed to regulate tracking errors. Moreover, we develop an adaptive NN-based fault-tolerant optimal control method by integrating the sliding-mode mechanism with an actor–critic structured RL algorithm. It is proved that the outputs of the MASs precisely align with the desired reference signals, while ensuring the boundedness of all closed-loop signals. Finally, the proposed control methodology's effectiveness is validated through a simulation example.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Systems Journal
IEEE Systems Journal 工程技术-电信学
CiteScore
9.80
自引率
6.80%
发文量
572
审稿时长
4.9 months
期刊介绍: This publication provides a systems-level, focused forum for application-oriented manuscripts that address complex systems and system-of-systems of national and global significance. It intends to encourage and facilitate cooperation and interaction among IEEE Societies with systems-level and systems engineering interest, and to attract non-IEEE contributors and readers from around the globe. Our IEEE Systems Council job is to address issues in new ways that are not solvable in the domains of the existing IEEE or other societies or global organizations. These problems do not fit within traditional hierarchical boundaries. For example, disaster response such as that triggered by Hurricane Katrina, tsunamis, or current volcanic eruptions is not solvable by pure engineering solutions. We need to think about changing and enlarging the paradigm to include systems issues.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信