具有不连续约束条件的多代理系统的基于估计器的强化学习共识控制

IF 10.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE transactions on neural networks and learning systems Pub Date : 2024-09-02 DOI:10.1109/TNNLS.2024.3445880

Ao Luo, Hui Ma, Hongru Ren, Hongyi Li

{"title":"具有不连续约束条件的多代理系统的基于估计器的强化学习共识控制","authors":"Ao Luo, Hui Ma, Hongru Ren, Hongyi Li","doi":"10.1109/TNNLS.2024.3445880","DOIUrl":null,"url":null,"abstract":"This article focuses on the optimal consensus control problem for multiagent systems (MASs) with discontinuous constraints. The case of discontinuous constraints is a particular instance of state constraints, which has been studied less but occurs in many practical situations. Due to the discontinuous constraint boundaries, the traditional barrier function-based backstepping methods cannot be used directly. In response to this thorny problem, a novel constraint boundary reconstruction technique is proposed by designing a class of switch-like functions. The technique can convert discontinuous constraint boundaries into continuous ones, and it strictly proves that when the states satisfy the transformed constraint boundaries, the original constraints are also absolutely fulfilled. Meanwhile, with the aid of the barrier function and distributed event-triggered estimator, an improved coordinate transformation is constructed, which can remove the \"feasibility condition\" and simplify the controller design. In addition, by introducing prediction error and revised term into the learning process of neural networks (NNs), the optimal consensus problem is resolved by constructing a modified reinforcement learning strategy. Finally, the stability of the MASs is testified through the Lyapunov stability theory, and a simulation example verifies the effectiveness of the proposed method.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"PP ","pages":""},"PeriodicalIF":10.2000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Estimator-Based Reinforcement Learning Consensus Control for Multiagent Systems With Discontinuous Constraints.\",\"authors\":\"Ao Luo, Hui Ma, Hongru Ren, Hongyi Li\",\"doi\":\"10.1109/TNNLS.2024.3445880\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article focuses on the optimal consensus control problem for multiagent systems (MASs) with discontinuous constraints. The case of discontinuous constraints is a particular instance of state constraints, which has been studied less but occurs in many practical situations. Due to the discontinuous constraint boundaries, the traditional barrier function-based backstepping methods cannot be used directly. In response to this thorny problem, a novel constraint boundary reconstruction technique is proposed by designing a class of switch-like functions. The technique can convert discontinuous constraint boundaries into continuous ones, and it strictly proves that when the states satisfy the transformed constraint boundaries, the original constraints are also absolutely fulfilled. Meanwhile, with the aid of the barrier function and distributed event-triggered estimator, an improved coordinate transformation is constructed, which can remove the \\\"feasibility condition\\\" and simplify the controller design. In addition, by introducing prediction error and revised term into the learning process of neural networks (NNs), the optimal consensus problem is resolved by constructing a modified reinforcement learning strategy. Finally, the stability of the MASs is testified through the Lyapunov stability theory, and a simulation example verifies the effectiveness of the proposed method.\",\"PeriodicalId\":13303,\"journal\":{\"name\":\"IEEE transactions on neural networks and learning systems\",\"volume\":\"PP \",\"pages\":\"\"},\"PeriodicalIF\":10.2000,\"publicationDate\":\"2024-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on neural networks and learning systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1109/TNNLS.2024.3445880\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on neural networks and learning systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/TNNLS.2024.3445880","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

本文重点讨论具有不连续约束的多代理系统（MAS）的最优共识控制问题。不连续约束是状态约束的一种特殊情况，虽然研究较少，但在很多实际情况中都会出现。由于约束边界不连续，传统的基于障碍函数的反步进方法无法直接使用。针对这一棘手问题，我们通过设计一类开关类函数，提出了一种新颖的约束边界重构技术。该技术能将不连续的约束边界转换为连续的约束边界，并严格证明当状态满足转换后的约束边界时，原始约束也绝对满足。同时，借助障碍函数和分布式事件触发估计器，构建了一种改进的坐标变换，可以消除 "可行性条件"，简化控制器设计。此外，在神经网络（NN）的学习过程中引入预测误差和修正项，通过构建改进的强化学习策略解决了最优共识问题。最后，通过李雅普诺夫稳定性理论验证了 MAS 的稳定性，并通过仿真实例验证了所提方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Estimator-Based Reinforcement Learning Consensus Control for Multiagent Systems With Discontinuous Constraints.

This article focuses on the optimal consensus control problem for multiagent systems (MASs) with discontinuous constraints. The case of discontinuous constraints is a particular instance of state constraints, which has been studied less but occurs in many practical situations. Due to the discontinuous constraint boundaries, the traditional barrier function-based backstepping methods cannot be used directly. In response to this thorny problem, a novel constraint boundary reconstruction technique is proposed by designing a class of switch-like functions. The technique can convert discontinuous constraint boundaries into continuous ones, and it strictly proves that when the states satisfy the transformed constraint boundaries, the original constraints are also absolutely fulfilled. Meanwhile, with the aid of the barrier function and distributed event-triggered estimator, an improved coordinate transformation is constructed, which can remove the "feasibility condition" and simplify the controller design. In addition, by introducing prediction error and revised term into the learning process of neural networks (NNs), the optimal consensus problem is resolved by constructing a modified reinforcement learning strategy. Finally, the stability of the MASs is testified through the Lyapunov stability theory, and a simulation example verifies the effectiveness of the proposed method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE transactions on neural networks and learning systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CiteScore

23.80

自引率

9.60%

发文量

2102

审稿时长

3-8 weeks

期刊介绍： The focus of IEEE Transactions on Neural Networks and Learning Systems is to present scholarly articles discussing the theory, design, and applications of neural networks as well as other learning systems. The journal primarily highlights technical and scientific research in this domain.