Nearly optimal fixed time sliding mode controller for leader–follower consensus problem with partially unknown nonlinear agents

IF 2.5 3区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS

European Journal of Control Pub Date : 2024-07-26 DOI:10.1016/j.ejcon.2024.101093

Zahra Sharifi, Marzieh Kamali, Farid Sheikholeslam

{"title":"Nearly optimal fixed time sliding mode controller for leader–follower consensus problem with partially unknown nonlinear agents","authors":"Zahra Sharifi, Marzieh Kamali, Farid Sheikholeslam","doi":"10.1016/j.ejcon.2024.101093","DOIUrl":null,"url":null,"abstract":"<div><p>This paper presents a conceptual framework for addressing the consensus problem in multi-agent systems with unknown nonlinear dynamics using reinforcement learning (RL) based nearly optimal sliding mode controller (SMC). The agents’ dynamics are assumed to have uncertainty and mismatched disturbance. An adaptive fixed-time estimator is introduced to estimate uncertain dynamics and disturbances for each agent at a certain time. The paper proposes two control strategies. In the first strategy, a controller is designed, incorporating adaptive SMC and an optimal controller. Adaption law in SMC estimates the bound of fixed time estimator error before convergence, ultimately achieving asymptotic convergence to the sliding surface and converting the agent’s dynamics to linear. This enables solving a linear consensus problem using an RL-based adaptive optimal controller through an on-policy critic–actor method. The second control strategy enhances the adaptive SMC into a fixed-time controller, reducing the time to reach the sliding surface regardless of initial conditions. Consequently, the convergence time of the consensus error to zero is diminished. This reduction in reaching time results in faster convergence of the consensus error to zero. The effectiveness of both strategies is validated through numerical experiments on two real system models, aligning with the theoretical proofs.</p></div>","PeriodicalId":50489,"journal":{"name":"European Journal of Control","volume":"79 ","pages":"Article 101093"},"PeriodicalIF":2.5000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Control","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0947358024001535","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

This paper presents a conceptual framework for addressing the consensus problem in multi-agent systems with unknown nonlinear dynamics using reinforcement learning (RL) based nearly optimal sliding mode controller (SMC). The agents’ dynamics are assumed to have uncertainty and mismatched disturbance. An adaptive fixed-time estimator is introduced to estimate uncertain dynamics and disturbances for each agent at a certain time. The paper proposes two control strategies. In the first strategy, a controller is designed, incorporating adaptive SMC and an optimal controller. Adaption law in SMC estimates the bound of fixed time estimator error before convergence, ultimately achieving asymptotic convergence to the sliding surface and converting the agent’s dynamics to linear. This enables solving a linear consensus problem using an RL-based adaptive optimal controller through an on-policy critic–actor method. The second control strategy enhances the adaptive SMC into a fixed-time controller, reducing the time to reach the sliding surface regardless of initial conditions. Consequently, the convergence time of the consensus error to zero is diminished. This reduction in reaching time results in faster convergence of the consensus error to zero. The effectiveness of both strategies is validated through numerical experiments on two real system models, aligning with the theoretical proofs.

查看原文本刊更多论文

部分未知非线性代理的领导者-追随者共识问题的近最优固定时间滑动模式控制器

本文提出了一个概念框架，利用基于强化学习（RL）的近乎最优滑模控制器（SMC）来解决具有未知非线性动力学特性的多代理系统中的共识问题。假设代理动态具有不确定性和不匹配干扰。本文引入了一个自适应固定时间估算器，用于估算每个代理在特定时间的不确定动态和干扰。本文提出了两种控制策略。在第一种策略中，设计了一种包含自适应 SMC 和最优控制器的控制器。SMC 中的自适应法则会在收敛前估计固定时间估计误差的边界，最终实现对滑动面的渐近收敛，并将代理动态转换为线性。这样就能利用基于 RL 的自适应最优控制器，通过策略上的批判者-行动者方法来解决线性共识问题。第二种控制策略将自适应 SMC 增强为固定时间控制器，缩短了到达滑动面的时间，而不受初始条件的影响。因此，共识误差收敛到零的时间缩短了。到达滑动面时间的缩短使得一致误差更快地趋于零。通过对两个真实系统模型的数值实验，验证了这两种策略的有效性，并与理论证明相吻合。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

European Journal of Control 工程技术-自动化与控制系统

CiteScore

5.80

自引率

5.90%

发文量

131

审稿时长

1 months

期刊介绍： The European Control Association (EUCA) has among its objectives to promote the development of the discipline. Apart from the European Control Conferences, the European Journal of Control is the Association''s main channel for the dissemination of important contributions in the field. The aim of the Journal is to publish high quality papers on the theory and practice of control and systems engineering. The scope of the Journal will be wide and cover all aspects of the discipline including methodologies, techniques and applications. Research in control and systems engineering is necessary to develop new concepts and tools which enhance our understanding and improve our ability to design and implement high performance control systems. Submitted papers should stress the practical motivations and relevance of their results. The design and implementation of a successful control system requires the use of a range of techniques: Modelling Robustness Analysis Identification Optimization Control Law Design Numerical analysis Fault Detection, and so on.