{"title":"Prescribed-time reinforcement learning formation control of nonlinear MASs with an unknown dynamic leader","authors":"Benxin Zhao , Yuan-Xin Li , Zhongsheng Hou","doi":"10.1016/j.amc.2025.129711","DOIUrl":null,"url":null,"abstract":"<div><div>This paper investigates the prescribed-time optimal formation control problem for nonlinear multi-agent systems (MASs) with an unknown dynamic leader. The agents need to not only form a predefined formation pattern but also track the leader’s trajectory in a prescribed time. A hierarchical control framework, which includes the communication layer and the tracking control layer, is established to address the formulated issue. In the communication layer, a distributed prescribed-time observer is established to accurately estimate the leader’s information, which can make observation error convergences to zero in a prescribed time. In particular, the leaders’ uncertainties are solved by constructing a novel adaptive law. With the estimated results, a novel transformation relationship and the prescribed-time adjustment function are constructed to guarantee that formation tracking error converges to the predefined accuracy in a prescribed time. Subsequently, the reinforcement learning (RL) algorithm with the fuzzy logic systems (FLSs) is devised to optimize system performance. Based on the Lyapunov stability theory, it is shown that the formation errors <span><math><mrow><msub><mi>ξ</mi><mrow><mi>i</mi><mo>,</mo><mn>1</mn></mrow></msub><mo>=</mo><msub><mi>x</mi><mrow><mi>i</mi><mo>,</mo><mn>1</mn></mrow></msub><mo>−</mo><msub><mover><mi>y</mi><mo>^</mo></mover><mrow><mn>0</mn><mo>,</mo><mi>i</mi></mrow></msub><mo>−</mo><msub><mi>η</mi><mrow><mi>i</mi><mo>,</mo><mi>d</mi></mrow></msub></mrow></math></span> are consistently confined within an interval <span><math><mrow><mo>(</mo><mi>tan</mi><mrow><mo>(</mo><mo>−</mo><mfrac><mi>π</mi><mrow><mn>2</mn><msub><mi>μ</mi><mn>2</mn></msub></mrow></mfrac><mo>)</mo></mrow><mo>,</mo><mi>tan</mi><mrow><mo>(</mo><mfrac><mi>π</mi><mrow><mn>2</mn><msub><mi>μ</mi><mn>2</mn></msub></mrow></mfrac><mo>)</mo></mrow><mo>)</mo></mrow></math></span>, while all signals of the closed-loop system are bounded. Ultimately, the superiority of the devised algorithm is demonstrated by a representative example.</div></div>","PeriodicalId":55496,"journal":{"name":"Applied Mathematics and Computation","volume":"510 ","pages":"Article 129711"},"PeriodicalIF":3.4000,"publicationDate":"2025-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Mathematics and Computation","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0096300325004370","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
This paper investigates the prescribed-time optimal formation control problem for nonlinear multi-agent systems (MASs) with an unknown dynamic leader. The agents need to not only form a predefined formation pattern but also track the leader’s trajectory in a prescribed time. A hierarchical control framework, which includes the communication layer and the tracking control layer, is established to address the formulated issue. In the communication layer, a distributed prescribed-time observer is established to accurately estimate the leader’s information, which can make observation error convergences to zero in a prescribed time. In particular, the leaders’ uncertainties are solved by constructing a novel adaptive law. With the estimated results, a novel transformation relationship and the prescribed-time adjustment function are constructed to guarantee that formation tracking error converges to the predefined accuracy in a prescribed time. Subsequently, the reinforcement learning (RL) algorithm with the fuzzy logic systems (FLSs) is devised to optimize system performance. Based on the Lyapunov stability theory, it is shown that the formation errors are consistently confined within an interval , while all signals of the closed-loop system are bounded. Ultimately, the superiority of the devised algorithm is demonstrated by a representative example.
期刊介绍:
Applied Mathematics and Computation addresses work at the interface between applied mathematics, numerical computation, and applications of systems – oriented ideas to the physical, biological, social, and behavioral sciences, and emphasizes papers of a computational nature focusing on new algorithms, their analysis and numerical results.
In addition to presenting research papers, Applied Mathematics and Computation publishes review articles and single–topics issues.