{"title":"基于模糊强化学习的二阶多智能体系统规定时间最优群体控制。","authors":"Li Shu,Shengyuan Xu","doi":"10.1109/tcyb.2025.3605589","DOIUrl":null,"url":null,"abstract":"This article investigates the prescribed-time (PT) optimal formation control issue for second-order MAS. A novel formation scheme that integrates RL with a FLS is presented, incorporating actor, critic, and identifier components to estimate the optimal control, the optimal cost function, and the uncertain system dynamics (including unknown nonlinearities, external disturbances, and leader input), respectively. To achieve PT formation, we introduce a prescribed performance function and a filtered variable, which are then used to develop an error transformation function for the controller design. Unlike existing PT control approaches, this method eliminates initial value limitations, ensuring that both the prescribed performance function's initial condition and the error transformation parameter are independent of the initial tracking error and system dynamics. We further demonstrate that the developed scheme ensures the prescribed performance of the filtered error, guaranteeing that all formation errors converge to a bounded region within the PT while achieving satisfactory transient performance. Finally, we illustrate the effectiveness of the scheme through two simulated examples.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"35 1","pages":""},"PeriodicalIF":10.5000,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prescribed-Time Optimal Formation Control Using Fuzzy Reinforcement Learning for Second-Order Multiagent Systems.\",\"authors\":\"Li Shu,Shengyuan Xu\",\"doi\":\"10.1109/tcyb.2025.3605589\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article investigates the prescribed-time (PT) optimal formation control issue for second-order MAS. A novel formation scheme that integrates RL with a FLS is presented, incorporating actor, critic, and identifier components to estimate the optimal control, the optimal cost function, and the uncertain system dynamics (including unknown nonlinearities, external disturbances, and leader input), respectively. To achieve PT formation, we introduce a prescribed performance function and a filtered variable, which are then used to develop an error transformation function for the controller design. Unlike existing PT control approaches, this method eliminates initial value limitations, ensuring that both the prescribed performance function's initial condition and the error transformation parameter are independent of the initial tracking error and system dynamics. We further demonstrate that the developed scheme ensures the prescribed performance of the filtered error, guaranteeing that all formation errors converge to a bounded region within the PT while achieving satisfactory transient performance. Finally, we illustrate the effectiveness of the scheme through two simulated examples.\",\"PeriodicalId\":13112,\"journal\":{\"name\":\"IEEE Transactions on Cybernetics\",\"volume\":\"35 1\",\"pages\":\"\"},\"PeriodicalIF\":10.5000,\"publicationDate\":\"2025-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Cybernetics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1109/tcyb.2025.3605589\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cybernetics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/tcyb.2025.3605589","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Prescribed-Time Optimal Formation Control Using Fuzzy Reinforcement Learning for Second-Order Multiagent Systems.
This article investigates the prescribed-time (PT) optimal formation control issue for second-order MAS. A novel formation scheme that integrates RL with a FLS is presented, incorporating actor, critic, and identifier components to estimate the optimal control, the optimal cost function, and the uncertain system dynamics (including unknown nonlinearities, external disturbances, and leader input), respectively. To achieve PT formation, we introduce a prescribed performance function and a filtered variable, which are then used to develop an error transformation function for the controller design. Unlike existing PT control approaches, this method eliminates initial value limitations, ensuring that both the prescribed performance function's initial condition and the error transformation parameter are independent of the initial tracking error and system dynamics. We further demonstrate that the developed scheme ensures the prescribed performance of the filtered error, guaranteeing that all formation errors converge to a bounded region within the PT while achieving satisfactory transient performance. Finally, we illustrate the effectiveness of the scheme through two simulated examples.
期刊介绍:
The scope of the IEEE Transactions on Cybernetics includes computational approaches to the field of cybernetics. Specifically, the transactions welcomes papers on communication and control across machines or machine, human, and organizations. The scope includes such areas as computational intelligence, computer vision, neural networks, genetic algorithms, machine learning, fuzzy systems, cognitive systems, decision making, and robotics, to the extent that they contribute to the theme of cybernetics or demonstrate an application of cybernetics principles.