基于粒子群优化的多参与者非线性系统非零和博弈神经动态规划

2022 IEEE International Conference on Real-time Computing and Robotics (RCAR) Pub Date : 2022-07-17 DOI:10.1109/RCAR54675.2022.9872183

Qiuye Wu, Bo Zhao, Derong Liu

{"title":"基于粒子群优化的多参与者非线性系统非零和博弈神经动态规划","authors":"Qiuye Wu, Bo Zhao, Derong Liu","doi":"10.1109/RCAR54675.2022.9872183","DOIUrl":null,"url":null,"abstract":"This paper focuses on an integral reinforcement learning (IRL)-based optimal control scheme using particle swarm optimized neural networks for nonzero-sum games of multi-player nonlinear systems with unknown drift dynamics. By combining IRL with neuro-dynamic programming method, the identification procedure is obviated. The optimal control policy of each player is acquired by solving the coupled Hamilton-Jacobi equation via the particle swarm optimized critic neural network, which avoids the difficulty in selecting the initial weight vector manually. The closed-loop system is ensured to be stable according to the Lyapunov’s direct method. The effectiveness of the developed scheme is demonstrated by numerical simulations.","PeriodicalId":304963,"journal":{"name":"2022 IEEE International Conference on Real-time Computing and Robotics (RCAR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Particle Swarm optimization-Based Neuro-Dynamic Programming for Nonzero-Sum Games of Multi-Player Nonlinear Systems\",\"authors\":\"Qiuye Wu, Bo Zhao, Derong Liu\",\"doi\":\"10.1109/RCAR54675.2022.9872183\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper focuses on an integral reinforcement learning (IRL)-based optimal control scheme using particle swarm optimized neural networks for nonzero-sum games of multi-player nonlinear systems with unknown drift dynamics. By combining IRL with neuro-dynamic programming method, the identification procedure is obviated. The optimal control policy of each player is acquired by solving the coupled Hamilton-Jacobi equation via the particle swarm optimized critic neural network, which avoids the difficulty in selecting the initial weight vector manually. The closed-loop system is ensured to be stable according to the Lyapunov’s direct method. The effectiveness of the developed scheme is demonstrated by numerical simulations.\",\"PeriodicalId\":304963,\"journal\":{\"name\":\"2022 IEEE International Conference on Real-time Computing and Robotics (RCAR)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Real-time Computing and Robotics (RCAR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RCAR54675.2022.9872183\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Real-time Computing and Robotics (RCAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RCAR54675.2022.9872183","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

研究了一种基于积分强化学习(IRL)的粒子群优化神经网络最优控制方案，用于求解具有未知漂移动力学的多参与者非线性系统的非零和博弈。将IRL与神经动态规划方法相结合，简化了辨识过程。通过粒子群优化评价神经网络求解耦合Hamilton-Jacobi方程，获得每个参与者的最优控制策略，避免了手动选择初始权向量的困难。根据李亚普诺夫直接法，保证了闭环系统的稳定。数值模拟结果表明了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Particle Swarm optimization-Based Neuro-Dynamic Programming for Nonzero-Sum Games of Multi-Player Nonlinear Systems

This paper focuses on an integral reinforcement learning (IRL)-based optimal control scheme using particle swarm optimized neural networks for nonzero-sum games of multi-player nonlinear systems with unknown drift dynamics. By combining IRL with neuro-dynamic programming method, the identification procedure is obviated. The optimal control policy of each player is acquired by solving the coupled Hamilton-Jacobi equation via the particle swarm optimized critic neural network, which avoids the difficulty in selecting the initial weight vector manually. The closed-loop system is ensured to be stable according to the Lyapunov’s direct method. The effectiveness of the developed scheme is demonstrated by numerical simulations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE International Conference on Real-time Computing and Robotics (RCAR)

自引率

0.00%

发文量