{"title":"Integral Reinforcement Learning-Based Optimal Control for Nonzero-Sum Games of Multi-Player Input-Constrained Nonlinear Systems","authors":"Bo Zhao, Qiuye Wu, Derong Liu","doi":"10.1145/3505688.3505698","DOIUrl":null,"url":null,"abstract":"This paper investigates an integral reinforcement learning (IRL)-based optimal control scheme to solve nonzero-sum games of multi-player input-constrained nonlinear systems with unknown drift dynamics. The IRL method is introduced to obviate the identification procedure of the unknown drift dynamics. In order to achieve Nash equilibrium for each player, the simplified optimal control policy for each player is obtained by solving the coupled Hamilton-Jacobi equation with the critic neural network only. Thus, the computational resource is saved and the control structure is easy to realize. The closed-loop system is guaranteed to be uniformly ultimately bounded based on the Lyapunov stability analysis. Simulation results illustrate the effectiveness of the developed control scheme.","PeriodicalId":375528,"journal":{"name":"Proceedings of the 7th International Conference on Robotics and Artificial Intelligence","volume":"09 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 7th International Conference on Robotics and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3505688.3505698","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper investigates an integral reinforcement learning (IRL)-based optimal control scheme to solve nonzero-sum games of multi-player input-constrained nonlinear systems with unknown drift dynamics. The IRL method is introduced to obviate the identification procedure of the unknown drift dynamics. In order to achieve Nash equilibrium for each player, the simplified optimal control policy for each player is obtained by solving the coupled Hamilton-Jacobi equation with the critic neural network only. Thus, the computational resource is saved and the control structure is easy to realize. The closed-loop system is guaranteed to be uniformly ultimately bounded based on the Lyapunov stability analysis. Simulation results illustrate the effectiveness of the developed control scheme.