{"title":"Adaptive Dynamic Programming for a Nonlinear Two-Player Non-Zero-Sum Differential Game With State and Input Constraints","authors":"Yinglu Zhou, Yinya Li, Andong Sheng, Guoqing Qi","doi":"10.1002/rnc.7710","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This paper investigates a nonlinear two-player non-zero-sum differential game with state and input constraints. To solve this problem, this paper constructs a neural network (NN) framework to approximate the solution of the Hamilton-Jacobi-Isaacs (HJI) equation. The adaptive dynamic programming (ADP) method is utilized where each player only needs one critic NN. To solve the issue of state and input saturations, this paper develops a novel constrained system for the differential game, firstly to make the states within the predetermined constraint set. Then, the non-quadratic expression is used to substitute the traditional quadratic expression for the two-player non-zero-sum differential game, and both of the inputs of the two players are constrained. With these treatments, the control input and the system are more in line with real-world applications. Moreover, the stability of the system is also analyzed using the Lyapunov theorem. Two numerical examples are presented to illustrate that the critic NN weights estimation errors and the system are uniformly ultimately bounded (UUB), and the state and input constraints can be achieved.</p>\n </div>","PeriodicalId":50291,"journal":{"name":"International Journal of Robust and Nonlinear Control","volume":"35 3","pages":"1117-1133"},"PeriodicalIF":3.2000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Robust and Nonlinear Control","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rnc.7710","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper investigates a nonlinear two-player non-zero-sum differential game with state and input constraints. To solve this problem, this paper constructs a neural network (NN) framework to approximate the solution of the Hamilton-Jacobi-Isaacs (HJI) equation. The adaptive dynamic programming (ADP) method is utilized where each player only needs one critic NN. To solve the issue of state and input saturations, this paper develops a novel constrained system for the differential game, firstly to make the states within the predetermined constraint set. Then, the non-quadratic expression is used to substitute the traditional quadratic expression for the two-player non-zero-sum differential game, and both of the inputs of the two players are constrained. With these treatments, the control input and the system are more in line with real-world applications. Moreover, the stability of the system is also analyzed using the Lyapunov theorem. Two numerical examples are presented to illustrate that the critic NN weights estimation errors and the system are uniformly ultimately bounded (UUB), and the state and input constraints can be achieved.
期刊介绍:
Papers that do not include an element of robust or nonlinear control and estimation theory will not be considered by the journal, and all papers will be expected to include significant novel content. The focus of the journal is on model based control design approaches rather than heuristic or rule based methods. Papers on neural networks will have to be of exceptional novelty to be considered for the journal.