C. Qin, Yingchun Wang, Yanhong Luo, Huaguang Zhang
Title: Neural network-based near-optimal control for nonlinear discrete-time zero-sum differential games associated with the H∞ control problem
Published in: Fifth International Conference on Intelligent Control and Information Processing
Publication date: 2014-08-01
DOI: https://doi.org/10.1109/ICICIP.2014.7010275
Citations: 2
Abstract
In this paper, we present a new method for solving online the Hamilton-Jacobi-Isaacs (HJI) equation that arises in the two-player zero-sum differential game of a nonlinear system. First, an online parametric structure is designed that uses a neural network to approximate the value function associated with the two-player zero-sum differential game. Second, online approximator-based controller designs are presented that use two neural networks to find the saddle-point equilibria. Third, novel weight update laws for the critic, action, and disturbance networks are given, and all parameters are tuned online. Fourth, Lyapunov techniques are used to show that the system state and all neural network weight estimation errors are uniformly ultimately bounded. Further, it is shown that the output of the action network approaches the optimal control input with a small bounded error, and that the output of the disturbance network approaches the worst-case disturbance with a small bounded error. Finally, a numerical example is given to demonstrate the effectiveness of the proposed method.
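To make the saddle-point idea in the abstract concrete, the following is a minimal sketch of the simplest special case, not the authors' neural-network method. It assumes a scalar linear system x_{k+1} = a*x_k + b*u_k + d*w_k with stage cost q*x^2 + r*u^2 - gamma^2*w^2, where u is the control, w is the disturbance, and gamma is the H∞ attenuation level. The function name, the parameter names, and the numerical values are all hypothetical choices made for illustration. In this case the value function is V(x) = p*x^2, so the game equation reduces to a scalar recursion in p that can be solved by value iteration.

```python
def solve_scalar_zero_sum_game(a, b, d, q, r, gamma, iters=500, tol=1e-12):
    """Value iteration for V(x) = p * x**2 in a scalar linear-quadratic
    zero-sum game (illustrative special case, assumed for this sketch).

    Dynamics:   x_next = a*x + b*u + d*w
    Stage cost: q*x**2 + r*u**2 - gamma**2 * w**2
    The control u minimizes and the disturbance w maximizes.
    """
    p = 0.0
    for _ in range(iters):
        # The inner maximization over w is concave only if gamma^2 > p*d^2.
        if gamma**2 - p * d**2 <= 0:
            raise ValueError("gamma is too small: no saddle point exists")
        delta = 1.0 + p * (b**2 / r - d**2 / gamma**2)
        # One backup of min over u, max over w of (stage cost + p * x_next^2).
        p_next = q + a**2 * p / delta
        converged = abs(p_next - p) < tol
        p = p_next
        if converged:
            break
    delta = 1.0 + p * (b**2 / r - d**2 / gamma**2)
    k_u = -p * a * b / (r * delta)          # saddle-point control: u = k_u * x
    k_w = p * a * d / (gamma**2 * delta)    # worst-case disturbance: w = k_w * x
    return p, k_u, k_w


# Hypothetical parameter values chosen only for illustration.
p, k_u, k_w = solve_scalar_zero_sum_game(a=0.9, b=1.0, d=0.5, q=1.0, r=1.0, gamma=2.0)
closed_loop_pole = 0.9 + 1.0 * k_u + 0.5 * k_w
print(f"p = {p:.4f}, k_u = {k_u:.4f}, k_w = {k_w:.4f}, pole = {closed_loop_pole:.4f}")
```

In the nonlinear setting of the paper no such closed form is available, which is why the value function, the control, and the disturbance are each approximated by a neural network (critic, action, and disturbance networks) and tuned online.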