{"title":"基于启发式动态规划的非线性切换系统零和博弈最优控制","authors":"Xingjian Fu, Zizheng Li","doi":"10.1002/oca.3005","DOIUrl":null,"url":null,"abstract":"In this article, the zero‐sum game problem of control and disturbance is studied for discrete affine nonlinear switched systems with unknown dynamical models. A two‐level heuristic dynamic programming iterative algorithm for solving the zero‐sum game problem is proposed, which can be used to solve the Hamilton‐Jacobi‐Isaacs equation associated with the optimal regulation control problem. The convergence analysis based on the value function and control strategy is given. For algorithm implementation, neural networks are used to approximate the control strategy, disturbance strategy, and value function of the game dynamic programming, respectively. A model network is used to approximate the system states with an unknown dynamical model. The iterative algorithm steps are given. Finally, the validity of the method is verified by simulation examples.","PeriodicalId":105945,"journal":{"name":"Optimal Control Applications and Methods","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Zero‐sum game optimal control for the nonlinear switched systems based on heuristic dynamic programming\",\"authors\":\"Xingjian Fu, Zizheng Li\",\"doi\":\"10.1002/oca.3005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this article, the zero‐sum game problem of control and disturbance is studied for discrete affine nonlinear switched systems with unknown dynamical models. A two‐level heuristic dynamic programming iterative algorithm for solving the zero‐sum game problem is proposed, which can be used to solve the Hamilton‐Jacobi‐Isaacs equation associated with the optimal regulation control problem. The convergence analysis based on the value function and control strategy is given. For algorithm implementation, neural networks are used to approximate the control strategy, disturbance strategy, and value function of the game dynamic programming, respectively. A model network is used to approximate the system states with an unknown dynamical model. The iterative algorithm steps are given. Finally, the validity of the method is verified by simulation examples.\",\"PeriodicalId\":105945,\"journal\":{\"name\":\"Optimal Control Applications and Methods\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Optimal Control Applications and Methods\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/oca.3005\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optimal Control Applications and Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/oca.3005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Zero‐sum game optimal control for the nonlinear switched systems based on heuristic dynamic programming
In this article, the zero‐sum game problem of control and disturbance is studied for discrete affine nonlinear switched systems with unknown dynamical models. A two‐level heuristic dynamic programming iterative algorithm for solving the zero‐sum game problem is proposed, which can be used to solve the Hamilton‐Jacobi‐Isaacs equation associated with the optimal regulation control problem. The convergence analysis based on the value function and control strategy is given. For algorithm implementation, neural networks are used to approximate the control strategy, disturbance strategy, and value function of the game dynamic programming, respectively. A model network is used to approximate the system states with an unknown dynamical model. The iterative algorithm steps are given. Finally, the validity of the method is verified by simulation examples.