{"title":"非线性系统有限视界多参与者非零和微分对策的微分动态规划","authors":"Yuqi Zhang, Bin Zhang","doi":"10.1109/DDCLS58216.2023.10166320","DOIUrl":null,"url":null,"abstract":"In this paper, an iterative algorithm based on differential dynamic programming (DDP) is developed to solve the finite-horizon multi-player non-zero-sum (NZS) games. By using the DDP, the coupled Hamilton-Jacobi (HJ) equations are expanded from partial differential forms to higher-order differential forms. By approximating the value functions and optimal control policies through several finite sets of basis functions, the DDP expansions are transformed into algebraic matrix equations in integral forms. Then a policy iteration (PI) algorithm is provided to solve the feedback Nash equilibrium of above NZS games. Finally, two simulation examples are given to demonstrate the feasibility of the developed algorithm.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Differential Dynamic Programming for Finite-Horizon Multi-Player Non-Zero-Sum Differential Games of Nonlinear Systems\",\"authors\":\"Yuqi Zhang, Bin Zhang\",\"doi\":\"10.1109/DDCLS58216.2023.10166320\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, an iterative algorithm based on differential dynamic programming (DDP) is developed to solve the finite-horizon multi-player non-zero-sum (NZS) games. By using the DDP, the coupled Hamilton-Jacobi (HJ) equations are expanded from partial differential forms to higher-order differential forms. By approximating the value functions and optimal control policies through several finite sets of basis functions, the DDP expansions are transformed into algebraic matrix equations in integral forms. Then a policy iteration (PI) algorithm is provided to solve the feedback Nash equilibrium of above NZS games. Finally, two simulation examples are given to demonstrate the feasibility of the developed algorithm.\",\"PeriodicalId\":415532,\"journal\":{\"name\":\"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DDCLS58216.2023.10166320\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DDCLS58216.2023.10166320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Differential Dynamic Programming for Finite-Horizon Multi-Player Non-Zero-Sum Differential Games of Nonlinear Systems
In this paper, an iterative algorithm based on differential dynamic programming (DDP) is developed to solve the finite-horizon multi-player non-zero-sum (NZS) games. By using the DDP, the coupled Hamilton-Jacobi (HJ) equations are expanded from partial differential forms to higher-order differential forms. By approximating the value functions and optimal control policies through several finite sets of basis functions, the DDP expansions are transformed into algebraic matrix equations in integral forms. Then a policy iteration (PI) algorithm is provided to solve the feedback Nash equilibrium of above NZS games. Finally, two simulation examples are given to demonstrate the feasibility of the developed algorithm.