{"title":"Continuous-time Value-Iteration-Based Learning for Constrained-Input Nonlinear Nonzero-Sum Game","authors":"Geyang Xiao, Yuan Liang, Linlin Yan, Xiaoyu Yi, Congqi Shen, Huifeng Zhang","doi":"10.1109/DOCS55193.2022.9967754","DOIUrl":null,"url":null,"abstract":"A continuous-time value iteration based learning method is proposed for constrained-input nonlinear nonzero-sum game in this paper. Most existing studies were based on policy iteration, and thus they require an initial admissible control policy as the initial condition or some proper control policy to make the states satisfy the persistent excitation (PE) condition. However, no mater the initial admissible control policy nor a PE satisfied control policy, they can not be derived by a general feasible way. Such difficulty of choosing control policy may limit the actual application. The proposed method is developed based on value iteration and the requirement of choosing proper control policy can be avoided. Moreover, since the control signal should always be designed within limits in practice, the constrained-input property is taken into consideration. Simulation results are displayed to show the effectiveness.","PeriodicalId":348545,"journal":{"name":"2022 4th International Conference on Data-driven Optimization of Complex Systems (DOCS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th International Conference on Data-driven Optimization of Complex Systems (DOCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DOCS55193.2022.9967754","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A continuous-time value iteration based learning method is proposed for constrained-input nonlinear nonzero-sum game in this paper. Most existing studies were based on policy iteration, and thus they require an initial admissible control policy as the initial condition or some proper control policy to make the states satisfy the persistent excitation (PE) condition. However, no mater the initial admissible control policy nor a PE satisfied control policy, they can not be derived by a general feasible way. Such difficulty of choosing control policy may limit the actual application. The proposed method is developed based on value iteration and the requirement of choosing proper control policy can be avoided. Moreover, since the control signal should always be designed within limits in practice, the constrained-input property is taken into consideration. Simulation results are displayed to show the effectiveness.