{"title":"Data-driven optimal control for a class of unknown continuous-time nonlinear system using a novel ADP method","authors":"Kun Zhang, Huaguang Zhang, He Jiang, Chong Liu","doi":"10.1109/ICICIP.2016.7885887","DOIUrl":null,"url":null,"abstract":"This paper is concerned with the optimal control problem for a class of unknown continuous-time nonlinear system. A system identification method by date-driven model is established to reconstruct the unknown system dynamic by the input-output data. Then considering the optimal control problem, a novel critic neural networks design is proposed based on the policy iteration (PI), where the updating laws of parameters are designed by the normalized gradient descent algorithm and convex optimization method. And the computational burden of cost error get reduced during the iteration procedure using the new method. Based on this adaptive dynamic programming algorithm, the weight convergence is obtained and stability is guaranteed by Lyapunov theory. Finally, two simulation examples are shown to verify the effectiveness of this novel method.","PeriodicalId":226381,"journal":{"name":"2016 Seventh International Conference on Intelligent Control and Information Processing (ICICIP)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Seventh International Conference on Intelligent Control and Information Processing (ICICIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICIP.2016.7885887","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper is concerned with the optimal control problem for a class of unknown continuous-time nonlinear system. A system identification method by date-driven model is established to reconstruct the unknown system dynamic by the input-output data. Then considering the optimal control problem, a novel critic neural networks design is proposed based on the policy iteration (PI), where the updating laws of parameters are designed by the normalized gradient descent algorithm and convex optimization method. And the computational burden of cost error get reduced during the iteration procedure using the new method. Based on this adaptive dynamic programming algorithm, the weight convergence is obtained and stability is guaranteed by Lyapunov theory. Finally, two simulation examples are shown to verify the effectiveness of this novel method.