{"title":"Adaptive optimal control for a class of nonlinear partially uncertain dynamic systems via policy iteration","authors":"Derong Liu, Xiong Yang, Hongliang Li","doi":"10.1109/ICICIP.2012.6391520","DOIUrl":null,"url":null,"abstract":"In this paper, by employing an online algorithm based on policy iteration (PI), an adaptive optimal control problem for continuous-time (CT) nonlinear partially uncertain dynamic systems is investigated. In this proposed algorithm, a discounted cost function is discussed, which is considered to be a more general case for optimal control problems. Two neural networks (NNs) are used to implement the algorithm, which aims at approximating the cost function and the control law, respectively. The uniform convergence to the optimal control is proven, and the stability of the system is guaranteed. An illustrating example is given.","PeriodicalId":376265,"journal":{"name":"2012 Third International Conference on Intelligent Control and Information Processing","volume":"199 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Third International Conference on Intelligent Control and Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICIP.2012.6391520","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, by employing an online algorithm based on policy iteration (PI), an adaptive optimal control problem for continuous-time (CT) nonlinear partially uncertain dynamic systems is investigated. In this proposed algorithm, a discounted cost function is discussed, which is considered to be a more general case for optimal control problems. Two neural networks (NNs) are used to implement the algorithm, which aims at approximating the cost function and the control law, respectively. The uniform convergence to the optimal control is proven, and the stability of the system is guaranteed. An illustrating example is given.