{"title":"连续时间非线性最优控制的在线单网络自适应算法","authors":"Jae Young Lee, Jin Bae Park, Y. Choi, Keun Uk Lee","doi":"10.1109/ICCAS.2013.6704205","DOIUrl":null,"url":null,"abstract":"In this paper, we propose an online adaptive neural-algorithm to solve the CT nonlinear optimal control problems. Compared to the existing methods, which adopt the architecture with two neural networks (NNs) for actor-critic implementations, only one NN for critic is used to implement the algorithm, simplifying the structure of the computation model. Moreover, we also provide a generalized learning rule for updating the NN weights, which covers the existing critic update rules as special cases. The theoretical and numerical results are given under the required persistent excitation condition to verify and analyze stability and performance of the proposed method.","PeriodicalId":415263,"journal":{"name":"2013 13th International Conference on Control, Automation and Systems (ICCAS 2013)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An online single-network adaptive algorithm for continuous-time nonlinear optimal control\",\"authors\":\"Jae Young Lee, Jin Bae Park, Y. Choi, Keun Uk Lee\",\"doi\":\"10.1109/ICCAS.2013.6704205\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose an online adaptive neural-algorithm to solve the CT nonlinear optimal control problems. Compared to the existing methods, which adopt the architecture with two neural networks (NNs) for actor-critic implementations, only one NN for critic is used to implement the algorithm, simplifying the structure of the computation model. Moreover, we also provide a generalized learning rule for updating the NN weights, which covers the existing critic update rules as special cases. The theoretical and numerical results are given under the required persistent excitation condition to verify and analyze stability and performance of the proposed method.\",\"PeriodicalId\":415263,\"journal\":{\"name\":\"2013 13th International Conference on Control, Automation and Systems (ICCAS 2013)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 13th International Conference on Control, Automation and Systems (ICCAS 2013)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCAS.2013.6704205\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 13th International Conference on Control, Automation and Systems (ICCAS 2013)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAS.2013.6704205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An online single-network adaptive algorithm for continuous-time nonlinear optimal control
In this paper, we propose an online adaptive neural-algorithm to solve the CT nonlinear optimal control problems. Compared to the existing methods, which adopt the architecture with two neural networks (NNs) for actor-critic implementations, only one NN for critic is used to implement the algorithm, simplifying the structure of the computation model. Moreover, we also provide a generalized learning rule for updating the NN weights, which covers the existing critic update rules as special cases. The theoretical and numerical results are given under the required persistent excitation condition to verify and analyze stability and performance of the proposed method.