{"title":"基于混合迭代的离散线性系统自适应最优控制","authors":"Omar Qasem, Weinan Gao, Hector M. Gutierrez","doi":"10.1109/DDCLS58216.2023.10167108","DOIUrl":null,"url":null,"abstract":"In this paper, a novel dynamic programming (DP) and adaptive dynamic programming (ADP) algorithms are proposed, namely hybrid iteration (HI), for discrete-time linear systems. The proposed HI approach fill up the performance gap of two well- known DP algorithms, i.e., policy iteration (PI) and value iteration (VI). In particular, HI drops the need of the prior knowledge of an initial stabilizing control policy required in PI, and at the same time it maintains a fast quadratic convergence rate compared with VI. A data-driven adaptive optimal controller design is also proposed based on the proposed HI algorithm. Simulation results for randomly generated discrete-time linear systems with different system orders demonstrate that the proposed HI approach can significantly save CPU time and reduce the number of learning iterations to converge to the optimal solution comparing with the VI approach. The data-driven HI method is implemented to an application of turbocharged diesel engine with exhaust gas recirculation, and the simulation results illustrate the efficacy of the proposed HI method.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"130 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adaptive Optimal Control for Discrete-Time Linear Systems via Hybrid Iteration\",\"authors\":\"Omar Qasem, Weinan Gao, Hector M. Gutierrez\",\"doi\":\"10.1109/DDCLS58216.2023.10167108\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a novel dynamic programming (DP) and adaptive dynamic programming (ADP) algorithms are proposed, namely hybrid iteration (HI), for discrete-time linear systems. The proposed HI approach fill up the performance gap of two well- known DP algorithms, i.e., policy iteration (PI) and value iteration (VI). In particular, HI drops the need of the prior knowledge of an initial stabilizing control policy required in PI, and at the same time it maintains a fast quadratic convergence rate compared with VI. A data-driven adaptive optimal controller design is also proposed based on the proposed HI algorithm. Simulation results for randomly generated discrete-time linear systems with different system orders demonstrate that the proposed HI approach can significantly save CPU time and reduce the number of learning iterations to converge to the optimal solution comparing with the VI approach. The data-driven HI method is implemented to an application of turbocharged diesel engine with exhaust gas recirculation, and the simulation results illustrate the efficacy of the proposed HI method.\",\"PeriodicalId\":415532,\"journal\":{\"name\":\"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)\",\"volume\":\"130 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DDCLS58216.2023.10167108\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DDCLS58216.2023.10167108","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Adaptive Optimal Control for Discrete-Time Linear Systems via Hybrid Iteration
In this paper, a novel dynamic programming (DP) and adaptive dynamic programming (ADP) algorithms are proposed, namely hybrid iteration (HI), for discrete-time linear systems. The proposed HI approach fill up the performance gap of two well- known DP algorithms, i.e., policy iteration (PI) and value iteration (VI). In particular, HI drops the need of the prior knowledge of an initial stabilizing control policy required in PI, and at the same time it maintains a fast quadratic convergence rate compared with VI. A data-driven adaptive optimal controller design is also proposed based on the proposed HI algorithm. Simulation results for randomly generated discrete-time linear systems with different system orders demonstrate that the proposed HI approach can significantly save CPU time and reduce the number of learning iterations to converge to the optimal solution comparing with the VI approach. The data-driven HI method is implemented to an application of turbocharged diesel engine with exhaust gas recirculation, and the simulation results illustrate the efficacy of the proposed HI method.