{"title":"Adaptive Optimal Control of Continuous-Time Linear Systems via Hybrid Iteration","authors":"Omar Qasem, Weinan Gao, T. Bian","doi":"10.1109/SSCI50451.2021.9660016","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a novel dynamic programming (DP) algorithm, under the name of hybrid iteration (HI), for continuous-time linear systems. The proposed HI approach combines the advantages of two well-known DP algorithms, i.e., policy iteration (PI) and value iteration (VI). In particular, HI drops the need of an initial stabilizing control policy required in PI, and at the same time it maintains a faster convergence rate compared with VI. Based on the proposed HI algorithm, a data-driven adaptive optimal controller design is also proposed. Simulation results for randomly generated continuous-time linear systems with different system orders demonstrate that the proposed HI approach can save CPU time up to 73% and reduce the number of iterations to converge up to 98% comparing with the VI approach.","PeriodicalId":255763,"journal":{"name":"2021 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Symposium Series on Computational Intelligence (SSCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSCI50451.2021.9660016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In this paper, we propose a novel dynamic programming (DP) algorithm, under the name of hybrid iteration (HI), for continuous-time linear systems. The proposed HI approach combines the advantages of two well-known DP algorithms, i.e., policy iteration (PI) and value iteration (VI). In particular, HI drops the need of an initial stabilizing control policy required in PI, and at the same time it maintains a faster convergence rate compared with VI. Based on the proposed HI algorithm, a data-driven adaptive optimal controller design is also proposed. Simulation results for randomly generated continuous-time linear systems with different system orders demonstrate that the proposed HI approach can save CPU time up to 73% and reduce the number of iterations to converge up to 98% comparing with the VI approach.