{"title":"Data-based adaptive critic design for discrete-time zero-sum games using output feedback","authors":"Lili Cui, Huaguang Zhang, Xin Zhang, Yanhong Luo","doi":"10.1109/ADPRL.2011.5967351","DOIUrl":null,"url":null,"abstract":"A novel data-based adaptive critic design (ACD) using output feedback is proposed for discrete-time zero-sum games in this paper. The proposed data-based adaptive critic design (ACD) is actually a direct adaptive output feedback control scheme. The main contribution of this paper is that not only knowledge of system model but also information of system states are not required. Only the data measured from input and output are required for reaching the saddle point of the zero-sum games by using proposed data-based iterative ACD algorithm. Moreover, the property of the proposed data-based iterative ACD algorithm is discussed. Simulation results demonstrate satisfactory performance of the proposed controller","PeriodicalId":406195,"journal":{"name":"2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ADPRL.2011.5967351","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
A novel data-based adaptive critic design (ACD) using output feedback is proposed for discrete-time zero-sum games in this paper. The proposed data-based adaptive critic design (ACD) is actually a direct adaptive output feedback control scheme. The main contribution of this paper is that not only knowledge of system model but also information of system states are not required. Only the data measured from input and output are required for reaching the saddle point of the zero-sum games by using proposed data-based iterative ACD algorithm. Moreover, the property of the proposed data-based iterative ACD algorithm is discussed. Simulation results demonstrate satisfactory performance of the proposed controller