Data-based adaptive critic design for discrete-time zero-sum games using output feedback
Authors: Lili Cui, Huaguang Zhang, Xin Zhang, Yanhong Luo
Venue: 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Published: 2011-04-11
DOI: https://doi.org/10.1109/ADPRL.2011.5967351
This paper proposes a novel data-based adaptive critic design (ACD) using output feedback for discrete-time zero-sum games. The proposed design is, in effect, a direct adaptive output feedback control scheme. Its main contribution is that neither knowledge of the system model nor information about the system states is required: the proposed data-based iterative ACD algorithm reaches the saddle point of the zero-sum game using only data measured from the input and output. The properties of the proposed iterative algorithm are also discussed. Simulation results demonstrate satisfactory performance of the proposed controller.
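For readers unfamiliar with the term, the saddle point mentioned in the abstract can be stated in its standard game-theoretic form. The abstract itself gives no formulas, so the notation below is an assumption made for illustration: $J$ is a cost that the control input $u$ minimizes and the opposing input $w$ maximizes, and $(u^*, w^*)$ is the saddle-point pair.

```latex
% Generic saddle-point condition for a two-player zero-sum game.
% Notation (J, u, w) is assumed here; it is not taken from the paper.
\[
  J(u^*, w) \;\le\; J(u^*, w^*) \;\le\; J(u, w^*)
  \qquad \text{for all admissible } u,\ w,
\]
% When such a pair exists, the order of optimization does not matter:
\[
  \min_{u}\,\max_{w}\, J(u, w) \;=\; \max_{w}\,\min_{u}\, J(u, w) \;=\; J(u^*, w^*).
\]
```

Under this reading, neither player can improve its outcome by deviating unilaterally from $(u^*, w^*)$, which is the condition the iterative ACD algorithm is said to reach from input and output data alone.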