使用输出反馈的离散时间零和博弈的基于数据的自适应批评设计

2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL) Pub Date : 2011-04-11 DOI:10.1109/ADPRL.2011.5967351

Lili Cui, Huaguang Zhang, Xin Zhang, Yanhong Luo

{"title":"使用输出反馈的离散时间零和博弈的基于数据的自适应批评设计","authors":"Lili Cui, Huaguang Zhang, Xin Zhang, Yanhong Luo","doi":"10.1109/ADPRL.2011.5967351","DOIUrl":null,"url":null,"abstract":"A novel data-based adaptive critic design (ACD) using output feedback is proposed for discrete-time zero-sum games in this paper. The proposed data-based adaptive critic design (ACD) is actually a direct adaptive output feedback control scheme. The main contribution of this paper is that not only knowledge of system model but also information of system states are not required. Only the data measured from input and output are required for reaching the saddle point of the zero-sum games by using proposed data-based iterative ACD algorithm. Moreover, the property of the proposed data-based iterative ACD algorithm is discussed. Simulation results demonstrate satisfactory performance of the proposed controller","PeriodicalId":406195,"journal":{"name":"2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Data-based adaptive critic design for discrete-time zero-sum games using output feedback\",\"authors\":\"Lili Cui, Huaguang Zhang, Xin Zhang, Yanhong Luo\",\"doi\":\"10.1109/ADPRL.2011.5967351\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A novel data-based adaptive critic design (ACD) using output feedback is proposed for discrete-time zero-sum games in this paper. The proposed data-based adaptive critic design (ACD) is actually a direct adaptive output feedback control scheme. The main contribution of this paper is that not only knowledge of system model but also information of system states are not required. Only the data measured from input and output are required for reaching the saddle point of the zero-sum games by using proposed data-based iterative ACD algorithm. Moreover, the property of the proposed data-based iterative ACD algorithm is discussed. Simulation results demonstrate satisfactory performance of the proposed controller\",\"PeriodicalId\":406195,\"journal\":{\"name\":\"2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ADPRL.2011.5967351\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ADPRL.2011.5967351","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

摘要

针对离散零和博弈问题，提出了一种基于数据的输出反馈自适应批评设计。提出的基于数据的自适应批评设计(ACD)实际上是一种直接自适应输出反馈控制方案。本文的主要贡献在于不仅不需要系统模型知识，而且不需要系统状态信息。利用本文提出的基于数据的迭代ACD算法，只需要从输入和输出测量到的数据就可以到达零和博弈的鞍点。此外，还讨论了基于数据的迭代ACD算法的性质。仿真结果表明该控制器具有良好的控制性能

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Data-based adaptive critic design for discrete-time zero-sum games using output feedback

A novel data-based adaptive critic design (ACD) using output feedback is proposed for discrete-time zero-sum games in this paper. The proposed data-based adaptive critic design (ACD) is actually a direct adaptive output feedback control scheme. The main contribution of this paper is that not only knowledge of system model but also information of system states are not required. Only the data measured from input and output are required for reaching the saddle point of the zero-sum games by using proposed data-based iterative ACD algorithm. Moreover, the property of the proposed data-based iterative ACD algorithm is discussed. Simulation results demonstrate satisfactory performance of the proposed controller

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)

自引率

0.00%

发文量