Reinforcement Learning Control for Consensus of the Leader-Follower Multi-Agent Systems

2018 IEEE 7th Data Driven Control and Learning Systems Conference (DDCLS), Pub Date: 2018-05-01, DOI: 10.1109/DDCLS.2018.8516035

M. Chiang, An-Sheng Liu, L. Fu

Pages: 1152-1157

Citations: 0

Abstract



This paper considers the optimal consensus problem for multi-agent systems using reinforcement learning control. The system is nonlinear, and the number of agents can be large. The control objective is to design a controller for each agent such that all agents reach consensus with the leader agent. The controller is realized with an actor-critic network trained by the deterministic policy gradient method. The policy iteration algorithm is discussed, and simulations are provided to validate the results.
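The abstract does not spell out how "consensus with the leader" is measured. As a minimal sketch, assuming the standard local neighborhood tracking error from the consensus literature (not confirmed as the formulation used in this paper), each follower i could evaluate e_i = Σ_j a_ij (x_i − x_j) + b_i (x_i − x_0), and a quadratic cost on these errors could serve as the negative reward for an actor-critic learner. The function names, the scalar states, the line graph, and the weight `control_weight` below are all hypothetical choices made for illustration.

```python
# Hypothetical sketch: standard leader-follower consensus error.
# The formulation, names, and numbers are assumptions, not taken from the paper.

def local_consensus_errors(states, leader_state, adjacency, pinning):
    """Return e_i = sum_j a_ij * (x_i - x_j) + b_i * (x_i - x_0) for each follower i."""
    errors = []
    for i, x_i in enumerate(states):
        # Disagreement with neighboring followers, weighted by the adjacency row.
        neighbor_term = sum(a_ij * (x_i - x_j) for a_ij, x_j in zip(adjacency[i], states))
        # Disagreement with the leader, nonzero only if follower i observes it.
        leader_term = pinning[i] * (x_i - leader_state)
        errors.append(neighbor_term + leader_term)
    return errors


def stage_cost(errors, controls, control_weight=0.1):
    """Quadratic cost on errors and control effort; its negative could act as an RL reward."""
    return sum(e * e for e in errors) + control_weight * sum(u * u for u in controls)


# Three followers on a line graph 1-2-3; only follower 1 observes the leader.
adjacency = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
pinning = [1, 0, 0]
states = [1.0, 2.0, 4.0]
leader_state = 0.0

errors = local_consensus_errors(states, leader_state, adjacency, pinning)
cost = stage_cost(errors, controls=[0.0, 0.0, 0.0])
print(errors, cost)
```

When every follower state equals the leader state, all errors are zero and the cost reduces to the control-effort term, which is the condition the control objective describes. Each error uses only information from a follower's neighbors, so the computation stays local as the number of agents grows.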

