{"title":"领导-随从多智能体系统共识的强化学习控制","authors":"M. Chiang, An-Sheng Liu, L. Fu","doi":"10.1109/DDCLS.2018.8516035","DOIUrl":null,"url":null,"abstract":"This paper considers the optimal consensus of multi-agent systems using reinforcement learning control. The system is nonlinear and the number of agents can be large. The control objective is to design the controllers for each agent such that all the agents will be consensus to the leader agent. We use the Actor-Critic Network and the Deterministic Policy Gradient method to realize the controller. The policy iteration algorithm is discussed and many simulations are provided to validate the result.","PeriodicalId":6565,"journal":{"name":"2018 IEEE 7th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"20 1","pages":"1152-1157"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement Learning Control for Consensus of the Leader-Follower Multi-Agent Systems\",\"authors\":\"M. Chiang, An-Sheng Liu, L. Fu\",\"doi\":\"10.1109/DDCLS.2018.8516035\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper considers the optimal consensus of multi-agent systems using reinforcement learning control. The system is nonlinear and the number of agents can be large. The control objective is to design the controllers for each agent such that all the agents will be consensus to the leader agent. We use the Actor-Critic Network and the Deterministic Policy Gradient method to realize the controller. The policy iteration algorithm is discussed and many simulations are provided to validate the result.\",\"PeriodicalId\":6565,\"journal\":{\"name\":\"2018 IEEE 7th Data Driven Control and Learning Systems Conference (DDCLS)\",\"volume\":\"20 1\",\"pages\":\"1152-1157\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 7th Data Driven Control and Learning Systems Conference (DDCLS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DDCLS.2018.8516035\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 7th Data Driven Control and Learning Systems Conference (DDCLS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DDCLS.2018.8516035","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement Learning Control for Consensus of the Leader-Follower Multi-Agent Systems
This paper considers the optimal consensus of multi-agent systems using reinforcement learning control. The system is nonlinear and the number of agents can be large. The control objective is to design the controllers for each agent such that all the agents will be consensus to the leader agent. We use the Actor-Critic Network and the Deterministic Policy Gradient method to realize the controller. The policy iteration algorithm is discussed and many simulations are provided to validate the result.