{"title":"基于多智能体深度强化学习的决策方法","authors":"Weiwei Bian, Chunguang Wang, Chan Liu, Kuihua Huang, Ying Mi, Yanxiang Jia","doi":"10.1109/ICUS55513.2022.9987201","DOIUrl":null,"url":null,"abstract":"Based on the decision-making architecture of information pooling and sharing in the hidden layer, the communication protocol is set manually, and the pooling method is used to integrate the information. Although the problem of communication and extension between agents is solved, it is difficult for tasks lacking prior knowledge to design effective communication protocols. The centralized decision- making architecture based on two-way RNN communication uses the information storage characteristics of two-way RNN structure. It can self learn the communication protocol between agents, which overcomes the rigid requirement of task prior knowledge in communication protocol design. The action distribution of a single agent is used as the output of the multi- agent network to replace the joint action distribution, and the global state information in the environment is used as the input instead of simply inputting the local information to different agents. The effectiveness of the method is verified by an example.","PeriodicalId":345773,"journal":{"name":"2022 IEEE International Conference on Unmanned Systems (ICUS)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Decision-making Method Based on Multi-agent Deep Reinforcement Learning\",\"authors\":\"Weiwei Bian, Chunguang Wang, Chan Liu, Kuihua Huang, Ying Mi, Yanxiang Jia\",\"doi\":\"10.1109/ICUS55513.2022.9987201\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Based on the decision-making architecture of information pooling and sharing in the hidden layer, the communication protocol is set manually, and the pooling method is used to integrate the information. Although the problem of communication and extension between agents is solved, it is difficult for tasks lacking prior knowledge to design effective communication protocols. The centralized decision- making architecture based on two-way RNN communication uses the information storage characteristics of two-way RNN structure. It can self learn the communication protocol between agents, which overcomes the rigid requirement of task prior knowledge in communication protocol design. The action distribution of a single agent is used as the output of the multi- agent network to replace the joint action distribution, and the global state information in the environment is used as the input instead of simply inputting the local information to different agents. The effectiveness of the method is verified by an example.\",\"PeriodicalId\":345773,\"journal\":{\"name\":\"2022 IEEE International Conference on Unmanned Systems (ICUS)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Unmanned Systems (ICUS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICUS55513.2022.9987201\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Unmanned Systems (ICUS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICUS55513.2022.9987201","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Decision-making Method Based on Multi-agent Deep Reinforcement Learning
Based on the decision-making architecture of information pooling and sharing in the hidden layer, the communication protocol is set manually, and the pooling method is used to integrate the information. Although the problem of communication and extension between agents is solved, it is difficult for tasks lacking prior knowledge to design effective communication protocols. The centralized decision- making architecture based on two-way RNN communication uses the information storage characteristics of two-way RNN structure. It can self learn the communication protocol between agents, which overcomes the rigid requirement of task prior knowledge in communication protocol design. The action distribution of a single agent is used as the output of the multi- agent network to replace the joint action distribution, and the global state information in the environment is used as the input instead of simply inputting the local information to different agents. The effectiveness of the method is verified by an example.