Multi-station multi-robot task assignment method based on deep reinforcement learning
Authors: Junnan Zhang, Ke Wang, Chaoxu Mu
Journal: CAAI Transactions on Intelligence Technology, vol. 10, no. 1, pp. 134-146 (Impact Factor 8.4; JCR Q1, Computer Science, Artificial Intelligence)
Published: 2024-11-07
DOI: 10.1049/cit2.12394
URL: https://onlinelibrary.wiley.com/doi/10.1049/cit2.12394
Citations: 0
Abstract
This paper addresses the problem of multi-station, multi-robot spot welding task assignment and proposes a deep reinforcement learning (DRL) framework consisting of a public (shared) graph attention network and independent policy networks. The graph attention network encodes the graph of the welding spot distribution. The independent policy networks, which use an attention mechanism as the decoder, process the encoded graph and assign robots to tasks. In this way, the policy networks convert the large-scale welding spot allocation problem into multiple small-scale single-robot welding path planning problems, each of which is solved quickly with existing methods. The model is trained through reinforcement learning. In addition, a task balancing method is used to allocate tasks across multiple stations. The proposed algorithm is compared with classical algorithms, and the results show that the DRL-based algorithm can produce higher-quality solutions.
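The decomposition described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `assignment` list stands in for the output of the policy networks, the nearest-neighbour heuristic is a placeholder for the "existing methods" used for single-robot path planning, and the makespan objective, the robot base positions, and all function names are assumptions made for illustration.

```python
import math

def path_length(start, spots):
    """Length of a greedy nearest-neighbour path from `start` through all `spots`.

    Placeholder for any existing single-robot path planner (assumption).
    """
    pos, remaining, total = start, list(spots), 0.0
    while remaining:
        nxt = min(remaining, key=lambda s: math.dist(pos, s))
        total += math.dist(pos, nxt)
        remaining.remove(nxt)
        pos = nxt
    return total

def decompose(spots, assignment, n_robots):
    """Split the welding spots into one sub-problem per robot.

    `assignment[i]` is the robot index chosen for `spots[i]`; in the paper this
    decision comes from the policy networks, here it is given by hand.
    """
    groups = [[] for _ in range(n_robots)]
    for spot, robot in zip(spots, assignment):
        groups[robot].append(spot)
    return groups

def makespan(spots, assignment, robot_bases):
    """Longest single-robot path length (an assumed quality measure)."""
    groups = decompose(spots, assignment, len(robot_bases))
    return max(path_length(base, g) for base, g in zip(robot_bases, groups))

if __name__ == "__main__":
    spots = [(0, 1), (0, 2), (5, 1), (5, 3)]
    bases = [(0, 0), (5, 0)]
    print(makespan(spots, [0, 0, 1, 1], bases))  # spots grouped by proximity
    print(makespan(spots, [0, 1, 0, 1], bases))  # spots interleaved between robots
```

Under these assumptions, an assignment that groups nearby spots on the same robot gives a shorter longest path than one that interleaves them, which is the kind of signal a learned assignment policy could be rewarded on.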
About the journal:
CAAI Transactions on Intelligence Technology is a leading venue for original research on the theoretical and experimental aspects of artificial intelligence technology. We are a fully open access journal, co-published by the Institution of Engineering and Technology (IET) and the Chinese Association for Artificial Intelligence (CAAI), whose research is openly accessible to read and share worldwide.