Multi-objective cooperative transportation for reconfigurable robot using isomorphic mapping multi-agent reinforcement learning

IF 3.1 3区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS

Mechatronics Pub Date : 2024-05-21 DOI:10.1016/j.mechatronics.2024.103206

Ruqing Zhao, Fusheng Li, Xin Lu, Shubin Lyu

{"title":"Multi-objective cooperative transportation for reconfigurable robot using isomorphic mapping multi-agent reinforcement learning","authors":"Ruqing Zhao, Fusheng Li, Xin Lu, Shubin Lyu","doi":"10.1016/j.mechatronics.2024.103206","DOIUrl":null,"url":null,"abstract":"<div><p>In this paper, we propose an Isomorphic Mapping Reconfigurable Multi-Agent Reinforcement Learning (IM-RMARL) framework, which is suitable for decision-making scenarios in reconfigurable multi-agent reinforcement learning. This method holds promising applications in fields such as logistics transportation systems and disaster relief. Classical multi-agent frameworks typically assume that the number of agents is fixed and remains constant throughout the training process. However, in practical applications involving reconfigurable robots, the number of agents may vary over time or according to task requirements. Additionally, classical frameworks often assume easy access to abundant experience data for training and optimization. However, in reconfigurable robot clusters, this assumption may not hold true as not all combinations exist within a single episode. Our approach effectively addresses these challenges by integrating agent mapping mechanisms and similar type of intelligent agents’ experience sharing mechanisms, which aid in handling dynamic agent counts and limited experience data. Our experimental results demonstrate the effectiveness of the proposed framework, the Utilization Rate of Transport Capacity of the IM-RMARL group reaches 0.82, and the Task Completion Rate reaches 0.92.</p></div>","PeriodicalId":49842,"journal":{"name":"Mechatronics","volume":"101 ","pages":"Article 103206"},"PeriodicalIF":3.1000,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mechatronics","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957415824000710","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

In this paper, we propose an Isomorphic Mapping Reconfigurable Multi-Agent Reinforcement Learning (IM-RMARL) framework, which is suitable for decision-making scenarios in reconfigurable multi-agent reinforcement learning. This method holds promising applications in fields such as logistics transportation systems and disaster relief. Classical multi-agent frameworks typically assume that the number of agents is fixed and remains constant throughout the training process. However, in practical applications involving reconfigurable robots, the number of agents may vary over time or according to task requirements. Additionally, classical frameworks often assume easy access to abundant experience data for training and optimization. However, in reconfigurable robot clusters, this assumption may not hold true as not all combinations exist within a single episode. Our approach effectively addresses these challenges by integrating agent mapping mechanisms and similar type of intelligent agents’ experience sharing mechanisms, which aid in handling dynamic agent counts and limited experience data. Our experimental results demonstrate the effectiveness of the proposed framework, the Utilization Rate of Transport Capacity of the IM-RMARL group reaches 0.82, and the Task Completion Rate reaches 0.92.

查看原文本刊更多论文

利用同构映射多代理强化学习实现可重构机器人的多目标协同运输

本文提出了一种同构映射可重构多代理强化学习（IM-RMARL）框架，适用于可重构多代理强化学习中的决策场景。这种方法在物流运输系统和救灾等领域有着广阔的应用前景。经典的多代理框架通常假设代理的数量是固定的，并且在整个训练过程中保持不变。然而，在涉及可重构机器人的实际应用中，代理的数量可能会随时间或任务要求而变化。此外，经典框架通常假定可以轻松获取丰富的经验数据，用于训练和优化。然而，在可重构机器人集群中，这一假设可能并不成立，因为并非所有组合都存在于单个事件中。我们的方法整合了代理映射机制和类似类型的智能代理经验共享机制，有助于处理动态代理数量和有限的经验数据，从而有效地应对了这些挑战。我们的实验结果证明了建议框架的有效性，IM-RMARL 组的运输能力利用率达到 0.82，任务完成率达到 0.92。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Mechatronics 工程技术-工程：电子与电气

CiteScore

5.90

自引率

9.10%

发文量

审稿时长

109 days

期刊介绍： Mechatronics is the synergistic combination of precision mechanical engineering, electronic control and systems thinking in the design of products and manufacturing processes. It relates to the design of systems, devices and products aimed at achieving an optimal balance between basic mechanical structure and its overall control. The purpose of this journal is to provide rapid publication of topical papers featuring practical developments in mechatronics. It will cover a wide range of application areas including consumer product design, instrumentation, manufacturing methods, computer integration and process and device control, and will attract a readership from across the industrial and academic research spectrum. Particular importance will be attached to aspects of innovation in mechatronics design philosophy which illustrate the benefits obtainable by an a priori integration of functionality with embedded microprocessor control. A major item will be the design of machines, devices and systems possessing a degree of computer based intelligence. The journal seeks to publish research progress in this field with an emphasis on the applied rather than the theoretical. It will also serve the dual role of bringing greater recognition to this important area of engineering.