基于Stackelberg博弈和多智能体深度强化学习的退役动力电池人机协同拆卸任务规划

IF 14.2 1区工程技术 Q1 ENGINEERING, INDUSTRIAL

Journal of Manufacturing Systems Pub Date : 2025-08-05 DOI:10.1016/j.jmsy.2025.07.024

Xugang Zhang , Chuang Liu , Yong Yue , Qingshan Gong , Feng Ma , Yan Wang

{"title":"基于Stackelberg博弈和多智能体深度强化学习的退役动力电池人机协同拆卸任务规划","authors":"Xugang Zhang , Chuang Liu , Yong Yue , Qingshan Gong , Feng Ma , Yan Wang","doi":"10.1016/j.jmsy.2025.07.024","DOIUrl":null,"url":null,"abstract":"<div><div>With the increasing adoption of electric vehicles (EVs), the volume of retired power batteries has surged accordingly. In the context of sustainable development and a circular economy, the disassembly process of retired power battery for reuse is regarded as a crucial approach to addressing resource shortages and environmental pollution. To achieve the efficient disassembly of retired power batteries in human-robot collaborative (HRC) scenarios. Firstly, a mapping network between task units and task executors is established using a multilayer perceptron (MLP) neural network, based on the complexity of the tasks and the state of workers. Secondly, a Stackelberg Double Deep Q-Network (SDDQN) algorithm is proposed by integrating the leader-follower characteristics of the Stackelberg model with the Deep Q-Network (DQN) algorithm to address the task planning problem in HRC disassembly. Finally, the effectiveness of the proposed method is validated through two case studies. Compared to Nash Q-learning, Independent Q-learning, and the conventional DDQN algorithm, it demonstrates superior performance in terms of task completion time and average cumulative rewards. Additionally, the proposed method exhibits strong robustness against unexpected environmental disturbances.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"82 ","pages":"Pages 841-857"},"PeriodicalIF":14.2000,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Human-robot collaborative disassembly task planning for retired power battery based on Stackelberg game and multi-agent deep reinforcement learning\",\"authors\":\"Xugang Zhang , Chuang Liu , Yong Yue , Qingshan Gong , Feng Ma , Yan Wang\",\"doi\":\"10.1016/j.jmsy.2025.07.024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With the increasing adoption of electric vehicles (EVs), the volume of retired power batteries has surged accordingly. In the context of sustainable development and a circular economy, the disassembly process of retired power battery for reuse is regarded as a crucial approach to addressing resource shortages and environmental pollution. To achieve the efficient disassembly of retired power batteries in human-robot collaborative (HRC) scenarios. Firstly, a mapping network between task units and task executors is established using a multilayer perceptron (MLP) neural network, based on the complexity of the tasks and the state of workers. Secondly, a Stackelberg Double Deep Q-Network (SDDQN) algorithm is proposed by integrating the leader-follower characteristics of the Stackelberg model with the Deep Q-Network (DQN) algorithm to address the task planning problem in HRC disassembly. Finally, the effectiveness of the proposed method is validated through two case studies. Compared to Nash Q-learning, Independent Q-learning, and the conventional DDQN algorithm, it demonstrates superior performance in terms of task completion time and average cumulative rewards. Additionally, the proposed method exhibits strong robustness against unexpected environmental disturbances.</div></div>\",\"PeriodicalId\":16227,\"journal\":{\"name\":\"Journal of Manufacturing Systems\",\"volume\":\"82 \",\"pages\":\"Pages 841-857\"},\"PeriodicalIF\":14.2000,\"publicationDate\":\"2025-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Manufacturing Systems\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0278612525001992\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, INDUSTRIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Manufacturing Systems","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0278612525001992","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}

引用次数: 0

摘要

随着电动汽车（ev）的普及，退役动力电池的数量也随之激增。在可持续发展和循环经济的背景下，废旧动力电池的拆解再利用被认为是解决资源短缺和环境污染的重要途径。实现人机协作（HRC）场景下退役动力电池的高效拆卸。首先，根据任务的复杂程度和工作人员的状态，利用多层感知器（MLP）神经网络建立任务单元和任务执行器之间的映射网络；其次，将Stackelberg模型的leader-follower特性与Deep Q-Network （DQN）算法相结合，提出了Stackelberg双深度Q-Network （SDDQN）算法来解决HRC拆卸中的任务规划问题。最后，通过两个案例验证了所提方法的有效性。与Nash Q-learning、Independent Q-learning和传统的DDQN算法相比，它在任务完成时间和平均累积奖励方面表现出更优越的性能。此外，该方法对意外环境干扰具有较强的鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Human-robot collaborative disassembly task planning for retired power battery based on Stackelberg game and multi-agent deep reinforcement learning

With the increasing adoption of electric vehicles (EVs), the volume of retired power batteries has surged accordingly. In the context of sustainable development and a circular economy, the disassembly process of retired power battery for reuse is regarded as a crucial approach to addressing resource shortages and environmental pollution. To achieve the efficient disassembly of retired power batteries in human-robot collaborative (HRC) scenarios. Firstly, a mapping network between task units and task executors is established using a multilayer perceptron (MLP) neural network, based on the complexity of the tasks and the state of workers. Secondly, a Stackelberg Double Deep Q-Network (SDDQN) algorithm is proposed by integrating the leader-follower characteristics of the Stackelberg model with the Deep Q-Network (DQN) algorithm to address the task planning problem in HRC disassembly. Finally, the effectiveness of the proposed method is validated through two case studies. Compared to Nash Q-learning, Independent Q-learning, and the conventional DDQN algorithm, it demonstrates superior performance in terms of task completion time and average cumulative rewards. Additionally, the proposed method exhibits strong robustness against unexpected environmental disturbances.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Manufacturing Systems 工程技术-工程：工业

CiteScore

23.30

自引率

13.20%

发文量

216

审稿时长

25 days

期刊介绍： The Journal of Manufacturing Systems is dedicated to showcasing cutting-edge fundamental and applied research in manufacturing at the systems level. Encompassing products, equipment, people, information, control, and support functions, manufacturing systems play a pivotal role in the economical and competitive development, production, delivery, and total lifecycle of products, meeting market and societal needs. With a commitment to publishing archival scholarly literature, the journal strives to advance the state of the art in manufacturing systems and foster innovation in crafting efficient, robust, and sustainable manufacturing systems. The focus extends from equipment-level considerations to the broader scope of the extended enterprise. The Journal welcomes research addressing challenges across various scales, including nano, micro, and macro-scale manufacturing, and spanning diverse sectors such as aerospace, automotive, energy, and medical device manufacturing.