基于Stackelberg博弈和多智能体深度强化学习的退役动力电池人机协同拆卸任务规划

IF 14.2 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL
Xugang Zhang , Chuang Liu , Yong Yue , Qingshan Gong , Feng Ma , Yan Wang
{"title":"基于Stackelberg博弈和多智能体深度强化学习的退役动力电池人机协同拆卸任务规划","authors":"Xugang Zhang ,&nbsp;Chuang Liu ,&nbsp;Yong Yue ,&nbsp;Qingshan Gong ,&nbsp;Feng Ma ,&nbsp;Yan Wang","doi":"10.1016/j.jmsy.2025.07.024","DOIUrl":null,"url":null,"abstract":"<div><div>With the increasing adoption of electric vehicles (EVs), the volume of retired power batteries has surged accordingly. In the context of sustainable development and a circular economy, the disassembly process of retired power battery for reuse is regarded as a crucial approach to addressing resource shortages and environmental pollution. To achieve the efficient disassembly of retired power batteries in human-robot collaborative (HRC) scenarios. Firstly, a mapping network between task units and task executors is established using a multilayer perceptron (MLP) neural network, based on the complexity of the tasks and the state of workers. Secondly, a Stackelberg Double Deep Q-Network (SDDQN) algorithm is proposed by integrating the leader-follower characteristics of the Stackelberg model with the Deep Q-Network (DQN) algorithm to address the task planning problem in HRC disassembly. Finally, the effectiveness of the proposed method is validated through two case studies. Compared to Nash Q-learning, Independent Q-learning, and the conventional DDQN algorithm, it demonstrates superior performance in terms of task completion time and average cumulative rewards. Additionally, the proposed method exhibits strong robustness against unexpected environmental disturbances.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"82 ","pages":"Pages 841-857"},"PeriodicalIF":14.2000,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Human-robot collaborative disassembly task planning for retired power battery based on Stackelberg game and multi-agent deep reinforcement learning\",\"authors\":\"Xugang Zhang ,&nbsp;Chuang Liu ,&nbsp;Yong Yue ,&nbsp;Qingshan Gong ,&nbsp;Feng Ma ,&nbsp;Yan Wang\",\"doi\":\"10.1016/j.jmsy.2025.07.024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With the increasing adoption of electric vehicles (EVs), the volume of retired power batteries has surged accordingly. In the context of sustainable development and a circular economy, the disassembly process of retired power battery for reuse is regarded as a crucial approach to addressing resource shortages and environmental pollution. To achieve the efficient disassembly of retired power batteries in human-robot collaborative (HRC) scenarios. Firstly, a mapping network between task units and task executors is established using a multilayer perceptron (MLP) neural network, based on the complexity of the tasks and the state of workers. Secondly, a Stackelberg Double Deep Q-Network (SDDQN) algorithm is proposed by integrating the leader-follower characteristics of the Stackelberg model with the Deep Q-Network (DQN) algorithm to address the task planning problem in HRC disassembly. Finally, the effectiveness of the proposed method is validated through two case studies. Compared to Nash Q-learning, Independent Q-learning, and the conventional DDQN algorithm, it demonstrates superior performance in terms of task completion time and average cumulative rewards. Additionally, the proposed method exhibits strong robustness against unexpected environmental disturbances.</div></div>\",\"PeriodicalId\":16227,\"journal\":{\"name\":\"Journal of Manufacturing Systems\",\"volume\":\"82 \",\"pages\":\"Pages 841-857\"},\"PeriodicalIF\":14.2000,\"publicationDate\":\"2025-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Manufacturing Systems\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0278612525001992\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, INDUSTRIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Manufacturing Systems","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0278612525001992","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0

摘要

随着电动汽车(ev)的普及,退役动力电池的数量也随之激增。在可持续发展和循环经济的背景下,废旧动力电池的拆解再利用被认为是解决资源短缺和环境污染的重要途径。实现人机协作(HRC)场景下退役动力电池的高效拆卸。首先,根据任务的复杂程度和工作人员的状态,利用多层感知器(MLP)神经网络建立任务单元和任务执行器之间的映射网络;其次,将Stackelberg模型的leader-follower特性与Deep Q-Network (DQN)算法相结合,提出了Stackelberg双深度Q-Network (SDDQN)算法来解决HRC拆卸中的任务规划问题。最后,通过两个案例验证了所提方法的有效性。与Nash Q-learning、Independent Q-learning和传统的DDQN算法相比,它在任务完成时间和平均累积奖励方面表现出更优越的性能。此外,该方法对意外环境干扰具有较强的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Human-robot collaborative disassembly task planning for retired power battery based on Stackelberg game and multi-agent deep reinforcement learning
With the increasing adoption of electric vehicles (EVs), the volume of retired power batteries has surged accordingly. In the context of sustainable development and a circular economy, the disassembly process of retired power battery for reuse is regarded as a crucial approach to addressing resource shortages and environmental pollution. To achieve the efficient disassembly of retired power batteries in human-robot collaborative (HRC) scenarios. Firstly, a mapping network between task units and task executors is established using a multilayer perceptron (MLP) neural network, based on the complexity of the tasks and the state of workers. Secondly, a Stackelberg Double Deep Q-Network (SDDQN) algorithm is proposed by integrating the leader-follower characteristics of the Stackelberg model with the Deep Q-Network (DQN) algorithm to address the task planning problem in HRC disassembly. Finally, the effectiveness of the proposed method is validated through two case studies. Compared to Nash Q-learning, Independent Q-learning, and the conventional DDQN algorithm, it demonstrates superior performance in terms of task completion time and average cumulative rewards. Additionally, the proposed method exhibits strong robustness against unexpected environmental disturbances.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Manufacturing Systems
Journal of Manufacturing Systems 工程技术-工程:工业
CiteScore
23.30
自引率
13.20%
发文量
216
审稿时长
25 days
期刊介绍: The Journal of Manufacturing Systems is dedicated to showcasing cutting-edge fundamental and applied research in manufacturing at the systems level. Encompassing products, equipment, people, information, control, and support functions, manufacturing systems play a pivotal role in the economical and competitive development, production, delivery, and total lifecycle of products, meeting market and societal needs. With a commitment to publishing archival scholarly literature, the journal strives to advance the state of the art in manufacturing systems and foster innovation in crafting efficient, robust, and sustainable manufacturing systems. The focus extends from equipment-level considerations to the broader scope of the extended enterprise. The Journal welcomes research addressing challenges across various scales, including nano, micro, and macro-scale manufacturing, and spanning diverse sectors such as aerospace, automotive, energy, and medical device manufacturing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信