基于多智能体双深度q网络的分布式异构混合流车间多目标节能调度

IF 8.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Swarm and Evolutionary Computation Pub Date : 2025-07-16 DOI:10.1016/j.swevo.2025.102076

Minghai Yuan, Yang Ye, Hanyu Huang, Zhen Zhang, Fengque Pei, Wenbin Gu

{"title":"基于多智能体双深度q网络的分布式异构混合流车间多目标节能调度","authors":"Minghai Yuan, Yang Ye, Hanyu Huang, Zhen Zhang, Fengque Pei, Wenbin Gu","doi":"10.1016/j.swevo.2025.102076","DOIUrl":null,"url":null,"abstract":"<div><div>The Distributed Heterogeneous Hybrid Flow Shop Scheduling Problem (DHHFSP) poses a highly complex NP-hard combinatorial optimization challenge, particularly under the dual pressures of green manufacturing and distributed production. To address the limitations of conventional optimization algorithms in handling large-scale multi-objective scheduling with dynamic constraints, this paper proposes a novel Multi-Agent Double Deep Q-Network (MADDQN) based energy-efficient scheduling framework. The DHHFSP is formulated as a Markov Decision Process (MDP), where hierarchical agents representing jobs, workshops, and machines collaboratively learn optimal scheduling policies through a shared state representation and discrete rule-based action spaces. A hybrid reward mechanism combining delayed immediate rewards and global rewards is designed to efficiently guide the agents toward minimizing due time error (DTE) and total energy consumption (TEC). In addition, an adaptive energy-saving strategy is introduced to further reduce standby energy consumption without compromising delivery deadlines. Extensive computational experiments demonstrate that the proposed MADDQN achieves superior performance over state-of-the-art algorithms such as NSGA-II and optimal single rule methods in terms of convergence, solution diversity, and computational efficiency, with average improvements of 59.76%, 67.25%, and 99.72%, respectively. Furthermore, Pareto-based multi-objective evaluation metrics are utilized to comprehensively assess the balance between conflicting objectives. An industrial case study validates the practical applicability of the proposed method within real-world manufacturing execution systems (MES), offering a scalable and intelligent solution for energy-efficient scheduling in distributed heterogeneous manufacturing environments.</div></div>","PeriodicalId":48682,"journal":{"name":"Swarm and Evolutionary Computation","volume":"98 ","pages":"Article 102076"},"PeriodicalIF":8.2000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-objective energy-efficient scheduling of distributed heterogeneous hybrid flow shops via multi-agent double deep Q-Network\",\"authors\":\"Minghai Yuan, Yang Ye, Hanyu Huang, Zhen Zhang, Fengque Pei, Wenbin Gu\",\"doi\":\"10.1016/j.swevo.2025.102076\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The Distributed Heterogeneous Hybrid Flow Shop Scheduling Problem (DHHFSP) poses a highly complex NP-hard combinatorial optimization challenge, particularly under the dual pressures of green manufacturing and distributed production. To address the limitations of conventional optimization algorithms in handling large-scale multi-objective scheduling with dynamic constraints, this paper proposes a novel Multi-Agent Double Deep Q-Network (MADDQN) based energy-efficient scheduling framework. The DHHFSP is formulated as a Markov Decision Process (MDP), where hierarchical agents representing jobs, workshops, and machines collaboratively learn optimal scheduling policies through a shared state representation and discrete rule-based action spaces. A hybrid reward mechanism combining delayed immediate rewards and global rewards is designed to efficiently guide the agents toward minimizing due time error (DTE) and total energy consumption (TEC). In addition, an adaptive energy-saving strategy is introduced to further reduce standby energy consumption without compromising delivery deadlines. Extensive computational experiments demonstrate that the proposed MADDQN achieves superior performance over state-of-the-art algorithms such as NSGA-II and optimal single rule methods in terms of convergence, solution diversity, and computational efficiency, with average improvements of 59.76%, 67.25%, and 99.72%, respectively. Furthermore, Pareto-based multi-objective evaluation metrics are utilized to comprehensively assess the balance between conflicting objectives. An industrial case study validates the practical applicability of the proposed method within real-world manufacturing execution systems (MES), offering a scalable and intelligent solution for energy-efficient scheduling in distributed heterogeneous manufacturing environments.</div></div>\",\"PeriodicalId\":48682,\"journal\":{\"name\":\"Swarm and Evolutionary Computation\",\"volume\":\"98 \",\"pages\":\"Article 102076\"},\"PeriodicalIF\":8.2000,\"publicationDate\":\"2025-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Swarm and Evolutionary Computation\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2210650225002342\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Swarm and Evolutionary Computation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210650225002342","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

分布式异构混合流车间调度问题（DHHFSP）是一个高度复杂的NP-hard组合优化问题，特别是在绿色制造和分布式生产的双重压力下。针对传统优化算法在处理具有动态约束的大规模多目标调度时的局限性，提出了一种基于多智能体双深度q网络（madqn）的新型节能调度框架。DHHFSP被制定为马尔可夫决策过程（MDP），其中代表工作、车间和机器的分层代理通过共享状态表示和离散的基于规则的操作空间协作学习最佳调度策略。设计了一种延迟即时奖励和全局奖励相结合的混合奖励机制，有效地引导智能体最小化到期时间误差（DTE）和总能量消耗（TEC）。此外，还引入了自适应节能策略，在不影响交货期限的情况下进一步降低待机能耗。大量的计算实验表明，所提出的madqn在收敛性、解多样性和计算效率方面优于NSGA-II和最优单规则方法等最先进算法，平均分别提高59.76%、67.25%和99.72%。此外，利用基于pareto的多目标评价指标对冲突目标之间的平衡进行综合评价。一个工业案例研究验证了所提出的方法在实际制造执行系统（MES）中的实际适用性，为分布式异构制造环境中的节能调度提供了可扩展的智能解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Multi-objective energy-efficient scheduling of distributed heterogeneous hybrid flow shops via multi-agent double deep Q-Network

The Distributed Heterogeneous Hybrid Flow Shop Scheduling Problem (DHHFSP) poses a highly complex NP-hard combinatorial optimization challenge, particularly under the dual pressures of green manufacturing and distributed production. To address the limitations of conventional optimization algorithms in handling large-scale multi-objective scheduling with dynamic constraints, this paper proposes a novel Multi-Agent Double Deep Q-Network (MADDQN) based energy-efficient scheduling framework. The DHHFSP is formulated as a Markov Decision Process (MDP), where hierarchical agents representing jobs, workshops, and machines collaboratively learn optimal scheduling policies through a shared state representation and discrete rule-based action spaces. A hybrid reward mechanism combining delayed immediate rewards and global rewards is designed to efficiently guide the agents toward minimizing due time error (DTE) and total energy consumption (TEC). In addition, an adaptive energy-saving strategy is introduced to further reduce standby energy consumption without compromising delivery deadlines. Extensive computational experiments demonstrate that the proposed MADDQN achieves superior performance over state-of-the-art algorithms such as NSGA-II and optimal single rule methods in terms of convergence, solution diversity, and computational efficiency, with average improvements of 59.76%, 67.25%, and 99.72%, respectively. Furthermore, Pareto-based multi-objective evaluation metrics are utilized to comprehensively assess the balance between conflicting objectives. An industrial case study validates the practical applicability of the proposed method within real-world manufacturing execution systems (MES), offering a scalable and intelligent solution for energy-efficient scheduling in distributed heterogeneous manufacturing environments.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Swarm and Evolutionary Computation COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCEC-COMPUTER SCIENCE, THEORY & METHODS

CiteScore

16.00

自引率

12.00%

发文量

169

期刊介绍： Swarm and Evolutionary Computation is a pioneering peer-reviewed journal focused on the latest research and advancements in nature-inspired intelligent computation using swarm and evolutionary algorithms. It covers theoretical, experimental, and practical aspects of these paradigms and their hybrids, promoting interdisciplinary research. The journal prioritizes the publication of high-quality, original articles that push the boundaries of evolutionary computation and swarm intelligence. Additionally, it welcomes survey papers on current topics and novel applications. Topics of interest include but are not limited to: Genetic Algorithms, and Genetic Programming, Evolution Strategies, and Evolutionary Programming, Differential Evolution, Artificial Immune Systems, Particle Swarms, Ant Colony, Bacterial Foraging, Artificial Bees, Fireflies Algorithm, Harmony Search, Artificial Life, Digital Organisms, Estimation of Distribution Algorithms, Stochastic Diffusion Search, Quantum Computing, Nano Computing, Membrane Computing, Human-centric Computing, Hybridization of Algorithms, Memetic Computing, Autonomic Computing, Self-organizing systems, Combinatorial, Discrete, Binary, Constrained, Multi-objective, Multi-modal, Dynamic, and Large-scale Optimization.