Minghai Yuan, Yang Ye, Hanyu Huang, Zhen Zhang, Fengque Pei, Wenbin Gu
{"title":"基于多智能体双深度q网络的分布式异构混合流车间多目标节能调度","authors":"Minghai Yuan, Yang Ye, Hanyu Huang, Zhen Zhang, Fengque Pei, Wenbin Gu","doi":"10.1016/j.swevo.2025.102076","DOIUrl":null,"url":null,"abstract":"<div><div>The Distributed Heterogeneous Hybrid Flow Shop Scheduling Problem (DHHFSP) poses a highly complex NP-hard combinatorial optimization challenge, particularly under the dual pressures of green manufacturing and distributed production. To address the limitations of conventional optimization algorithms in handling large-scale multi-objective scheduling with dynamic constraints, this paper proposes a novel Multi-Agent Double Deep Q-Network (MADDQN) based energy-efficient scheduling framework. The DHHFSP is formulated as a Markov Decision Process (MDP), where hierarchical agents representing jobs, workshops, and machines collaboratively learn optimal scheduling policies through a shared state representation and discrete rule-based action spaces. A hybrid reward mechanism combining delayed immediate rewards and global rewards is designed to efficiently guide the agents toward minimizing due time error (DTE) and total energy consumption (TEC). In addition, an adaptive energy-saving strategy is introduced to further reduce standby energy consumption without compromising delivery deadlines. Extensive computational experiments demonstrate that the proposed MADDQN achieves superior performance over state-of-the-art algorithms such as NSGA-II and optimal single rule methods in terms of convergence, solution diversity, and computational efficiency, with average improvements of 59.76%, 67.25%, and 99.72%, respectively. Furthermore, Pareto-based multi-objective evaluation metrics are utilized to comprehensively assess the balance between conflicting objectives. An industrial case study validates the practical applicability of the proposed method within real-world manufacturing execution systems (MES), offering a scalable and intelligent solution for energy-efficient scheduling in distributed heterogeneous manufacturing environments.</div></div>","PeriodicalId":48682,"journal":{"name":"Swarm and Evolutionary Computation","volume":"98 ","pages":"Article 102076"},"PeriodicalIF":8.2000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-objective energy-efficient scheduling of distributed heterogeneous hybrid flow shops via multi-agent double deep Q-Network\",\"authors\":\"Minghai Yuan, Yang Ye, Hanyu Huang, Zhen Zhang, Fengque Pei, Wenbin Gu\",\"doi\":\"10.1016/j.swevo.2025.102076\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The Distributed Heterogeneous Hybrid Flow Shop Scheduling Problem (DHHFSP) poses a highly complex NP-hard combinatorial optimization challenge, particularly under the dual pressures of green manufacturing and distributed production. To address the limitations of conventional optimization algorithms in handling large-scale multi-objective scheduling with dynamic constraints, this paper proposes a novel Multi-Agent Double Deep Q-Network (MADDQN) based energy-efficient scheduling framework. The DHHFSP is formulated as a Markov Decision Process (MDP), where hierarchical agents representing jobs, workshops, and machines collaboratively learn optimal scheduling policies through a shared state representation and discrete rule-based action spaces. A hybrid reward mechanism combining delayed immediate rewards and global rewards is designed to efficiently guide the agents toward minimizing due time error (DTE) and total energy consumption (TEC). In addition, an adaptive energy-saving strategy is introduced to further reduce standby energy consumption without compromising delivery deadlines. Extensive computational experiments demonstrate that the proposed MADDQN achieves superior performance over state-of-the-art algorithms such as NSGA-II and optimal single rule methods in terms of convergence, solution diversity, and computational efficiency, with average improvements of 59.76%, 67.25%, and 99.72%, respectively. Furthermore, Pareto-based multi-objective evaluation metrics are utilized to comprehensively assess the balance between conflicting objectives. An industrial case study validates the practical applicability of the proposed method within real-world manufacturing execution systems (MES), offering a scalable and intelligent solution for energy-efficient scheduling in distributed heterogeneous manufacturing environments.</div></div>\",\"PeriodicalId\":48682,\"journal\":{\"name\":\"Swarm and Evolutionary Computation\",\"volume\":\"98 \",\"pages\":\"Article 102076\"},\"PeriodicalIF\":8.2000,\"publicationDate\":\"2025-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Swarm and Evolutionary Computation\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2210650225002342\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Swarm and Evolutionary Computation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210650225002342","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Multi-objective energy-efficient scheduling of distributed heterogeneous hybrid flow shops via multi-agent double deep Q-Network
The Distributed Heterogeneous Hybrid Flow Shop Scheduling Problem (DHHFSP) poses a highly complex NP-hard combinatorial optimization challenge, particularly under the dual pressures of green manufacturing and distributed production. To address the limitations of conventional optimization algorithms in handling large-scale multi-objective scheduling with dynamic constraints, this paper proposes a novel Multi-Agent Double Deep Q-Network (MADDQN) based energy-efficient scheduling framework. The DHHFSP is formulated as a Markov Decision Process (MDP), where hierarchical agents representing jobs, workshops, and machines collaboratively learn optimal scheduling policies through a shared state representation and discrete rule-based action spaces. A hybrid reward mechanism combining delayed immediate rewards and global rewards is designed to efficiently guide the agents toward minimizing due time error (DTE) and total energy consumption (TEC). In addition, an adaptive energy-saving strategy is introduced to further reduce standby energy consumption without compromising delivery deadlines. Extensive computational experiments demonstrate that the proposed MADDQN achieves superior performance over state-of-the-art algorithms such as NSGA-II and optimal single rule methods in terms of convergence, solution diversity, and computational efficiency, with average improvements of 59.76%, 67.25%, and 99.72%, respectively. Furthermore, Pareto-based multi-objective evaluation metrics are utilized to comprehensively assess the balance between conflicting objectives. An industrial case study validates the practical applicability of the proposed method within real-world manufacturing execution systems (MES), offering a scalable and intelligent solution for energy-efficient scheduling in distributed heterogeneous manufacturing environments.
期刊介绍:
Swarm and Evolutionary Computation is a pioneering peer-reviewed journal focused on the latest research and advancements in nature-inspired intelligent computation using swarm and evolutionary algorithms. It covers theoretical, experimental, and practical aspects of these paradigms and their hybrids, promoting interdisciplinary research. The journal prioritizes the publication of high-quality, original articles that push the boundaries of evolutionary computation and swarm intelligence. Additionally, it welcomes survey papers on current topics and novel applications. Topics of interest include but are not limited to: Genetic Algorithms, and Genetic Programming, Evolution Strategies, and Evolutionary Programming, Differential Evolution, Artificial Immune Systems, Particle Swarms, Ant Colony, Bacterial Foraging, Artificial Bees, Fireflies Algorithm, Harmony Search, Artificial Life, Digital Organisms, Estimation of Distribution Algorithms, Stochastic Diffusion Search, Quantum Computing, Nano Computing, Membrane Computing, Human-centric Computing, Hybridization of Algorithms, Memetic Computing, Autonomic Computing, Self-organizing systems, Combinatorial, Discrete, Binary, Constrained, Multi-objective, Multi-modal, Dynamic, and Large-scale Optimization.