{"title":"Decomposition-based multi-objective reinforcement learning for dynamic disassembly job shop scheduling with urgency guidance","authors":"Fangyu Li, Ruichong Ma, Jiarong Du, Honggui Han","doi":"10.1016/j.swevo.2025.102040","DOIUrl":null,"url":null,"abstract":"<div><div>The dynamic disassembly job shop scheduling problem (DDJSSP) entails organizing multiple jobs with distinct requirements across machines, where job operations are subject to sequence constraints and urgency conditions. Existing deep reinforcement learning techniques for multi-objective job shop scheduling problems (JSSP) with manually weighted reward functions result in a single policy, limiting the ability to approximate multiple policies in the Pareto front. To minimize both makespan and total energy consumption in DDJSSP, we propose an urgency-driven decomposition-based multi-objective reinforcement learning (UD-MORL) approach. We simulate a dynamic scheduling environment reflecting real-world complexities by introducing uncertain processing times, random employee absences, and an urgency rate for orders. We then develop a decomposition approach to separate objectives by adjusting weights and iterating policies based on performance and information entropy metrics. Finally, we employ a mutual information mechanism to identify the weight combination exhibiting the strongest correlation with population points, thereby improving weight-fitting efficiency. Experimental results on public general-purpose JSSP datasets show UD-MORL outperforms existing multi-objective reinforcement learning algorithms in hypervolume and sparsity, achieving an average hypervolume improvement, sparsity reduction, and a win-rate of 55% across all benchmark instances.</div></div>","PeriodicalId":48682,"journal":{"name":"Swarm and Evolutionary Computation","volume":"97 ","pages":"Article 102040"},"PeriodicalIF":8.2000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Swarm and Evolutionary Computation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210650225001981","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The dynamic disassembly job shop scheduling problem (DDJSSP) entails organizing multiple jobs with distinct requirements across machines, where job operations are subject to sequence constraints and urgency conditions. Existing deep reinforcement learning techniques for multi-objective job shop scheduling problems (JSSP) with manually weighted reward functions result in a single policy, limiting the ability to approximate multiple policies in the Pareto front. To minimize both makespan and total energy consumption in DDJSSP, we propose an urgency-driven decomposition-based multi-objective reinforcement learning (UD-MORL) approach. We simulate a dynamic scheduling environment reflecting real-world complexities by introducing uncertain processing times, random employee absences, and an urgency rate for orders. We then develop a decomposition approach to separate objectives by adjusting weights and iterating policies based on performance and information entropy metrics. Finally, we employ a mutual information mechanism to identify the weight combination exhibiting the strongest correlation with population points, thereby improving weight-fitting efficiency. Experimental results on public general-purpose JSSP datasets show UD-MORL outperforms existing multi-objective reinforcement learning algorithms in hypervolume and sparsity, achieving an average hypervolume improvement, sparsity reduction, and a win-rate of 55% across all benchmark instances.
期刊介绍:
Swarm and Evolutionary Computation is a pioneering peer-reviewed journal focused on the latest research and advancements in nature-inspired intelligent computation using swarm and evolutionary algorithms. It covers theoretical, experimental, and practical aspects of these paradigms and their hybrids, promoting interdisciplinary research. The journal prioritizes the publication of high-quality, original articles that push the boundaries of evolutionary computation and swarm intelligence. Additionally, it welcomes survey papers on current topics and novel applications. Topics of interest include but are not limited to: Genetic Algorithms, and Genetic Programming, Evolution Strategies, and Evolutionary Programming, Differential Evolution, Artificial Immune Systems, Particle Swarms, Ant Colony, Bacterial Foraging, Artificial Bees, Fireflies Algorithm, Harmony Search, Artificial Life, Digital Organisms, Estimation of Distribution Algorithms, Stochastic Diffusion Search, Quantum Computing, Nano Computing, Membrane Computing, Human-centric Computing, Hybridization of Algorithms, Memetic Computing, Autonomic Computing, Self-organizing systems, Combinatorial, Discrete, Binary, Constrained, Multi-objective, Multi-modal, Dynamic, and Large-scale Optimization.