A novel deep self-learning method for flexible job-shop scheduling problems with multiplicity: Deep reinforcement learning assisted the fluid master-apprentice evolutionary algorithm
IF 8.2 1区 计算机科学Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Linshan Ding , Dan Luo , Rauf Mudassar , Lei Yue , Leilei Meng
{"title":"A novel deep self-learning method for flexible job-shop scheduling problems with multiplicity: Deep reinforcement learning assisted the fluid master-apprentice evolutionary algorithm","authors":"Linshan Ding , Dan Luo , Rauf Mudassar , Lei Yue , Leilei Meng","doi":"10.1016/j.swevo.2025.101907","DOIUrl":null,"url":null,"abstract":"<div><div>In today’s dynamic environment, companies must navigate highly competitive markets. They consistently need to implement new technologies and deliver the right product at the right time in response to customer demand. This necessitates a high level of adaptability and efficiency in their manufacturing processes. Flexible job-shops offer a more efficient alternative to traditional manufacturing practices by accommodating these needs. Additionally, in actual manufacturing plants, multiple jobs are typically required for each part type. To address this complexity, this article investigates the flexible job-shop scheduling problem with multiplicity (MFJSP). We propose a deep self-learning method based on deep reinforcement learning and fluid master-apprentice evolutionary algorithm (DSLFMAE) to minimize makespan for the MFJSP. The proposed DSLFMAE is the integration of a fluid master-apprentice evolutionary (FMAE) algorithm and a proximal policy optimization (PPO) algorithm. The FMAE algorithm serves as the core optimization method, employing the PPO algorithm to dynamically adjust the control parameters of the FMAE algorithm during the optimization process. Twelve state features are extracted to capture the evolutionary states of the FMAE algorithm accurately, and a long short-term memory Q-network (LSTM-Q) is designed to encode these continuous states. Subsequently, to adjust multiple interrelated control parameters of the FMAE algorithm simultaneously, a multivariate Gaussian distribution-based PPO algorithm is developed to train the LSTM-Q network. Numerical outcomes show the efficacy and superiority of the DSLFMAE in addressing the flexible job-shop scheduling problem with multiplicity (MFJSP) across different scales.</div></div>","PeriodicalId":48682,"journal":{"name":"Swarm and Evolutionary Computation","volume":"94 ","pages":"Article 101907"},"PeriodicalIF":8.2000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Swarm and Evolutionary Computation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210650225000653","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In today’s dynamic environment, companies must navigate highly competitive markets. They consistently need to implement new technologies and deliver the right product at the right time in response to customer demand. This necessitates a high level of adaptability and efficiency in their manufacturing processes. Flexible job-shops offer a more efficient alternative to traditional manufacturing practices by accommodating these needs. Additionally, in actual manufacturing plants, multiple jobs are typically required for each part type. To address this complexity, this article investigates the flexible job-shop scheduling problem with multiplicity (MFJSP). We propose a deep self-learning method based on deep reinforcement learning and fluid master-apprentice evolutionary algorithm (DSLFMAE) to minimize makespan for the MFJSP. The proposed DSLFMAE is the integration of a fluid master-apprentice evolutionary (FMAE) algorithm and a proximal policy optimization (PPO) algorithm. The FMAE algorithm serves as the core optimization method, employing the PPO algorithm to dynamically adjust the control parameters of the FMAE algorithm during the optimization process. Twelve state features are extracted to capture the evolutionary states of the FMAE algorithm accurately, and a long short-term memory Q-network (LSTM-Q) is designed to encode these continuous states. Subsequently, to adjust multiple interrelated control parameters of the FMAE algorithm simultaneously, a multivariate Gaussian distribution-based PPO algorithm is developed to train the LSTM-Q network. Numerical outcomes show the efficacy and superiority of the DSLFMAE in addressing the flexible job-shop scheduling problem with multiplicity (MFJSP) across different scales.
期刊介绍:
Swarm and Evolutionary Computation is a pioneering peer-reviewed journal focused on the latest research and advancements in nature-inspired intelligent computation using swarm and evolutionary algorithms. It covers theoretical, experimental, and practical aspects of these paradigms and their hybrids, promoting interdisciplinary research. The journal prioritizes the publication of high-quality, original articles that push the boundaries of evolutionary computation and swarm intelligence. Additionally, it welcomes survey papers on current topics and novel applications. Topics of interest include but are not limited to: Genetic Algorithms, and Genetic Programming, Evolution Strategies, and Evolutionary Programming, Differential Evolution, Artificial Immune Systems, Particle Swarms, Ant Colony, Bacterial Foraging, Artificial Bees, Fireflies Algorithm, Harmony Search, Artificial Life, Digital Organisms, Estimation of Distribution Algorithms, Stochastic Diffusion Search, Quantum Computing, Nano Computing, Membrane Computing, Human-centric Computing, Hybridization of Algorithms, Memetic Computing, Autonomic Computing, Self-organizing systems, Combinatorial, Discrete, Binary, Constrained, Multi-objective, Multi-modal, Dynamic, and Large-scale Optimization.