A novel deep self-learning method for flexible job-shop scheduling problems with multiplicity: Deep reinforcement learning assisted the fluid master-apprentice evolutionary algorithm

IF 8.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Swarm and Evolutionary Computation Pub Date : 2025-03-14 DOI:10.1016/j.swevo.2025.101907

Linshan Ding , Dan Luo , Rauf Mudassar , Lei Yue , Leilei Meng

{"title":"A novel deep self-learning method for flexible job-shop scheduling problems with multiplicity: Deep reinforcement learning assisted the fluid master-apprentice evolutionary algorithm","authors":"Linshan Ding , Dan Luo , Rauf Mudassar , Lei Yue , Leilei Meng","doi":"10.1016/j.swevo.2025.101907","DOIUrl":null,"url":null,"abstract":"<div><div>In today’s dynamic environment, companies must navigate highly competitive markets. They consistently need to implement new technologies and deliver the right product at the right time in response to customer demand. This necessitates a high level of adaptability and efficiency in their manufacturing processes. Flexible job-shops offer a more efficient alternative to traditional manufacturing practices by accommodating these needs. Additionally, in actual manufacturing plants, multiple jobs are typically required for each part type. To address this complexity, this article investigates the flexible job-shop scheduling problem with multiplicity (MFJSP). We propose a deep self-learning method based on deep reinforcement learning and fluid master-apprentice evolutionary algorithm (DSLFMAE) to minimize makespan for the MFJSP. The proposed DSLFMAE is the integration of a fluid master-apprentice evolutionary (FMAE) algorithm and a proximal policy optimization (PPO) algorithm. The FMAE algorithm serves as the core optimization method, employing the PPO algorithm to dynamically adjust the control parameters of the FMAE algorithm during the optimization process. Twelve state features are extracted to capture the evolutionary states of the FMAE algorithm accurately, and a long short-term memory Q-network (LSTM-Q) is designed to encode these continuous states. Subsequently, to adjust multiple interrelated control parameters of the FMAE algorithm simultaneously, a multivariate Gaussian distribution-based PPO algorithm is developed to train the LSTM-Q network. Numerical outcomes show the efficacy and superiority of the DSLFMAE in addressing the flexible job-shop scheduling problem with multiplicity (MFJSP) across different scales.</div></div>","PeriodicalId":48682,"journal":{"name":"Swarm and Evolutionary Computation","volume":"94 ","pages":"Article 101907"},"PeriodicalIF":8.2000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Swarm and Evolutionary Computation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210650225000653","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

In today’s dynamic environment, companies must navigate highly competitive markets. They consistently need to implement new technologies and deliver the right product at the right time in response to customer demand. This necessitates a high level of adaptability and efficiency in their manufacturing processes. Flexible job-shops offer a more efficient alternative to traditional manufacturing practices by accommodating these needs. Additionally, in actual manufacturing plants, multiple jobs are typically required for each part type. To address this complexity, this article investigates the flexible job-shop scheduling problem with multiplicity (MFJSP). We propose a deep self-learning method based on deep reinforcement learning and fluid master-apprentice evolutionary algorithm (DSLFMAE) to minimize makespan for the MFJSP. The proposed DSLFMAE is the integration of a fluid master-apprentice evolutionary (FMAE) algorithm and a proximal policy optimization (PPO) algorithm. The FMAE algorithm serves as the core optimization method, employing the PPO algorithm to dynamically adjust the control parameters of the FMAE algorithm during the optimization process. Twelve state features are extracted to capture the evolutionary states of the FMAE algorithm accurately, and a long short-term memory Q-network (LSTM-Q) is designed to encode these continuous states. Subsequently, to adjust multiple interrelated control parameters of the FMAE algorithm simultaneously, a multivariate Gaussian distribution-based PPO algorithm is developed to train the LSTM-Q network. Numerical outcomes show the efficacy and superiority of the DSLFMAE in addressing the flexible job-shop scheduling problem with multiplicity (MFJSP) across different scales.

Abstract Image

查看原文本刊更多论文

一种新的求解多重柔性作业车间调度问题的深度自学习方法：深度强化学习辅助流体师傅徒弟进化算法

在当今充满活力的环境中，企业必须在竞争激烈的市场中生存。他们始终需要实现新技术，并在正确的时间交付正确的产品，以响应客户的需求。这需要在制造过程中具有高水平的适应性和效率。灵活作业车间通过满足这些需求，为传统制造实践提供了更有效的替代方案。此外，在实际的制造工厂中，每个零件类型通常需要多个作业。为了解决这种复杂性，本文研究了具有多样性的灵活作业车间调度问题（MFJSP）。我们提出了一种基于深度强化学习和流体师徒进化算法（DSLFMAE）的深度自学习方法来最小化MFJSP的完工时间。所提出的DSLFMAE是一种流体师徒进化（FMAE）算法和近端策略优化（PPO）算法的集成。以FMAE算法为核心优化方法，在优化过程中采用PPO算法动态调整FMAE算法的控制参数。为了准确捕捉FMAE算法的进化状态，提取了12个状态特征，并设计了长短期记忆q网络（LSTM-Q）对这些连续状态进行编码。随后，为了同时调整FMAE算法的多个相互关联的控制参数，提出了一种基于多元高斯分布的PPO算法来训练LSTM-Q网络。数值结果表明，该算法在求解具有多重性的柔性作业车间调度问题（MFJSP）中的有效性和优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Swarm and Evolutionary Computation COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCEC-COMPUTER SCIENCE, THEORY & METHODS

CiteScore

16.00

自引率

12.00%

发文量

169

期刊介绍： Swarm and Evolutionary Computation is a pioneering peer-reviewed journal focused on the latest research and advancements in nature-inspired intelligent computation using swarm and evolutionary algorithms. It covers theoretical, experimental, and practical aspects of these paradigms and their hybrids, promoting interdisciplinary research. The journal prioritizes the publication of high-quality, original articles that push the boundaries of evolutionary computation and swarm intelligence. Additionally, it welcomes survey papers on current topics and novel applications. Topics of interest include but are not limited to: Genetic Algorithms, and Genetic Programming, Evolution Strategies, and Evolutionary Programming, Differential Evolution, Artificial Immune Systems, Particle Swarms, Ant Colony, Bacterial Foraging, Artificial Bees, Fireflies Algorithm, Harmony Search, Artificial Life, Digital Organisms, Estimation of Distribution Algorithms, Stochastic Diffusion Search, Quantum Computing, Nano Computing, Membrane Computing, Human-centric Computing, Hybridization of Algorithms, Memetic Computing, Autonomic Computing, Self-organizing systems, Combinatorial, Discrete, Binary, Constrained, Multi-objective, Multi-modal, Dynamic, and Large-scale Optimization.