Wenqiang Zhang , Xuan Bao , Huili Geng , Guohui Zhang , Mitsuo Gen
{"title":"Graph neural network and expert-guided deep reinforcement learning for solving flexible job-shop scheduling problem","authors":"Wenqiang Zhang , Xuan Bao , Huili Geng , Guohui Zhang , Mitsuo Gen","doi":"10.1016/j.cor.2025.107155","DOIUrl":null,"url":null,"abstract":"<div><div>Flexible Job-shop Scheduling Problem (FJSP) is crucial for efficient and flexible automated production. Recent advancements in Reinforcement Learning (RL) have shown promise in solving sequential decision problems. However, most Deep Reinforcement Learning (DRL) algorithms rely on Priority Dispatching Rules (PDRs), which limits scheduling efficiency. This paper proposed a novel Graph Neural Network and Expert-Guided Deep Reinforcement Learning (GNN-EGDRL) framework that employs Graph Neural Network (GNN) to extract features from both machines and operations, thereby integrating operation selection and machine assignment into a unified composite decision-making process. Expert solutions generated by PDRs guide the initial stages of training, allowing agents to gradually transition to self-directed action selection, thus balancing exploration and exploitation. The expert guidance strategy emphasizes the relationships between operations and machines, enhancing the model’s sensitivity and providing granular guidance for feature extraction, leading to optimized solutions. Extensive experiments demonstrate that the proposed GNN-EGDRL method consistently outperforms traditional PDRs and other end-to-end DRL over all problem instances. Notably, this superiority has been validated in a wide range of scenarios, including larger-scale examples, underscoring the method’s scalability and robustness. Furthermore, the method exhibits strong performance in scenarios not encountered during training, highlighting its effectiveness and adaptability in diverse production environments.</div></div>","PeriodicalId":10542,"journal":{"name":"Computers & Operations Research","volume":"183 ","pages":"Article 107155"},"PeriodicalIF":4.3000,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Operations Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0305054825001832","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Flexible Job-shop Scheduling Problem (FJSP) is crucial for efficient and flexible automated production. Recent advancements in Reinforcement Learning (RL) have shown promise in solving sequential decision problems. However, most Deep Reinforcement Learning (DRL) algorithms rely on Priority Dispatching Rules (PDRs), which limits scheduling efficiency. This paper proposed a novel Graph Neural Network and Expert-Guided Deep Reinforcement Learning (GNN-EGDRL) framework that employs Graph Neural Network (GNN) to extract features from both machines and operations, thereby integrating operation selection and machine assignment into a unified composite decision-making process. Expert solutions generated by PDRs guide the initial stages of training, allowing agents to gradually transition to self-directed action selection, thus balancing exploration and exploitation. The expert guidance strategy emphasizes the relationships between operations and machines, enhancing the model’s sensitivity and providing granular guidance for feature extraction, leading to optimized solutions. Extensive experiments demonstrate that the proposed GNN-EGDRL method consistently outperforms traditional PDRs and other end-to-end DRL over all problem instances. Notably, this superiority has been validated in a wide range of scenarios, including larger-scale examples, underscoring the method’s scalability and robustness. Furthermore, the method exhibits strong performance in scenarios not encountered during training, highlighting its effectiveness and adaptability in diverse production environments.
期刊介绍:
Operations research and computers meet in a large number of scientific fields, many of which are of vital current concern to our troubled society. These include, among others, ecology, transportation, safety, reliability, urban planning, economics, inventory control, investment strategy and logistics (including reverse logistics). Computers & Operations Research provides an international forum for the application of computers and operations research techniques to problems in these and related fields.