{"title":"Data-driven hierarchical multi-policy deep reinforcement learning framework for multi-objective multiplicity dynamic flexible job shop scheduling","authors":"Linshan Ding , Zailin Guan , Dan Luo , Lei Yue","doi":"10.1016/j.jmsy.2025.03.019","DOIUrl":null,"url":null,"abstract":"<div><div>In the context of Industry 4.0, manufacturers face pressure to personalize products and accelerate the supply chain. This requires rapid response to volatile production schedules, ensuring a balance between operational efficiency and product quality. Moreover, the rapid development and convergence of the cloud computing, Internet of Things (IoT), and big data have expanded the need for real-time tracking and adaptive scheduling to address uncertainties, such as equipment downtime, supply variation, and ongoing product revisions. The capability of IoT has significantly improved the continuous monitoring and data analysis, emphasizing the importance of developing effective real-time scheduling solutions in the manufacturing system. In response to these evolving industrial requirements, and driven by objectives to reduce the makespan, total tardiness, and energy consumption, we study the multi-objective multiplicity dynamic flexible job shop scheduling problem (MOMDFJSP), to cope with the challenges of new order arrivals and machine breakdowns in the IoT-enabled manufacturing system. This study proposes a novel hierarchical multi-policy deep reinforcement learning framework for IoT-infused manufacturing environments, aiming to integrate these diverse requirements and uncertainties into a coherent and responsive scheduling framework. The proposed framework comprises an upper-level control policy network and three lower-level objective policy networks. The upper-level and lower-level networks are respectively responsible for selecting temporary optimization objectives and specific dispatching rules. Based on the proposed framework, we design a two-stage training approach named the hierarchical multi-policy soft actor-critic (HMPSAC) algorithm to train multiple policy networks. In addition, we develop a fluid model to design the state features and dispatching rules that act as inputs and outputs, respectively, for the deep reinforcement learning (DRL) policy network. The comparative analysis with well-known dispatching rules and DRL-based methods reveals the superior performance of HMPSAC algorithm.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"80 ","pages":"Pages 536-562"},"PeriodicalIF":12.2000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Manufacturing Systems","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0278612525000809","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0
Abstract
In the context of Industry 4.0, manufacturers face pressure to personalize products and accelerate the supply chain. This requires rapid response to volatile production schedules, ensuring a balance between operational efficiency and product quality. Moreover, the rapid development and convergence of the cloud computing, Internet of Things (IoT), and big data have expanded the need for real-time tracking and adaptive scheduling to address uncertainties, such as equipment downtime, supply variation, and ongoing product revisions. The capability of IoT has significantly improved the continuous monitoring and data analysis, emphasizing the importance of developing effective real-time scheduling solutions in the manufacturing system. In response to these evolving industrial requirements, and driven by objectives to reduce the makespan, total tardiness, and energy consumption, we study the multi-objective multiplicity dynamic flexible job shop scheduling problem (MOMDFJSP), to cope with the challenges of new order arrivals and machine breakdowns in the IoT-enabled manufacturing system. This study proposes a novel hierarchical multi-policy deep reinforcement learning framework for IoT-infused manufacturing environments, aiming to integrate these diverse requirements and uncertainties into a coherent and responsive scheduling framework. The proposed framework comprises an upper-level control policy network and three lower-level objective policy networks. The upper-level and lower-level networks are respectively responsible for selecting temporary optimization objectives and specific dispatching rules. Based on the proposed framework, we design a two-stage training approach named the hierarchical multi-policy soft actor-critic (HMPSAC) algorithm to train multiple policy networks. In addition, we develop a fluid model to design the state features and dispatching rules that act as inputs and outputs, respectively, for the deep reinforcement learning (DRL) policy network. The comparative analysis with well-known dispatching rules and DRL-based methods reveals the superior performance of HMPSAC algorithm.
期刊介绍:
The Journal of Manufacturing Systems is dedicated to showcasing cutting-edge fundamental and applied research in manufacturing at the systems level. Encompassing products, equipment, people, information, control, and support functions, manufacturing systems play a pivotal role in the economical and competitive development, production, delivery, and total lifecycle of products, meeting market and societal needs.
With a commitment to publishing archival scholarly literature, the journal strives to advance the state of the art in manufacturing systems and foster innovation in crafting efficient, robust, and sustainable manufacturing systems. The focus extends from equipment-level considerations to the broader scope of the extended enterprise. The Journal welcomes research addressing challenges across various scales, including nano, micro, and macro-scale manufacturing, and spanning diverse sectors such as aerospace, automotive, energy, and medical device manufacturing.