Ming Wang , Jie Zhang , Peng Zhang , Wenbin Xiang , Mengyu Jin , Hongsen Li
{"title":"Generative deep reinforcement learning method for dynamic parallel machines scheduling with adaptive maintenance activities","authors":"Ming Wang , Jie Zhang , Peng Zhang , Wenbin Xiang , Mengyu Jin , Hongsen Li","doi":"10.1016/j.jmsy.2024.11.004","DOIUrl":null,"url":null,"abstract":"<div><div>In the process industries, where orders arrive at irregular intervals, inappropriate maintenance frequency often leads to unplanned shutdowns of high-speed parallel machines, resulting in unnecessary material consumption and a significant decline in the performance of the dynamic parallel machines scheduling. To address this issue, this paper proposes a generative deep reinforcement learning method that investigates the dynamic parallel machines scheduling problems with adaptive maintenance activities. Specifically, an enhanced Double DQN algorithm is proposed to schedule the dynamically arriving orders and maintenance activities, aiming to maximize average reliability while minimize the production costs. Additionally, a global exploration strategy is incorporated to enhance the scheduling and maintenance agent's global exploration capability, particularly in complex solution spaces with conflicting objectives. Furthermore, recognizing the difficulty of accurately capturing crucial scheduling and maintenance attributes within a predefined state space in a time-varying production environment, a guided Actor-Critic algorithm is introduced to autonomously generate the state space. Moreover, to tackle the unstable learning process caused by sparse rewards, a self-imitation learning is employed to guide the state space generation agent toward achieving rapid learning and convergence. Finally, simulation experiments validate that the proposed method not only autonomously enables state space generation but also exhibits superior performance for the investigated problem.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"77 ","pages":"Pages 946-961"},"PeriodicalIF":12.2000,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Manufacturing Systems","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0278612524002589","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0
Abstract
In the process industries, where orders arrive at irregular intervals, inappropriate maintenance frequency often leads to unplanned shutdowns of high-speed parallel machines, resulting in unnecessary material consumption and a significant decline in the performance of the dynamic parallel machines scheduling. To address this issue, this paper proposes a generative deep reinforcement learning method that investigates the dynamic parallel machines scheduling problems with adaptive maintenance activities. Specifically, an enhanced Double DQN algorithm is proposed to schedule the dynamically arriving orders and maintenance activities, aiming to maximize average reliability while minimize the production costs. Additionally, a global exploration strategy is incorporated to enhance the scheduling and maintenance agent's global exploration capability, particularly in complex solution spaces with conflicting objectives. Furthermore, recognizing the difficulty of accurately capturing crucial scheduling and maintenance attributes within a predefined state space in a time-varying production environment, a guided Actor-Critic algorithm is introduced to autonomously generate the state space. Moreover, to tackle the unstable learning process caused by sparse rewards, a self-imitation learning is employed to guide the state space generation agent toward achieving rapid learning and convergence. Finally, simulation experiments validate that the proposed method not only autonomously enables state space generation but also exhibits superior performance for the investigated problem.
期刊介绍:
The Journal of Manufacturing Systems is dedicated to showcasing cutting-edge fundamental and applied research in manufacturing at the systems level. Encompassing products, equipment, people, information, control, and support functions, manufacturing systems play a pivotal role in the economical and competitive development, production, delivery, and total lifecycle of products, meeting market and societal needs.
With a commitment to publishing archival scholarly literature, the journal strives to advance the state of the art in manufacturing systems and foster innovation in crafting efficient, robust, and sustainable manufacturing systems. The focus extends from equipment-level considerations to the broader scope of the extended enterprise. The Journal welcomes research addressing challenges across various scales, including nano, micro, and macro-scale manufacturing, and spanning diverse sectors such as aerospace, automotive, energy, and medical device manufacturing.