{"title":"基于协同多智能体深度强化学习的混合流车间机器预防性维修集成动态调度方法","authors":"Siqi Liu , Haiping Zhu , LieZheng Sheng","doi":"10.1016/j.rcim.2025.103085","DOIUrl":null,"url":null,"abstract":"<div><div>Hybrid flow shop widely used in manufacturing industry is facing the challenge of complex and dynamic production environment. Current study mostly cannot consider the machine preventive maintenance and dynamic events in hybrid flow shop scheduling process. Therefore, this paper presents an integrated dynamic scheduling method for hybrid flow shop scheduling problem- unrelated parallel machine considering preventive maintenance (DHSFP-UPM-PM). And the multi-scheduling objectives include minimizing completion time, processing cost and maintenance cost. Firstly, the definition of research problem and the basic maintenance strategy are presented in detail. And an integrated mathematic model of DHSFP-UPM-PM is constructed. Then the integrated dynamic scheduling framework based on cooperative multi-agent deep reinforcement learning is proposed to solve the DHSFP-UPM-PM. Based on the above, we proposed a cooperative multi-processing stage agents (PSA) approach to realize the transformation from traditional single-agent to multi-agent. Meanwhile, the cooperative multi-agent Markova Decision Process is formulated to clarify the interaction between each agent and production environment. The state and action space as the key elements of scheduling model is also designed for each PSA. To optimize scheduling objectives, this paper further formulates new global reward mechanism and centralized training-decentralized execution method based on multi agent proximal policy optimization. Lastly, the experiment results verify the superiority and effectiveness of the proposed method when solving integrated scheduling problem and dynamic event. And the proposed method presents remarkable adaptability and flexibility under a different production scenario which prove the benefits of multi-agent deep reinforcement learning in complex and dynamic environment.</div></div>","PeriodicalId":21452,"journal":{"name":"Robotics and Computer-integrated Manufacturing","volume":"97 ","pages":"Article 103085"},"PeriodicalIF":9.1000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrated dynamic scheduling method for hybrid flow shop with machine preventive maintenance based on cooperative multi-agent deep reinforcement learning\",\"authors\":\"Siqi Liu , Haiping Zhu , LieZheng Sheng\",\"doi\":\"10.1016/j.rcim.2025.103085\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Hybrid flow shop widely used in manufacturing industry is facing the challenge of complex and dynamic production environment. Current study mostly cannot consider the machine preventive maintenance and dynamic events in hybrid flow shop scheduling process. Therefore, this paper presents an integrated dynamic scheduling method for hybrid flow shop scheduling problem- unrelated parallel machine considering preventive maintenance (DHSFP-UPM-PM). And the multi-scheduling objectives include minimizing completion time, processing cost and maintenance cost. Firstly, the definition of research problem and the basic maintenance strategy are presented in detail. And an integrated mathematic model of DHSFP-UPM-PM is constructed. Then the integrated dynamic scheduling framework based on cooperative multi-agent deep reinforcement learning is proposed to solve the DHSFP-UPM-PM. Based on the above, we proposed a cooperative multi-processing stage agents (PSA) approach to realize the transformation from traditional single-agent to multi-agent. Meanwhile, the cooperative multi-agent Markova Decision Process is formulated to clarify the interaction between each agent and production environment. The state and action space as the key elements of scheduling model is also designed for each PSA. To optimize scheduling objectives, this paper further formulates new global reward mechanism and centralized training-decentralized execution method based on multi agent proximal policy optimization. Lastly, the experiment results verify the superiority and effectiveness of the proposed method when solving integrated scheduling problem and dynamic event. And the proposed method presents remarkable adaptability and flexibility under a different production scenario which prove the benefits of multi-agent deep reinforcement learning in complex and dynamic environment.</div></div>\",\"PeriodicalId\":21452,\"journal\":{\"name\":\"Robotics and Computer-integrated Manufacturing\",\"volume\":\"97 \",\"pages\":\"Article 103085\"},\"PeriodicalIF\":9.1000,\"publicationDate\":\"2025-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Robotics and Computer-integrated Manufacturing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0736584525001395\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Computer-integrated Manufacturing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0736584525001395","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Integrated dynamic scheduling method for hybrid flow shop with machine preventive maintenance based on cooperative multi-agent deep reinforcement learning
Hybrid flow shop widely used in manufacturing industry is facing the challenge of complex and dynamic production environment. Current study mostly cannot consider the machine preventive maintenance and dynamic events in hybrid flow shop scheduling process. Therefore, this paper presents an integrated dynamic scheduling method for hybrid flow shop scheduling problem- unrelated parallel machine considering preventive maintenance (DHSFP-UPM-PM). And the multi-scheduling objectives include minimizing completion time, processing cost and maintenance cost. Firstly, the definition of research problem and the basic maintenance strategy are presented in detail. And an integrated mathematic model of DHSFP-UPM-PM is constructed. Then the integrated dynamic scheduling framework based on cooperative multi-agent deep reinforcement learning is proposed to solve the DHSFP-UPM-PM. Based on the above, we proposed a cooperative multi-processing stage agents (PSA) approach to realize the transformation from traditional single-agent to multi-agent. Meanwhile, the cooperative multi-agent Markova Decision Process is formulated to clarify the interaction between each agent and production environment. The state and action space as the key elements of scheduling model is also designed for each PSA. To optimize scheduling objectives, this paper further formulates new global reward mechanism and centralized training-decentralized execution method based on multi agent proximal policy optimization. Lastly, the experiment results verify the superiority and effectiveness of the proposed method when solving integrated scheduling problem and dynamic event. And the proposed method presents remarkable adaptability and flexibility under a different production scenario which prove the benefits of multi-agent deep reinforcement learning in complex and dynamic environment.
期刊介绍:
The journal, Robotics and Computer-Integrated Manufacturing, focuses on sharing research applications that contribute to the development of new or enhanced robotics, manufacturing technologies, and innovative manufacturing strategies that are relevant to industry. Papers that combine theory and experimental validation are preferred, while review papers on current robotics and manufacturing issues are also considered. However, papers on traditional machining processes, modeling and simulation, supply chain management, and resource optimization are generally not within the scope of the journal, as there are more appropriate journals for these topics. Similarly, papers that are overly theoretical or mathematical will be directed to other suitable journals. The journal welcomes original papers in areas such as industrial robotics, human-robot collaboration in manufacturing, cloud-based manufacturing, cyber-physical production systems, big data analytics in manufacturing, smart mechatronics, machine learning, adaptive and sustainable manufacturing, and other fields involving unique manufacturing technologies.