Jingwen Yuan, Kaizhou Gao, Adam Slowik, Benxue Lu, Yanan Jia
{"title":"调度可重入流商店:强化学习引导的元启发式","authors":"Jingwen Yuan, Kaizhou Gao, Adam Slowik, Benxue Lu, Yanan Jia","doi":"10.1049/cim2.70029","DOIUrl":null,"url":null,"abstract":"<p>The reentrant flowshop scheduling problems (RFSP) are ubiquitous in high-tech industries such as semiconductor manufacturing and liquid crystal display (LCD) production. Given the complexity of RFSP, it is significant to improve the production efficiency using effective intelligent optimisation techniques. In this study, four meta-heuristics assisted by two reinforcement learning (RL) algorithms are proposed to minimise the maximum completion time (makespan) for RFSP. First, a mathematical model for RFSP is established. Second, four meta-heuristics are improved. The Nawaz–Enscore–Ham (NEH) heuristic is utilised for population initialisation. Based on the problem characteristics, we design six local search operators, which are integrated into the four meta-heuristics. Third, two RL algorithms, Q-learning and state–action-reward–state–action (SARSA), are employed to select the appropriate local search operator during iterations to enhance the convergence in a local space. Finally, the results of solving 72 instances indicate that the proposed algorithms perform effectively. The RL-guided local search can significantly enhance the overall performance of the four meta-heuristics. In particular, the artificial bee colony algorithm (ABC) combined with SARSA-guided local search yields the highest performance.</p>","PeriodicalId":33286,"journal":{"name":"IET Collaborative Intelligent Manufacturing","volume":"7 1","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cim2.70029","citationCount":"0","resultStr":"{\"title\":\"Scheduling Reentrant FlowShops: Reinforcement Learning-guided Meta-Heuristics\",\"authors\":\"Jingwen Yuan, Kaizhou Gao, Adam Slowik, Benxue Lu, Yanan Jia\",\"doi\":\"10.1049/cim2.70029\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The reentrant flowshop scheduling problems (RFSP) are ubiquitous in high-tech industries such as semiconductor manufacturing and liquid crystal display (LCD) production. Given the complexity of RFSP, it is significant to improve the production efficiency using effective intelligent optimisation techniques. In this study, four meta-heuristics assisted by two reinforcement learning (RL) algorithms are proposed to minimise the maximum completion time (makespan) for RFSP. First, a mathematical model for RFSP is established. Second, four meta-heuristics are improved. The Nawaz–Enscore–Ham (NEH) heuristic is utilised for population initialisation. Based on the problem characteristics, we design six local search operators, which are integrated into the four meta-heuristics. Third, two RL algorithms, Q-learning and state–action-reward–state–action (SARSA), are employed to select the appropriate local search operator during iterations to enhance the convergence in a local space. Finally, the results of solving 72 instances indicate that the proposed algorithms perform effectively. The RL-guided local search can significantly enhance the overall performance of the four meta-heuristics. In particular, the artificial bee colony algorithm (ABC) combined with SARSA-guided local search yields the highest performance.</p>\",\"PeriodicalId\":33286,\"journal\":{\"name\":\"IET Collaborative Intelligent Manufacturing\",\"volume\":\"7 1\",\"pages\":\"\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-04-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cim2.70029\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Collaborative Intelligent Manufacturing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/cim2.70029\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, INDUSTRIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Collaborative Intelligent Manufacturing","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cim2.70029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0
摘要
可重入流程车间调度问题(RFSP)在半导体制造和液晶显示(LCD)生产等高科技行业中普遍存在。考虑到RFSP的复杂性,使用有效的智能优化技术来提高生产效率具有重要意义。在本研究中,提出了四种元启发式方法,辅以两种强化学习(RL)算法来最小化RFSP的最大完成时间(makespan)。首先,建立了RFSP的数学模型。其次,改进了四种元启发式方法。nawaz - enscoe - ham (NEH)启发式用于种群初始化。根据问题特点,设计了6个局部搜索算子,并将其集成到4个元启发式算法中。第三,采用Q-learning和状态-动作-奖励-状态-动作(SARSA)两种强化学习算法,在迭代过程中选择合适的局部搜索算子,增强局部空间的收敛性。最后,对72个实例进行了求解,结果表明所提算法是有效的。强化学习引导下的局部搜索可以显著提高四种元启发式的整体性能。其中,人工蜂群算法(ABC)与sarsa引导下的局部搜索相结合,获得了最高的性能。
The reentrant flowshop scheduling problems (RFSP) are ubiquitous in high-tech industries such as semiconductor manufacturing and liquid crystal display (LCD) production. Given the complexity of RFSP, it is significant to improve the production efficiency using effective intelligent optimisation techniques. In this study, four meta-heuristics assisted by two reinforcement learning (RL) algorithms are proposed to minimise the maximum completion time (makespan) for RFSP. First, a mathematical model for RFSP is established. Second, four meta-heuristics are improved. The Nawaz–Enscore–Ham (NEH) heuristic is utilised for population initialisation. Based on the problem characteristics, we design six local search operators, which are integrated into the four meta-heuristics. Third, two RL algorithms, Q-learning and state–action-reward–state–action (SARSA), are employed to select the appropriate local search operator during iterations to enhance the convergence in a local space. Finally, the results of solving 72 instances indicate that the proposed algorithms perform effectively. The RL-guided local search can significantly enhance the overall performance of the four meta-heuristics. In particular, the artificial bee colony algorithm (ABC) combined with SARSA-guided local search yields the highest performance.
期刊介绍:
IET Collaborative Intelligent Manufacturing is a Gold Open Access journal that focuses on the development of efficient and adaptive production and distribution systems. It aims to meet the ever-changing market demands by publishing original research on methodologies and techniques for the application of intelligence, data science, and emerging information and communication technologies in various aspects of manufacturing, such as design, modeling, simulation, planning, and optimization of products, processes, production, and assembly.
The journal is indexed in COMPENDEX (Elsevier), Directory of Open Access Journals (DOAJ), Emerging Sources Citation Index (Clarivate Analytics), INSPEC (IET), SCOPUS (Elsevier) and Web of Science (Clarivate Analytics).