A strip-packing constructive algorithm with deep reinforcement learning for dynamic resource-constrained seru scheduling problems

IF 3.1 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Soft Computing Pub Date : 2024-07-26 DOI:10.1007/s00500-024-09815-8

Yiran Xiang, Zhe Zhang, Xue Gong, Xiaoling Song, Yong Yin

{"title":"A strip-packing constructive algorithm with deep reinforcement learning for dynamic resource-constrained seru scheduling problems","authors":"Yiran Xiang, Zhe Zhang, Xue Gong, Xiaoling Song, Yong Yin","doi":"10.1007/s00500-024-09815-8","DOIUrl":null,"url":null,"abstract":"This study focuses on unspecified dynamic seru scheduling problems with resource constraints (UDSS-R) in seru production system (SPS). A mixed integer linear programming model is formulated to minimize the makespan, which is solved sequentially from both allocation and scheduling perspectives by a strip-packing constructive algorithm (SPCA) with deep reinforcement learning (DRL). The training samples are trained by the DRL model, and the reward values obtained are calculated by SPCA to train the network so that the agent can find a better solution. The output of DRL is the scheduling order of jobs in serus, while the solution of UDSS-R is solved by SPCA. Finally, a set of test instances are generated to conduct computational experiments with different instance scales for the DRL-SPCA, and the results confirm the effectiveness of proposed DRL-SPCA in solving UDSS-R with more outstanding performance in terms of solution quality and efficiency, across three data scales (10 serus × 100 jobs, 20 serus × 250 jobs, and 30 serus × 400 jobs), compared with GA and SAA, the Avg. RPD of DRL-SPCA decreased by 9.93% and 7.56%, 13.36% and 10.72%, and 9.09% and 7.08%, respectively. In addition, the Avg. CPU time was reduced by 29.53% and 27.93%, 57.48% and 57.04%, and 61.73% and 61.76%, respectively.","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"47 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00500-024-09815-8","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

This study focuses on unspecified dynamic seru scheduling problems with resource constraints (UDSS-R) in seru production system (SPS). A mixed integer linear programming model is formulated to minimize the makespan, which is solved sequentially from both allocation and scheduling perspectives by a strip-packing constructive algorithm (SPCA) with deep reinforcement learning (DRL). The training samples are trained by the DRL model, and the reward values obtained are calculated by SPCA to train the network so that the agent can find a better solution. The output of DRL is the scheduling order of jobs in serus, while the solution of UDSS-R is solved by SPCA. Finally, a set of test instances are generated to conduct computational experiments with different instance scales for the DRL-SPCA, and the results confirm the effectiveness of proposed DRL-SPCA in solving UDSS-R with more outstanding performance in terms of solution quality and efficiency, across three data scales (10 serus × 100 jobs, 20 serus × 250 jobs, and 30 serus × 400 jobs), compared with GA and SAA, the Avg. RPD of DRL-SPCA decreased by 9.93% and 7.56%, 13.36% and 10.72%, and 9.09% and 7.08%, respectively. In addition, the Avg. CPU time was reduced by 29.53% and 27.93%, 57.48% and 57.04%, and 61.73% and 61.76%, respectively.

Abstract Image

查看原文本刊更多论文

针对资源受限的动态 seru 调度问题的带状包装构造算法与深度强化学习

本研究的重点是血清生产系统（SPS）中具有资源约束的非指定动态血清调度问题（UDSS-R）。为了最小化工期，建立了一个混合整数线性规划模型，并通过带深度强化学习（DRL）的条状包装构造算法（SPCA）从分配和调度两个角度依次求解。训练样本由 DRL 模型训练，获得的奖励值由 SPCA 计算，以训练网络，从而使代理找到更好的解决方案。DRL 的输出是 serus 中工作的调度顺序，而 UDSS-R 的解则由 SPCA 解决。最后，生成了一组测试实例，对 DRL-SPCA 进行了不同实例规模的计算实验，结果证实了所提出的 DRL-SPCA 在求解 UDSS-R 时的有效性，在三种数据规模（10 serus × 100 个作业、20 serus × 250 个作业和 30 serus × 400 个作业）下，与 GA 和 SAA 相比，DRL-SPCA 的 Avg.与 GA 和 SAA 相比，DRL-SPCA 的平均 RPD 分别下降了 9.93% 和 7.56%，13.36% 和 10.72%，以及 9.09% 和 7.08%。此外，平均 CPU 时间减少了 29.53%。CPU 时间分别减少了 29.53% 和 27.93%，57.48% 和 57.04%，以及 61.73% 和 61.76%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Soft Computing 工程技术-计算机：跨学科应用

CiteScore

8.10

自引率

9.80%

发文量

927

审稿时长

7.3 months

期刊介绍： Soft Computing is dedicated to system solutions based on soft computing techniques. It provides rapid dissemination of important results in soft computing technologies, a fusion of research in evolutionary algorithms and genetic programming, neural science and neural net systems, fuzzy set theory and fuzzy systems, and chaos theory and chaotic systems. Soft Computing encourages the integration of soft computing techniques and tools into both everyday and advanced applications. By linking the ideas and techniques of soft computing with other disciplines, the journal serves as a unifying platform that fosters comparisons, extensions, and new applications. As a result, the journal is an international forum for all scientists and engineers engaged in research and development in this fast growing field.