工人约束混合流水车间问题的近端策略优化驱动超启发式算法

IF 6.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Soft Computing Pub Date : 2025-09-28 DOI:10.1016/j.asoc.2025.113990

Shengnan Ding , Weishi Shao , Zhongshi Shao , ShengTao Peng , Dechang Pi , Jiaquan Gao

{"title":"工人约束混合流水车间问题的近端策略优化驱动超启发式算法","authors":"Shengnan Ding , Weishi Shao , Zhongshi Shao , ShengTao Peng , Dechang Pi , Jiaquan Gao","doi":"10.1016/j.asoc.2025.113990","DOIUrl":null,"url":null,"abstract":"<div><div>As the production environment becomes increasingly complex, the integration of soft computing techniques becomes essential for addressing resource-constrained scheduling problems. This paper delves into a worker constrained hybrid flow shop scheduling problem (WHFSP) that integrates worker resources at each processing stage. A mixed-integer linear programming (MILP) model is constructed which enables the use of mathematical solvers to obtain optimal solutions for small-scale instances. Additionally, a novel soft computing-based scheduling framework, namely a proximal policy optimization-based hyper-heuristic algorithm (PPO-HH), is proposed. It automatically selects the most suitable low-level heuristic strategies based on the current state and historical data, facilitating efficient exploration and exploitation of the complex decision space. Several low-level heuristics including perturbative and local search operators are developed to explore the solution space. Subsequently, a high-level control strategy based on proximal policy optimization is proposed. A solution quality evaluation function and a reward mechanism based on problem characteristics are formulated. This mechanism provides feedback to PPO-HH based on the degree of alignment between the actions taken by the agent and the objectives, gradually optimizing the selection of low-level heuristic strategies. Eventually, it generates a probability distribution for each low-level heuristic in the given environment. Comprehensive numerical experiments are conducted to evaluate the performance of both the MILP model and the components of the PPO-HH algorithm. The comparison results show that PPO-HH is effective and efficient for solving the WHFSP.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"185 ","pages":"Article 113990"},"PeriodicalIF":6.6000,"publicationDate":"2025-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A proximal policy optimization driven hyper-heuristic for workers constrained hybrid flow shop problem\",\"authors\":\"Shengnan Ding , Weishi Shao , Zhongshi Shao , ShengTao Peng , Dechang Pi , Jiaquan Gao\",\"doi\":\"10.1016/j.asoc.2025.113990\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>As the production environment becomes increasingly complex, the integration of soft computing techniques becomes essential for addressing resource-constrained scheduling problems. This paper delves into a worker constrained hybrid flow shop scheduling problem (WHFSP) that integrates worker resources at each processing stage. A mixed-integer linear programming (MILP) model is constructed which enables the use of mathematical solvers to obtain optimal solutions for small-scale instances. Additionally, a novel soft computing-based scheduling framework, namely a proximal policy optimization-based hyper-heuristic algorithm (PPO-HH), is proposed. It automatically selects the most suitable low-level heuristic strategies based on the current state and historical data, facilitating efficient exploration and exploitation of the complex decision space. Several low-level heuristics including perturbative and local search operators are developed to explore the solution space. Subsequently, a high-level control strategy based on proximal policy optimization is proposed. A solution quality evaluation function and a reward mechanism based on problem characteristics are formulated. This mechanism provides feedback to PPO-HH based on the degree of alignment between the actions taken by the agent and the objectives, gradually optimizing the selection of low-level heuristic strategies. Eventually, it generates a probability distribution for each low-level heuristic in the given environment. Comprehensive numerical experiments are conducted to evaluate the performance of both the MILP model and the components of the PPO-HH algorithm. The comparison results show that PPO-HH is effective and efficient for solving the WHFSP.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"185 \",\"pages\":\"Article 113990\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2025-09-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625013031\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625013031","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

随着生产环境变得越来越复杂，软计算技术的集成对于解决资源受限的调度问题变得至关重要。本文研究了一种基于工人约束的混合流车间调度问题，该问题集成了各个加工阶段的工人资源。构造了一个混合整数线性规划（MILP）模型，使数学求解器能够得到小规模实例的最优解。此外，提出了一种新的基于软计算的调度框架，即基于近端策略优化的超启发式算法（PPO-HH）。它根据当前状态和历史数据自动选择最合适的低级启发式策略，便于对复杂决策空间进行高效的探索和利用。提出了几种低级启发式方法，包括微扰算子和局部搜索算子来探索解空间。在此基础上，提出了一种基于近端策略优化的高级控制策略。提出了基于问题特征的解质量评价函数和奖励机制。该机制根据智能体所采取的行动与目标的一致程度向PPO-HH提供反馈，逐步优化低级启发式策略的选择。最后，它为给定环境中的每个低级启发式生成一个概率分布。通过全面的数值实验来评估MILP模型和PPO-HH算法各组成部分的性能。对比结果表明，PPO-HH算法是求解WHFSP的有效方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A proximal policy optimization driven hyper-heuristic for workers constrained hybrid flow shop problem

As the production environment becomes increasingly complex, the integration of soft computing techniques becomes essential for addressing resource-constrained scheduling problems. This paper delves into a worker constrained hybrid flow shop scheduling problem (WHFSP) that integrates worker resources at each processing stage. A mixed-integer linear programming (MILP) model is constructed which enables the use of mathematical solvers to obtain optimal solutions for small-scale instances. Additionally, a novel soft computing-based scheduling framework, namely a proximal policy optimization-based hyper-heuristic algorithm (PPO-HH), is proposed. It automatically selects the most suitable low-level heuristic strategies based on the current state and historical data, facilitating efficient exploration and exploitation of the complex decision space. Several low-level heuristics including perturbative and local search operators are developed to explore the solution space. Subsequently, a high-level control strategy based on proximal policy optimization is proposed. A solution quality evaluation function and a reward mechanism based on problem characteristics are formulated. This mechanism provides feedback to PPO-HH based on the degree of alignment between the actions taken by the agent and the objectives, gradually optimizing the selection of low-level heuristic strategies. Eventually, it generates a probability distribution for each low-level heuristic in the given environment. Comprehensive numerical experiments are conducted to evaluate the performance of both the MILP model and the components of the PPO-HH algorithm. The comparison results show that PPO-HH is effective and efficient for solving the WHFSP.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied Soft Computing 工程技术-计算机：跨学科应用

CiteScore

15.80

自引率

6.90%

发文量

874

审稿时长

10.9 months

期刊介绍： Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.