Jianxiong Zhang , Yuming Jiang , Bing Guo , Tingting Liu , Dasha Hu , Jinbo Zhang , Yifei Deng , Hao Wang , Jv Yang , Xuefeng Ding
{"title":"Dynamic scheduling for cloud manufacturing with uncertain events by hierarchical reinforcement learning and attention mechanism","authors":"Jianxiong Zhang , Yuming Jiang , Bing Guo , Tingting Liu , Dasha Hu , Jinbo Zhang , Yifei Deng , Hao Wang , Jv Yang , Xuefeng Ding","doi":"10.1016/j.knosys.2025.113335","DOIUrl":null,"url":null,"abstract":"<div><div>Cloud manufacturing provides a platform for many-to-many scheduling of consumer tasks assigned to service providers. The dynamics and uncertainties of the cloud environment pose stringent requirements on the real-time performance and generalizability of scheduling algorithms. Moreover, the continuous variations in environmental states, task scale, and service statuses further complicate decision-making. However, existing dynamic scheduling methods, primarily developed to address static environments and constant scales, fall short of addressing the escalating volatility and complexity of real-world scheduling. To achieve near-real-time decision-making, a deep hierarchical reinforcement learning framework incorporating attention mechanisms and pointer networks is proposed for multi-objective dynamic scheduling in cloud manufacturing. This framework divides the scheduling problem into three subproblems (optimization objectives, manufacturing tasks, and service selection) and leverages a hierarchical structure to realize a three-step scheduling decision-making process. The proposed framework comprises three encoder–decoder-based agents, each corresponding to a subproblem and collaborating to achieve the overall decision. The agents utilize the multi-head attention mechanism to extract inter-task and inter-service relationships, enhancing decision precision and environmental adaptability. Additionally, the pointer network is incorporated into each agent, endowing the proposed framework with generalizability when inserting new tasks (or services) or removing existing ones. Experimental results across nine dynamic scenarios demonstrate that our framework outperforms five deep reinforcement learning algorithms and three meta-heuristics in terms of scheduling performance and runtime. Results from six out-of-training-scale instances further indicate that our framework exhibits superior generalization and scalability.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"316 ","pages":"Article 113335"},"PeriodicalIF":7.6000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095070512500382X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Cloud manufacturing provides a platform for many-to-many scheduling of consumer tasks assigned to service providers. The dynamics and uncertainties of the cloud environment pose stringent requirements on the real-time performance and generalizability of scheduling algorithms. Moreover, the continuous variations in environmental states, task scale, and service statuses further complicate decision-making. However, existing dynamic scheduling methods, primarily developed to address static environments and constant scales, fall short of addressing the escalating volatility and complexity of real-world scheduling. To achieve near-real-time decision-making, a deep hierarchical reinforcement learning framework incorporating attention mechanisms and pointer networks is proposed for multi-objective dynamic scheduling in cloud manufacturing. This framework divides the scheduling problem into three subproblems (optimization objectives, manufacturing tasks, and service selection) and leverages a hierarchical structure to realize a three-step scheduling decision-making process. The proposed framework comprises three encoder–decoder-based agents, each corresponding to a subproblem and collaborating to achieve the overall decision. The agents utilize the multi-head attention mechanism to extract inter-task and inter-service relationships, enhancing decision precision and environmental adaptability. Additionally, the pointer network is incorporated into each agent, endowing the proposed framework with generalizability when inserting new tasks (or services) or removing existing ones. Experimental results across nine dynamic scenarios demonstrate that our framework outperforms five deep reinforcement learning algorithms and three meta-heuristics in terms of scheduling performance and runtime. Results from six out-of-training-scale instances further indicate that our framework exhibits superior generalization and scalability.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.