面向工业物联网柔性制造的置换不变和等变多智能体强化学习

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

IEEE Internet of Things Journal Pub Date : 2025-07-02 DOI:10.1109/JIOT.2025.3584811

Yangyan Zeng;Aidong Liu;Suzhen Huang;Xiaoqun Chen;Wei Liang;Xiaokang Zhou

{"title":"面向工业物联网柔性制造的置换不变和等变多智能体强化学习","authors":"Yangyan Zeng;Aidong Liu;Suzhen Huang;Xiaoqun Chen;Wei Liang;Xiaokang Zhou","doi":"10.1109/JIOT.2025.3584811","DOIUrl":null,"url":null,"abstract":"With the advent of Industrial Internet of Things (IIoT), flexible manufacturing has gained increasing attention. Continuous changes in market demand, real-time data collection and device interconnectivity have made the production process more dynamic and unpredictable, rendering traditional manufacturing scheduling methods inadequate in addressing these changes. This issue can be framed as a dynamic flexible job-shop scheduling problem (DFJSP), which seeks to accommodate ever-evolving production scenarios and requirements by making real-time modifications and optimizing resource allocation. Traditional scheduling algorithms designed for static, single-environment scenarios are increasingly inadequate for handling the growing complexity of production environments. In this context, there is a pressing need for efficient and real-time scheduling algorithms. We propose a multiagent reinforcement learning (MARL) algorithm to solve DFJSP, where each device is associated with a corresponding agent, allowing the algorithm to scale flexibly. The complexity and dynamics of scheduling problems introduce additional challenges and complexities in state representation and decision-making. We propose two solutions to alleviate this problem. First, we employ a heterogeneous graph neural network (HGNN) to capture the relational dependencies between tasks and extract state features. Through multiple feature updates, each agent is enabled to make decisions based on global information. Furthermore, leveraging the inherent permutation invariance (PI) and permutation equivariance (PE) features of tasks in the waiting queue, we apply hypernetwork techniques to address the issue of dimensionality explosion caused by the excessive state space in scheduling environments. Experiments conducted under various scenario settings demonstrate that our scheduling method can significantly reduce task latency and improve resource utilization.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 18","pages":"37863-37875"},"PeriodicalIF":8.9000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Permutation-Invariant and Equivariant Multiagent Reinforcement Learning for Flexible Manufacturing in Industrial IoT\",\"authors\":\"Yangyan Zeng;Aidong Liu;Suzhen Huang;Xiaoqun Chen;Wei Liang;Xiaokang Zhou\",\"doi\":\"10.1109/JIOT.2025.3584811\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the advent of Industrial Internet of Things (IIoT), flexible manufacturing has gained increasing attention. Continuous changes in market demand, real-time data collection and device interconnectivity have made the production process more dynamic and unpredictable, rendering traditional manufacturing scheduling methods inadequate in addressing these changes. This issue can be framed as a dynamic flexible job-shop scheduling problem (DFJSP), which seeks to accommodate ever-evolving production scenarios and requirements by making real-time modifications and optimizing resource allocation. Traditional scheduling algorithms designed for static, single-environment scenarios are increasingly inadequate for handling the growing complexity of production environments. In this context, there is a pressing need for efficient and real-time scheduling algorithms. We propose a multiagent reinforcement learning (MARL) algorithm to solve DFJSP, where each device is associated with a corresponding agent, allowing the algorithm to scale flexibly. The complexity and dynamics of scheduling problems introduce additional challenges and complexities in state representation and decision-making. We propose two solutions to alleviate this problem. First, we employ a heterogeneous graph neural network (HGNN) to capture the relational dependencies between tasks and extract state features. Through multiple feature updates, each agent is enabled to make decisions based on global information. Furthermore, leveraging the inherent permutation invariance (PI) and permutation equivariance (PE) features of tasks in the waiting queue, we apply hypernetwork techniques to address the issue of dimensionality explosion caused by the excessive state space in scheduling environments. Experiments conducted under various scenario settings demonstrate that our scheduling method can significantly reduce task latency and improve resource utilization.\",\"PeriodicalId\":54347,\"journal\":{\"name\":\"IEEE Internet of Things Journal\",\"volume\":\"12 18\",\"pages\":\"37863-37875\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Internet of Things Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11062672/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11062672/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

随着工业物联网（IIoT）的出现，柔性制造越来越受到关注。市场需求的不断变化、实时数据收集和设备互联使生产过程更具动态性和不可预测性，传统的制造调度方法无法应对这些变化。这个问题可以被定义为一个动态灵活的作业车间调度问题（DFJSP），它通过实时修改和优化资源分配来适应不断变化的生产场景和需求。为静态、单一环境场景设计的传统调度算法越来越不适合处理日益复杂的生产环境。在这种情况下，迫切需要高效、实时的调度算法。我们提出了一种多智能体强化学习（MARL）算法来解决DFJSP，其中每个设备都与相应的智能体相关联，从而允许算法灵活扩展。调度问题的复杂性和动态性给状态表示和决策带来了额外的挑战和复杂性。我们提出两种解决方案来缓解这个问题。首先，我们采用异构图神经网络（HGNN）捕获任务之间的关系依赖关系并提取状态特征。通过多个功能更新，每个代理都可以根据全局信息做出决策。此外，利用等待队列中任务固有的置换不变性（PI）和置换等价性（PE）特征，应用超网络技术解决了调度环境中状态空间过大导致的维数爆炸问题。在各种场景设置下进行的实验表明，我们的调度方法可以显著降低任务延迟，提高资源利用率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Permutation-Invariant and Equivariant Multiagent Reinforcement Learning for Flexible Manufacturing in Industrial IoT

With the advent of Industrial Internet of Things (IIoT), flexible manufacturing has gained increasing attention. Continuous changes in market demand, real-time data collection and device interconnectivity have made the production process more dynamic and unpredictable, rendering traditional manufacturing scheduling methods inadequate in addressing these changes. This issue can be framed as a dynamic flexible job-shop scheduling problem (DFJSP), which seeks to accommodate ever-evolving production scenarios and requirements by making real-time modifications and optimizing resource allocation. Traditional scheduling algorithms designed for static, single-environment scenarios are increasingly inadequate for handling the growing complexity of production environments. In this context, there is a pressing need for efficient and real-time scheduling algorithms. We propose a multiagent reinforcement learning (MARL) algorithm to solve DFJSP, where each device is associated with a corresponding agent, allowing the algorithm to scale flexibly. The complexity and dynamics of scheduling problems introduce additional challenges and complexities in state representation and decision-making. We propose two solutions to alleviate this problem. First, we employ a heterogeneous graph neural network (HGNN) to capture the relational dependencies between tasks and extract state features. Through multiple feature updates, each agent is enabled to make decisions based on global information. Furthermore, leveraging the inherent permutation invariance (PI) and permutation equivariance (PE) features of tasks in the waiting queue, we apply hypernetwork techniques to address the issue of dimensionality explosion caused by the excessive state space in scheduling environments. Experiments conducted under various scenario settings demonstrate that our scheduling method can significantly reduce task latency and improve resource utilization.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Internet of Things Journal Computer Science-Information Systems

CiteScore

17.60

自引率

13.20%

发文量

1982

期刊介绍： The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.