{"title":"面向工业物联网柔性制造的置换不变和等变多智能体强化学习","authors":"Yangyan Zeng;Aidong Liu;Suzhen Huang;Xiaoqun Chen;Wei Liang;Xiaokang Zhou","doi":"10.1109/JIOT.2025.3584811","DOIUrl":null,"url":null,"abstract":"With the advent of Industrial Internet of Things (IIoT), flexible manufacturing has gained increasing attention. Continuous changes in market demand, real-time data collection and device interconnectivity have made the production process more dynamic and unpredictable, rendering traditional manufacturing scheduling methods inadequate in addressing these changes. This issue can be framed as a dynamic flexible job-shop scheduling problem (DFJSP), which seeks to accommodate ever-evolving production scenarios and requirements by making real-time modifications and optimizing resource allocation. Traditional scheduling algorithms designed for static, single-environment scenarios are increasingly inadequate for handling the growing complexity of production environments. In this context, there is a pressing need for efficient and real-time scheduling algorithms. We propose a multiagent reinforcement learning (MARL) algorithm to solve DFJSP, where each device is associated with a corresponding agent, allowing the algorithm to scale flexibly. The complexity and dynamics of scheduling problems introduce additional challenges and complexities in state representation and decision-making. We propose two solutions to alleviate this problem. First, we employ a heterogeneous graph neural network (HGNN) to capture the relational dependencies between tasks and extract state features. Through multiple feature updates, each agent is enabled to make decisions based on global information. Furthermore, leveraging the inherent permutation invariance (PI) and permutation equivariance (PE) features of tasks in the waiting queue, we apply hypernetwork techniques to address the issue of dimensionality explosion caused by the excessive state space in scheduling environments. Experiments conducted under various scenario settings demonstrate that our scheduling method can significantly reduce task latency and improve resource utilization.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 18","pages":"37863-37875"},"PeriodicalIF":8.9000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Permutation-Invariant and Equivariant Multiagent Reinforcement Learning for Flexible Manufacturing in Industrial IoT\",\"authors\":\"Yangyan Zeng;Aidong Liu;Suzhen Huang;Xiaoqun Chen;Wei Liang;Xiaokang Zhou\",\"doi\":\"10.1109/JIOT.2025.3584811\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the advent of Industrial Internet of Things (IIoT), flexible manufacturing has gained increasing attention. Continuous changes in market demand, real-time data collection and device interconnectivity have made the production process more dynamic and unpredictable, rendering traditional manufacturing scheduling methods inadequate in addressing these changes. This issue can be framed as a dynamic flexible job-shop scheduling problem (DFJSP), which seeks to accommodate ever-evolving production scenarios and requirements by making real-time modifications and optimizing resource allocation. Traditional scheduling algorithms designed for static, single-environment scenarios are increasingly inadequate for handling the growing complexity of production environments. In this context, there is a pressing need for efficient and real-time scheduling algorithms. We propose a multiagent reinforcement learning (MARL) algorithm to solve DFJSP, where each device is associated with a corresponding agent, allowing the algorithm to scale flexibly. The complexity and dynamics of scheduling problems introduce additional challenges and complexities in state representation and decision-making. We propose two solutions to alleviate this problem. First, we employ a heterogeneous graph neural network (HGNN) to capture the relational dependencies between tasks and extract state features. Through multiple feature updates, each agent is enabled to make decisions based on global information. Furthermore, leveraging the inherent permutation invariance (PI) and permutation equivariance (PE) features of tasks in the waiting queue, we apply hypernetwork techniques to address the issue of dimensionality explosion caused by the excessive state space in scheduling environments. Experiments conducted under various scenario settings demonstrate that our scheduling method can significantly reduce task latency and improve resource utilization.\",\"PeriodicalId\":54347,\"journal\":{\"name\":\"IEEE Internet of Things Journal\",\"volume\":\"12 18\",\"pages\":\"37863-37875\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Internet of Things Journal\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11062672/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11062672/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Permutation-Invariant and Equivariant Multiagent Reinforcement Learning for Flexible Manufacturing in Industrial IoT
With the advent of Industrial Internet of Things (IIoT), flexible manufacturing has gained increasing attention. Continuous changes in market demand, real-time data collection and device interconnectivity have made the production process more dynamic and unpredictable, rendering traditional manufacturing scheduling methods inadequate in addressing these changes. This issue can be framed as a dynamic flexible job-shop scheduling problem (DFJSP), which seeks to accommodate ever-evolving production scenarios and requirements by making real-time modifications and optimizing resource allocation. Traditional scheduling algorithms designed for static, single-environment scenarios are increasingly inadequate for handling the growing complexity of production environments. In this context, there is a pressing need for efficient and real-time scheduling algorithms. We propose a multiagent reinforcement learning (MARL) algorithm to solve DFJSP, where each device is associated with a corresponding agent, allowing the algorithm to scale flexibly. The complexity and dynamics of scheduling problems introduce additional challenges and complexities in state representation and decision-making. We propose two solutions to alleviate this problem. First, we employ a heterogeneous graph neural network (HGNN) to capture the relational dependencies between tasks and extract state features. Through multiple feature updates, each agent is enabled to make decisions based on global information. Furthermore, leveraging the inherent permutation invariance (PI) and permutation equivariance (PE) features of tasks in the waiting queue, we apply hypernetwork techniques to address the issue of dimensionality explosion caused by the excessive state space in scheduling environments. Experiments conducted under various scenario settings demonstrate that our scheduling method can significantly reduce task latency and improve resource utilization.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.