A novel multi-agent reinforcement learning framework for robust exception handling of manufacturing service collaboration based on asymmetric information
Xin Luo , Chunrong Pan , Zhengchao Liu , Lei Wang , Shibao Pang , Lifa He
{"title":"A novel multi-agent reinforcement learning framework for robust exception handling of manufacturing service collaboration based on asymmetric information","authors":"Xin Luo , Chunrong Pan , Zhengchao Liu , Lei Wang , Shibao Pang , Lifa He","doi":"10.1016/j.jmsy.2025.01.016","DOIUrl":null,"url":null,"abstract":"<div><div>Industrial internet platforms enable users to efficiently fulfill their customized needs through the sequential execution of a manufacturing service collaborative chain (MSCC) consisting of networked enterprises. However, various dynamic uncertainties (e.g., equipment failure, emergency order insertion, product quality deterioration) may interrupt the execution of the MSCC, resulting in processing overruns and reduced user willingness to customize. To enhance the ability of MSCC to respond to exception events (namely robustness), the asymmetric informative multi-agent reinforcement learning (AIMARL) method is proposed. AIMARL will re-select the appropriate manufacturing service for the unexecuted subtasks in the event of an MSCC exception. First, the method gives a definition way of MSCC robustness labels from the perspective of the platform and networked enterprises. Subsequently, the asymmetric cascade state and data-rule-driven asymmetric reward are designed based on the characteristics of unidirectional asymmetric information transmission in the sequential execution of the MSCC. Meanwhile, in order to fully utilize the graph features of the MSCC and extract the complex relationships between services, graph convolutional networks are embedded in both the asymmetric cascade state and data-rule-driven asymmetric reward. Experimental results demonstrate that AIMARL outperforms the other four multi-agent reinforcement learning methods for the problem. In addition, AIMARL is able to cope with dynamic uncertainties with better robustness than the anomaly handling methods used in the platform.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"79 ","pages":"Pages 364-382"},"PeriodicalIF":12.2000,"publicationDate":"2025-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Manufacturing Systems","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S027861252500024X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0
Abstract
Industrial internet platforms enable users to efficiently fulfill their customized needs through the sequential execution of a manufacturing service collaborative chain (MSCC) consisting of networked enterprises. However, various dynamic uncertainties (e.g., equipment failure, emergency order insertion, product quality deterioration) may interrupt the execution of the MSCC, resulting in processing overruns and reduced user willingness to customize. To enhance the ability of MSCC to respond to exception events (namely robustness), the asymmetric informative multi-agent reinforcement learning (AIMARL) method is proposed. AIMARL will re-select the appropriate manufacturing service for the unexecuted subtasks in the event of an MSCC exception. First, the method gives a definition way of MSCC robustness labels from the perspective of the platform and networked enterprises. Subsequently, the asymmetric cascade state and data-rule-driven asymmetric reward are designed based on the characteristics of unidirectional asymmetric information transmission in the sequential execution of the MSCC. Meanwhile, in order to fully utilize the graph features of the MSCC and extract the complex relationships between services, graph convolutional networks are embedded in both the asymmetric cascade state and data-rule-driven asymmetric reward. Experimental results demonstrate that AIMARL outperforms the other four multi-agent reinforcement learning methods for the problem. In addition, AIMARL is able to cope with dynamic uncertainties with better robustness than the anomaly handling methods used in the platform.
期刊介绍:
The Journal of Manufacturing Systems is dedicated to showcasing cutting-edge fundamental and applied research in manufacturing at the systems level. Encompassing products, equipment, people, information, control, and support functions, manufacturing systems play a pivotal role in the economical and competitive development, production, delivery, and total lifecycle of products, meeting market and societal needs.
With a commitment to publishing archival scholarly literature, the journal strives to advance the state of the art in manufacturing systems and foster innovation in crafting efficient, robust, and sustainable manufacturing systems. The focus extends from equipment-level considerations to the broader scope of the extended enterprise. The Journal welcomes research addressing challenges across various scales, including nano, micro, and macro-scale manufacturing, and spanning diverse sectors such as aerospace, automotive, energy, and medical device manufacturing.