Yang Deng , Guojun Sheng , Andy H.F. Chow , Zhili Zhou , Qinyang Bai , Zicheng Su
{"title":"Collaborative production control and distributor selection via multi-agent reinforcement learning with differentiable communication","authors":"Yang Deng , Guojun Sheng , Andy H.F. Chow , Zhili Zhou , Qinyang Bai , Zicheng Su","doi":"10.1016/j.eswa.2025.127539","DOIUrl":null,"url":null,"abstract":"<div><div>Collaborative production control and distributor selection are essential for resource allocation and meeting the core of Industry 5.0’s human-centric vision. However, traditional approaches typically handle these decisions independently, failing to adequately address fluctuating market conditions, demand uncertainty, and varying distributor competencies. This paper integrates production control and distributor selection as a Partially Observable Markov Decision Process (POMDP) in a multi-agent system. Specifically, a production control agent optimizes outputs by balancing inventory levels and opportunity costs, while a distributor selection agent dynamically adjusts allocations considering workforce skill diversity, cost efficiency, and equity. The formulated POMDP is solved using a multi-agent reinforcement learning (MARL) framework featuring a differentiable communication layer and GRU-based recurrent neural networks. Numerical experiments conducted under both stable and highly volatile market conditions demonstrate the proposed system’s enhanced adaptability and responsiveness. In particular, inter-agent messaging communication leading to improved welfare metrics and robust performance under diverse distributor-weight configurations. Notably, the resulting system promotes equitable distributor involvement, aligning with Industry 5.0’s emphasis on sustainable, people-centric supply chain operations.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"282 ","pages":"Article 127539"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425011613","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Collaborative production control and distributor selection are essential for resource allocation and meeting the core of Industry 5.0’s human-centric vision. However, traditional approaches typically handle these decisions independently, failing to adequately address fluctuating market conditions, demand uncertainty, and varying distributor competencies. This paper integrates production control and distributor selection as a Partially Observable Markov Decision Process (POMDP) in a multi-agent system. Specifically, a production control agent optimizes outputs by balancing inventory levels and opportunity costs, while a distributor selection agent dynamically adjusts allocations considering workforce skill diversity, cost efficiency, and equity. The formulated POMDP is solved using a multi-agent reinforcement learning (MARL) framework featuring a differentiable communication layer and GRU-based recurrent neural networks. Numerical experiments conducted under both stable and highly volatile market conditions demonstrate the proposed system’s enhanced adaptability and responsiveness. In particular, inter-agent messaging communication leading to improved welfare metrics and robust performance under diverse distributor-weight configurations. Notably, the resulting system promotes equitable distributor involvement, aligning with Industry 5.0’s emphasis on sustainable, people-centric supply chain operations.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.