A Bidding-Based Deep Reinforcement Learning Approach for Multi-Agent Job Shop Scheduling Problem

IF 8.6 1区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

IEEE Transactions on Systems Man Cybernetics-Systems Pub Date : 2025-06-10 DOI:10.1109/TSMC.2025.3572451

Gaopan Shen;Shudong Sun;Yaqiong Liu

{"title":"A Bidding-Based Deep Reinforcement Learning Approach for Multi-Agent Job Shop Scheduling Problem","authors":"Gaopan Shen;Shudong Sun;Yaqiong Liu","doi":"10.1109/TSMC.2025.3572451","DOIUrl":null,"url":null,"abstract":"In demand-driven personalized production, multi-agent job shop scheduling problem plays a pivotal role. Balancing the private preferences between consumers poses a challenge in achieving efficient resource allocation. A bidding-based deep reinforcement learning approach is proposed to generate a consensus schedule. An intelligent selector and an intelligent bidder (IB) are designed for each consumer to perform operation selection and determine the corresponding bid price, respectively. A job shop agent is established to collect bids and allocate resource through winner determination. To assist the IB to learn the correlation between consumers and the multi-agent job shop scheduling environment without revealing preferences, a graph neural network is adopted to extract observations. A reward function based on critic value guides the negotiation process among IBs. Extensive computational experiments demonstrate that the proposed approach achieves high social welfare outcomes. It outperforms in handling large-scale instances with more than 11 consumers and 20 jobs per consumer.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 8","pages":"5642-5654"},"PeriodicalIF":8.6000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Systems Man Cybernetics-Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11029239/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

In demand-driven personalized production, multi-agent job shop scheduling problem plays a pivotal role. Balancing the private preferences between consumers poses a challenge in achieving efficient resource allocation. A bidding-based deep reinforcement learning approach is proposed to generate a consensus schedule. An intelligent selector and an intelligent bidder (IB) are designed for each consumer to perform operation selection and determine the corresponding bid price, respectively. A job shop agent is established to collect bids and allocate resource through winner determination. To assist the IB to learn the correlation between consumers and the multi-agent job shop scheduling environment without revealing preferences, a graph neural network is adopted to extract observations. A reward function based on critic value guides the negotiation process among IBs. Extensive computational experiments demonstrate that the proposed approach achieves high social welfare outcomes. It outperforms in handling large-scale instances with more than 11 consumers and 20 jobs per consumer.

查看原文本刊更多论文

基于竞价的多智能体作业车间调度深度强化学习方法

在需求驱动的个性化生产中，多智能体作业车间调度问题起着至关重要的作用。平衡消费者之间的私人偏好是实现有效资源配置的挑战。提出了一种基于竞标的深度强化学习方法来生成共识调度。设计了智能选择器（intelligent selector）和智能竞价器（intelligent bidder， IB），分别为每个消费者进行操作选择和确定相应的投标价格。建立了一个作业车间代理来收集投标并通过确定获胜者来分配资源。为了帮助IB在不暴露偏好的情况下了解消费者与多智能体作业车间调度环境之间的相关性，采用图神经网络提取观察值。基于评价值的奖励函数指导了ibb之间的谈判过程。大量的计算实验表明，所提出的方法取得了较高的社会福利效果。它在处理超过11个消费者和每个消费者20个作业的大规模实例方面表现出色。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Systems Man Cybernetics-Systems AUTOMATION & CONTROL SYSTEMS-COMPUTER SCIENCE, CYBERNETICS

CiteScore

18.50

自引率

11.50%

发文量

812

审稿时长

6 months

期刊介绍： The IEEE Transactions on Systems, Man, and Cybernetics: Systems encompasses the fields of systems engineering, covering issue formulation, analysis, and modeling throughout the systems engineering lifecycle phases. It addresses decision-making, issue interpretation, systems management, processes, and various methods such as optimization, modeling, and simulation in the development and deployment of large systems.