A Bidding-Based Deep Reinforcement Learning Approach for Multi-Agent Job Shop Scheduling Problem

IF 8.6 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Gaopan Shen;Shudong Sun;Yaqiong Liu
{"title":"A Bidding-Based Deep Reinforcement Learning Approach for Multi-Agent Job Shop Scheduling Problem","authors":"Gaopan Shen;Shudong Sun;Yaqiong Liu","doi":"10.1109/TSMC.2025.3572451","DOIUrl":null,"url":null,"abstract":"In demand-driven personalized production, multi-agent job shop scheduling problem plays a pivotal role. Balancing the private preferences between consumers poses a challenge in achieving efficient resource allocation. A bidding-based deep reinforcement learning approach is proposed to generate a consensus schedule. An intelligent selector and an intelligent bidder (IB) are designed for each consumer to perform operation selection and determine the corresponding bid price, respectively. A job shop agent is established to collect bids and allocate resource through winner determination. To assist the IB to learn the correlation between consumers and the multi-agent job shop scheduling environment without revealing preferences, a graph neural network is adopted to extract observations. A reward function based on critic value guides the negotiation process among IBs. Extensive computational experiments demonstrate that the proposed approach achieves high social welfare outcomes. It outperforms in handling large-scale instances with more than 11 consumers and 20 jobs per consumer.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 8","pages":"5642-5654"},"PeriodicalIF":8.6000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Systems Man Cybernetics-Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11029239/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

In demand-driven personalized production, multi-agent job shop scheduling problem plays a pivotal role. Balancing the private preferences between consumers poses a challenge in achieving efficient resource allocation. A bidding-based deep reinforcement learning approach is proposed to generate a consensus schedule. An intelligent selector and an intelligent bidder (IB) are designed for each consumer to perform operation selection and determine the corresponding bid price, respectively. A job shop agent is established to collect bids and allocate resource through winner determination. To assist the IB to learn the correlation between consumers and the multi-agent job shop scheduling environment without revealing preferences, a graph neural network is adopted to extract observations. A reward function based on critic value guides the negotiation process among IBs. Extensive computational experiments demonstrate that the proposed approach achieves high social welfare outcomes. It outperforms in handling large-scale instances with more than 11 consumers and 20 jobs per consumer.
基于竞价的多智能体作业车间调度深度强化学习方法
在需求驱动的个性化生产中,多智能体作业车间调度问题起着至关重要的作用。平衡消费者之间的私人偏好是实现有效资源配置的挑战。提出了一种基于竞标的深度强化学习方法来生成共识调度。设计了智能选择器(intelligent selector)和智能竞价器(intelligent bidder, IB),分别为每个消费者进行操作选择和确定相应的投标价格。建立了一个作业车间代理来收集投标并通过确定获胜者来分配资源。为了帮助IB在不暴露偏好的情况下了解消费者与多智能体作业车间调度环境之间的相关性,采用图神经网络提取观察值。基于评价值的奖励函数指导了ibb之间的谈判过程。大量的计算实验表明,所提出的方法取得了较高的社会福利效果。它在处理超过11个消费者和每个消费者20个作业的大规模实例方面表现出色。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IEEE Transactions on Systems Man Cybernetics-Systems
IEEE Transactions on Systems Man Cybernetics-Systems AUTOMATION & CONTROL SYSTEMS-COMPUTER SCIENCE, CYBERNETICS
CiteScore
18.50
自引率
11.50%
发文量
812
审稿时长
6 months
期刊介绍: The IEEE Transactions on Systems, Man, and Cybernetics: Systems encompasses the fields of systems engineering, covering issue formulation, analysis, and modeling throughout the systems engineering lifecycle phases. It addresses decision-making, issue interpretation, systems management, processes, and various methods such as optimization, modeling, and simulation in the development and deployment of large systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信