PARCO: Learning Parallel Autoregressive Policies for Efficient Multi-Agent Combinatorial Optimization

Federico Berto, Chuanbo Hua, Laurin Luttmann, Jiwoo Son, Junyoung Park, Kyuree Ahn, Changhyun Kwon, Lin Xie, Jinkyoo Park
{"title":"PARCO:学习并行自回归政策,实现高效的多代理组合优化","authors":"Federico Berto, Chuanbo Hua, Laurin Luttmann, Jiwoo Son, Junyoung Park, Kyuree Ahn, Changhyun Kwon, Lin Xie, Jinkyoo Park","doi":"arxiv-2409.03811","DOIUrl":null,"url":null,"abstract":"Multi-agent combinatorial optimization problems such as routing and\nscheduling have great practical relevance but present challenges due to their\nNP-hard combinatorial nature, hard constraints on the number of possible\nagents, and hard-to-optimize objective functions. This paper introduces PARCO\n(Parallel AutoRegressive Combinatorial Optimization), a novel approach that\nlearns fast surrogate solvers for multi-agent combinatorial problems with\nreinforcement learning by employing parallel autoregressive decoding. We\npropose a model with a Multiple Pointer Mechanism to efficiently decode\nmultiple decisions simultaneously by different agents, enhanced by a\nPriority-based Conflict Handling scheme. Moreover, we design specialized\nCommunication Layers that enable effective agent collaboration, thus enriching\ndecision-making. We evaluate PARCO in representative multi-agent combinatorial\nproblems in routing and scheduling and demonstrate that our learned solvers\noffer competitive results against both classical and neural baselines in terms\nof both solution quality and speed. We make our code openly available at\nhttps://github.com/ai4co/parco.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"37 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PARCO: Learning Parallel Autoregressive Policies for Efficient Multi-Agent Combinatorial Optimization\",\"authors\":\"Federico Berto, Chuanbo Hua, Laurin Luttmann, Jiwoo Son, Junyoung Park, Kyuree Ahn, Changhyun Kwon, Lin Xie, Jinkyoo Park\",\"doi\":\"arxiv-2409.03811\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multi-agent combinatorial optimization problems such as routing and\\nscheduling have great practical relevance but present challenges due to their\\nNP-hard combinatorial nature, hard constraints on the number of possible\\nagents, and hard-to-optimize objective functions. This paper introduces PARCO\\n(Parallel AutoRegressive Combinatorial Optimization), a novel approach that\\nlearns fast surrogate solvers for multi-agent combinatorial problems with\\nreinforcement learning by employing parallel autoregressive decoding. We\\npropose a model with a Multiple Pointer Mechanism to efficiently decode\\nmultiple decisions simultaneously by different agents, enhanced by a\\nPriority-based Conflict Handling scheme. Moreover, we design specialized\\nCommunication Layers that enable effective agent collaboration, thus enriching\\ndecision-making. We evaluate PARCO in representative multi-agent combinatorial\\nproblems in routing and scheduling and demonstrate that our learned solvers\\noffer competitive results against both classical and neural baselines in terms\\nof both solution quality and speed. 
We make our code openly available at\\nhttps://github.com/ai4co/parco.\",\"PeriodicalId\":501315,\"journal\":{\"name\":\"arXiv - CS - Multiagent Systems\",\"volume\":\"37 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Multiagent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.03811\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Multiagent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.03811","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0

Abstract

Multi-agent combinatorial optimization problems such as routing and scheduling have great practical relevance but present challenges due to their NP-hard combinatorial nature, hard constraints on the number of possible agents, and hard-to-optimize objective functions. This paper introduces PARCO (Parallel AutoRegressive Combinatorial Optimization), a novel approach that learns fast surrogate solvers for multi-agent combinatorial problems with reinforcement learning by employing parallel autoregressive decoding. We propose a model with a Multiple Pointer Mechanism to efficiently decode multiple decisions simultaneously by different agents, enhanced by a Priority-based Conflict Handling scheme. Moreover, we design specialized Communication Layers that enable effective agent collaboration, thus enriching decision-making. We evaluate PARCO in representative multi-agent combinatorial problems in routing and scheduling and demonstrate that our learned solvers offer competitive results against both classical and neural baselines in terms of both solution quality and speed. We make our code openly available at https://github.com/ai4co/parco.
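To make the decoding scheme described in the abstract concrete, below is a minimal, illustrative sketch of parallel pointer decoding with priority-based conflict handling. This is not the authors' implementation (see the linked repository for that); the function name, tensor shapes, the priority ordering, and the depot fallback (action 0) are assumptions made for illustration only.

```python
# Illustrative sketch (assumptions, not the PARCO codebase): several agents decode
# actions in parallel over shared node embeddings via a pointer-style attention,
# and a priority rule resolves conflicts when agents select the same node.

import torch

def parallel_pointer_step(agent_queries, node_embeddings, visited_mask, priority):
    """One parallel decoding step for all agents.

    agent_queries:    (num_agents, d) decoder states, one per agent
    node_embeddings:  (num_nodes, d) shared encoder output
    visited_mask:     (num_nodes,) True where a node is no longer selectable
    priority:         (num_agents,) lower value = higher priority
    Returns the chosen node index per agent after conflict handling.
    """
    d = node_embeddings.size(-1)
    # Pointer scores: scaled dot-product logits from each agent to each node.
    logits = agent_queries @ node_embeddings.T / d ** 0.5      # (num_agents, num_nodes)
    logits = logits.masked_fill(visited_mask, float("-inf"))   # forbid visited nodes
    actions = logits.argmax(dim=-1)                            # greedy pick per agent

    # Priority-based conflict handling: if several agents point to the same node,
    # only the highest-priority agent keeps it; the others fall back to node 0
    # (interpreted here as a depot / no-op placeholder).
    order = torch.argsort(priority)        # highest to lowest priority
    claimed = set()
    for a in order.tolist():
        node = actions[a].item()
        if node in claimed:
            actions[a] = 0                 # fallback action for the losing agent
        else:
            claimed.add(node)
    return actions

if __name__ == "__main__":
    torch.manual_seed(0)
    num_agents, num_nodes, d = 3, 6, 16
    queries = torch.randn(num_agents, d)
    nodes = torch.randn(num_nodes, d)
    visited = torch.zeros(num_nodes, dtype=torch.bool)
    priority = torch.tensor([2, 0, 1])     # agent 1 has the highest priority
    print(parallel_pointer_step(queries, nodes, visited, priority))
```

In a full model, the per-agent queries would additionally be refined by the communication layers mentioned in the abstract (e.g., attention across agent states) before the pointer step, so that agents can coordinate their simultaneous decisions.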