Gase: graph attention sampling with edges fusion for solving vehicle routing problems

IF 3.3 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Memetic Computing Pub Date : 2024-08-06 DOI:10.1007/s12293-024-00428-0

Zhenwei Wang, Ruibin Bai, Fazlullah Khan, Ender Özcan, Tiehua Zhang

{"title":"Gase: graph attention sampling with edges fusion for solving vehicle routing problems","authors":"Zhenwei Wang, Ruibin Bai, Fazlullah Khan, Ender Özcan, Tiehua Zhang","doi":"10.1007/s12293-024-00428-0","DOIUrl":null,"url":null,"abstract":"<p>Learning-based methods have become increasingly popular for solving vehicle routing problems (VRP) due to their near-optimal performance and fast inference speed. Among them, the combination of deep reinforcement learning and graph representation allows for the abstraction of node topology structures and features in an encoder–decoder style. Such an approach makes it possible to solve routing problems end-to-end without needing complicated heuristic operators designed by domain experts. Existing research studies have been focusing on novel encoding and decoding structures via various neural network models to enhance the node embedding representation. Despite the sophisticated approaches being designed for VRP, there is a noticeable lack of consideration for the graph-theoretic properties inherent to routing problems. Moreover, the potential ramifications of inter-nodal interactions on the decision-making efficacy of the models have not been adequately explored. To bridge this gap, we propose an adaptive graph attention sampling with the edges fusion framework, where nodes’ embedding is determined through attention calculation from certain highly correlated neighborhoods and edges, utilizing a filtered adjacency matrix. In detail, the selections of particular neighbors and adjacency edges are led by a multi-head attention mechanism, contributing directly to the message passing and node embedding in graph attention sampling networks. Furthermore, an adaptive actor-critic algorithm with policy improvements is incorporated to expedite the training convergence. We then conduct comprehensive experiments against baseline methods on learning-based VRP tasks from different perspectives. Our proposed model outperforms the existing methods by 2.08–6.23% and shows stronger generalization ability, achieving the state-of-the-art performance on randomly generated instances and standard benchmark datasets.</p>","PeriodicalId":48780,"journal":{"name":"Memetic Computing","volume":"79 1","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Memetic Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s12293-024-00428-0","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Learning-based methods have become increasingly popular for solving vehicle routing problems (VRP) due to their near-optimal performance and fast inference speed. Among them, the combination of deep reinforcement learning and graph representation allows for the abstraction of node topology structures and features in an encoder–decoder style. Such an approach makes it possible to solve routing problems end-to-end without needing complicated heuristic operators designed by domain experts. Existing research studies have been focusing on novel encoding and decoding structures via various neural network models to enhance the node embedding representation. Despite the sophisticated approaches being designed for VRP, there is a noticeable lack of consideration for the graph-theoretic properties inherent to routing problems. Moreover, the potential ramifications of inter-nodal interactions on the decision-making efficacy of the models have not been adequately explored. To bridge this gap, we propose an adaptive graph attention sampling with the edges fusion framework, where nodes’ embedding is determined through attention calculation from certain highly correlated neighborhoods and edges, utilizing a filtered adjacency matrix. In detail, the selections of particular neighbors and adjacency edges are led by a multi-head attention mechanism, contributing directly to the message passing and node embedding in graph attention sampling networks. Furthermore, an adaptive actor-critic algorithm with policy improvements is incorporated to expedite the training convergence. We then conduct comprehensive experiments against baseline methods on learning-based VRP tasks from different perspectives. Our proposed model outperforms the existing methods by 2.08–6.23% and shows stronger generalization ability, achieving the state-of-the-art performance on randomly generated instances and standard benchmark datasets.

Abstract Image

查看原文本刊更多论文

Gase：图注意采样与边缘融合用于解决车辆路由问题

基于学习的方法因其接近最优的性能和快速的推理速度，在解决车辆路由问题（VRP）方面越来越受欢迎。其中，深度强化学习与图表示法的结合可以以编码器-解码器的方式抽象出节点拓扑结构和特征。这种方法可以端到端地解决路由问题，而无需领域专家设计复杂的启发式算子。现有的研究一直在关注通过各种神经网络模型来增强节点嵌入表示的新型编码和解码结构。尽管为 VRP 设计了复杂的方法，但明显缺乏对路由问题固有的图论特性的考虑。此外，节点间的相互作用对模型决策效率的潜在影响也未得到充分探讨。为了弥补这一不足，我们提出了一种边缘融合框架下的自适应图注意力采样，即利用过滤邻接矩阵，通过对某些高度相关的邻域和边缘进行注意力计算来确定节点的嵌入。具体来说，特定邻域和邻接边的选择由多头注意力机制主导，直接促进图注意力采样网络中的信息传递和节点嵌入。此外，为了加快训练收敛速度，我们还采用了一种具有政策改进功能的自适应演员批评算法。然后，我们从不同角度对基于学习的 VRP 任务的基准方法进行了综合实验。我们提出的模型优于现有方法 2.08-6.23%，并显示出更强的泛化能力，在随机生成的实例和标准基准数据集上达到了最先进的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Memetic Computing COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-OPERATIONS RESEARCH & MANAGEMENT SCIENCE

CiteScore

6.80

自引率

12.80%

发文量

期刊介绍： Memes have been defined as basic units of transferrable information that reside in the brain and are propagated across populations through the process of imitation. From an algorithmic point of view, memes have come to be regarded as building-blocks of prior knowledge, expressed in arbitrary computational representations (e.g., local search heuristics, fuzzy rules, neural models, etc.), that have been acquired through experience by a human or machine, and can be imitated (i.e., reused) across problems. The Memetic Computing journal welcomes papers incorporating the aforementioned socio-cultural notion of memes into artificial systems, with particular emphasis on enhancing the efficacy of computational and artificial intelligence techniques for search, optimization, and machine learning through explicit prior knowledge incorporation. The goal of the journal is to thus be an outlet for high quality theoretical and applied research on hybrid, knowledge-driven computational approaches that may be characterized under any of the following categories of memetics: Type 1: General-purpose algorithms integrated with human-crafted heuristics that capture some form of prior domain knowledge; e.g., traditional memetic algorithms hybridizing evolutionary global search with a problem-specific local search. Type 2: Algorithms with the ability to automatically select, adapt, and reuse the most appropriate heuristics from a diverse pool of available choices; e.g., learning a mapping between global search operators and multiple local search schemes, given an optimization problem at hand. Type 3: Algorithms that autonomously learn with experience, adaptively reusing data and/or machine learning models drawn from related problems as prior knowledge in new target tasks of interest; examples include, but are not limited to, transfer learning and optimization, multi-task learning and optimization, or any other multi-X evolutionary learning and optimization methodologies.