Learning-aided Neighborhood Search for Vehicle Routing Problems.

IEEE transactions on pattern analysis and machine intelligence Pub Date : 2025-03-24 DOI:10.1109/TPAMI.2025.3554669

Tong Guo, Yi Mei, Mengjie Zhang, Haoran Zhao, Kaiquan Cai, Wenbo Du

{"title":"Learning-aided Neighborhood Search for Vehicle Routing Problems.","authors":"Tong Guo, Yi Mei, Mengjie Zhang, Haoran Zhao, Kaiquan Cai, Wenbo Du","doi":"10.1109/TPAMI.2025.3554669","DOIUrl":null,"url":null,"abstract":"<p><p>The Vehicle Routing Problem (VRP) is a classic optimization problem with diverse real-world applications. The neighborhood search has emerged as an effective approach, yielding high-quality solutions across different VRPs. However, most existing studies exhaustively explore all considered neighborhoods with a pre-fixed order, leading to an inefficient search process. To address this issue, this paper proposes a Learning-aided Neighborhood Search algorithm (LaNS) that employs a cutting-edge multi-agent reinforcement learning-driven adaptive operator/neighborhood selection mechanism to achieve efficient routing for VRP. Within this framework, two agents serve as high-level instructors, collaboratively guiding the search direction by selecting perturbation/improvement operators from a pool of low-level heuristics. Furthermore, to equip the agents with comprehensive information for learning guidance knowledge, we have developed a new informative state representation. This representation transforms the spatial route structures into an image-like tensor, allowing us to extract spatial features using a convolutional neural network. Comprehensive evaluations on diverse VRP benchmarks, including the capacitated VRP (CVRP), multi-depot VRP (MDVRP) and cumulative multi-depot VRP with energy constraints, demonstrate LaNS's superiority over the state-of-the-art neighborhood search methods as well as the existing learning-guided neighborhood search algorithms.</p>","PeriodicalId":94034,"journal":{"name":"IEEE transactions on pattern analysis and machine intelligence","volume":"PP ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on pattern analysis and machine intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TPAMI.2025.3554669","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The Vehicle Routing Problem (VRP) is a classic optimization problem with diverse real-world applications. The neighborhood search has emerged as an effective approach, yielding high-quality solutions across different VRPs. However, most existing studies exhaustively explore all considered neighborhoods with a pre-fixed order, leading to an inefficient search process. To address this issue, this paper proposes a Learning-aided Neighborhood Search algorithm (LaNS) that employs a cutting-edge multi-agent reinforcement learning-driven adaptive operator/neighborhood selection mechanism to achieve efficient routing for VRP. Within this framework, two agents serve as high-level instructors, collaboratively guiding the search direction by selecting perturbation/improvement operators from a pool of low-level heuristics. Furthermore, to equip the agents with comprehensive information for learning guidance knowledge, we have developed a new informative state representation. This representation transforms the spatial route structures into an image-like tensor, allowing us to extract spatial features using a convolutional neural network. Comprehensive evaluations on diverse VRP benchmarks, including the capacitated VRP (CVRP), multi-depot VRP (MDVRP) and cumulative multi-depot VRP with energy constraints, demonstrate LaNS's superiority over the state-of-the-art neighborhood search methods as well as the existing learning-guided neighborhood search algorithms.

查看原文本刊更多论文

车辆路径问题的学习辅助邻域搜索。

车辆路径问题（VRP）是一个具有多种实际应用的经典优化问题。社区搜索已经成为一种有效的方法，在不同的vrp中产生高质量的解决方案。然而，大多数现有的研究都是用预先确定的顺序详尽地探索所有考虑的邻域，导致搜索过程效率低下。为了解决这一问题，本文提出了一种学习辅助邻域搜索算法（LaNS），该算法采用先进的多智能体强化学习驱动的自适应算子/邻域选择机制来实现VRP的高效路由。在这个框架中，两个智能体作为高级讲师，通过从低级启发式池中选择扰动/改进算子来协同指导搜索方向。此外，为了使智能体具有全面的信息来学习引导知识，我们开发了一种新的信息状态表示。这种表示将空间路线结构转换为类似图像的张量，允许我们使用卷积神经网络提取空间特征。对各种VRP基准进行综合评估，包括有能力VRP （CVRP）、多车场VRP （MDVRP）和具有能量约束的累积多车场VRP，证明了局域网比最先进的邻域搜索方法以及现有的学习引导邻域搜索算法具有优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on pattern analysis and machine intelligence

自引率

0.00%

发文量