Solving the capacitated vehicle routing problem with timing windows using rollouts and MAX-SAT

2022 Eighth Indian Control Conference (ICC) Pub Date : 2022-06-14 DOI:10.1109/ICC56513.2022.10093678

H. Khadilkar

引用次数: 2

Abstract

The vehicle routing problem is a well known class of NP-hard combinatorial optimisation problems in literature. Traditional solution methods involve either carefully designed heuristics, or time-consuming metaheuristics. Recent work in reinforcement learning has been a promising alternative approach, but has found it difficult to compete with traditional methods in terms of solution quality. This paper proposes a hybrid approach that combines reinforcement learning, policy rollouts, and a satisfiability solver to enable a tunable tradeoff between computation times and solution quality. Results on a popular public data set show that the algorithm is able to produce solutions closer to optimal levels than existing learning based approaches, and with shorter computation times than meta-heuristics. The approach requires minimal design effort and is able to solve unseen problems of arbitrary scale without additional training. Furthermore, the methodology is generalisable to other combinatorial optimisation problems.

查看原文本刊更多论文

利用滚动和MAX-SAT求解带定时窗的有容车辆路径问题

在文献中，车辆路线问题是一类众所周知的NP-hard组合优化问题。传统的解决方法包括精心设计的启发式方法或耗时的元启发式方法。最近在强化学习方面的工作是一种很有前途的替代方法，但在解决方案质量方面很难与传统方法竞争。本文提出了一种混合方法，结合了强化学习，策略推出和满意度求解器，以实现计算时间和解决方案质量之间的可调权衡。在一个流行的公共数据集上的结果表明，该算法能够产生比现有的基于学习的方法更接近最佳水平的解决方案，并且比元启发式的计算时间更短。该方法需要最小的设计努力，并且能够解决任意规模的未见问题，而无需额外的培训。此外，该方法可推广到其他组合优化问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 Eighth Indian Control Conference (ICC)

自引率

0.00%

发文量