A deep reinforcement learning hyperheuristic for the covering tour problem with varying coverage

IF 4.1 2区工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Computers & Operations Research Pub Date : 2024-10-28 DOI:10.1016/j.cor.2024.106881

Parisa Torabi , Ahmad Hemmati , Anna Oleynik , Guttorm Alendal

{"title":"A deep reinforcement learning hyperheuristic for the covering tour problem with varying coverage","authors":"Parisa Torabi , Ahmad Hemmati , Anna Oleynik , Guttorm Alendal","doi":"10.1016/j.cor.2024.106881","DOIUrl":null,"url":null,"abstract":"<div><div>Covering Tour Problem (CTP) is a combinatorial optimization problem in which the objective is to identify a minimum-cost tour that satisfies the coverage of a certain subset of nodes in a graph. The Covering Tour Problem with Varying Coverage (CTP-VC) is an extension of this problem in which the coverage radius is dependent on the amount of time spent at each node. In this paper, we propose a novel approach to address the CTP-VC using a Deep Reinforcement Learning Hyperheuristic (DRLH). This study includes experiments on the existing Adaptive Metaheuristic to solve CTP-VC, to enhance its solution quality. Further, new heuristics and three selection methods, namely Uniform Random Selection (URS), adaptive Metaheuristic (AMH), and the proposed DRLH are introduced. We detail the computational setup, including the instance sets utilized, the training process for the DRLH agent, and the validation procedures for model selection. Through extensive experimentation and analysis, we evaluate the performance of different selection methods, assess the solution quality of the DRLH approach, investigate the robustness of selection methods, examine heuristic selection frequency, and analyze solution convergence. Our results demonstrate the efficacy of the DRLH approach in tackling the CTP-VC, offering promising insights for future research in the interface of combinatorial optimization and reinforcement learning methodologies.</div></div>","PeriodicalId":10542,"journal":{"name":"Computers & Operations Research","volume":"174 ","pages":"Article 106881"},"PeriodicalIF":4.1000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Operations Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0305054824003538","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Covering Tour Problem (CTP) is a combinatorial optimization problem in which the objective is to identify a minimum-cost tour that satisfies the coverage of a certain subset of nodes in a graph. The Covering Tour Problem with Varying Coverage (CTP-VC) is an extension of this problem in which the coverage radius is dependent on the amount of time spent at each node. In this paper, we propose a novel approach to address the CTP-VC using a Deep Reinforcement Learning Hyperheuristic (DRLH). This study includes experiments on the existing Adaptive Metaheuristic to solve CTP-VC, to enhance its solution quality. Further, new heuristics and three selection methods, namely Uniform Random Selection (URS), adaptive Metaheuristic (AMH), and the proposed DRLH are introduced. We detail the computational setup, including the instance sets utilized, the training process for the DRLH agent, and the validation procedures for model selection. Through extensive experimentation and analysis, we evaluate the performance of different selection methods, assess the solution quality of the DRLH approach, investigate the robustness of selection methods, examine heuristic selection frequency, and analyze solution convergence. Our results demonstrate the efficacy of the DRLH approach in tackling the CTP-VC, offering promising insights for future research in the interface of combinatorial optimization and reinforcement learning methodologies.

查看原文本刊更多论文

针对不同覆盖率的覆盖巡游问题的深度强化学习超寻优方法

覆盖游问题（Covering Tour Problem，CTP）是一个组合优化问题，其目标是找出一个最小成本的游程，以满足覆盖图中某个节点子集的要求。覆盖范围可变的巡回问题（CTP-VC）是这一问题的扩展，其中的覆盖半径取决于在每个节点花费的时间。在本文中，我们提出了一种利用深度强化学习超启发式（DRLH）解决 CTP-VC 问题的新方法。本研究包括对现有自适应元启发式求解 CTP-VC 的实验，以提高其求解质量。此外，还介绍了新的启发式和三种选择方法，即统一随机选择法（URS）、自适应元启发式（AMH）和所提议的 DRLH。我们详细介绍了计算设置，包括使用的实例集、DRLH 代理的训练过程以及模型选择的验证程序。通过大量实验和分析，我们评估了不同选择方法的性能，评估了 DRLH 方法的解决方案质量，研究了选择方法的鲁棒性，检查了启发式选择频率，并分析了解决方案的收敛性。我们的研究结果证明了 DRLH 方法在处理 CTP-VC 方面的有效性，并为未来在组合优化和强化学习方法接口方面的研究提供了很有前景的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computers & Operations Research 工程技术-工程：工业

CiteScore

8.60

自引率

8.70%

发文量

292

审稿时长

8.5 months

期刊介绍： Operations research and computers meet in a large number of scientific fields, many of which are of vital current concern to our troubled society. These include, among others, ecology, transportation, safety, reliability, urban planning, economics, inventory control, investment strategy and logistics (including reverse logistics). Computers & Operations Research provides an international forum for the application of computers and operations research techniques to problems in these and related fields.