A deep reinforcement learning hyperheuristic for the covering tour problem with varying coverage

IF 4.1 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Parisa Torabi , Ahmad Hemmati , Anna Oleynik , Guttorm Alendal
{"title":"A deep reinforcement learning hyperheuristic for the covering tour problem with varying coverage","authors":"Parisa Torabi ,&nbsp;Ahmad Hemmati ,&nbsp;Anna Oleynik ,&nbsp;Guttorm Alendal","doi":"10.1016/j.cor.2024.106881","DOIUrl":null,"url":null,"abstract":"<div><div>Covering Tour Problem (CTP) is a combinatorial optimization problem in which the objective is to identify a minimum-cost tour that satisfies the coverage of a certain subset of nodes in a graph. The Covering Tour Problem with Varying Coverage (CTP-VC) is an extension of this problem in which the coverage radius is dependent on the amount of time spent at each node. In this paper, we propose a novel approach to address the CTP-VC using a Deep Reinforcement Learning Hyperheuristic (DRLH). This study includes experiments on the existing Adaptive Metaheuristic to solve CTP-VC, to enhance its solution quality. Further, new heuristics and three selection methods, namely Uniform Random Selection (URS), adaptive Metaheuristic (AMH), and the proposed DRLH are introduced. We detail the computational setup, including the instance sets utilized, the training process for the DRLH agent, and the validation procedures for model selection. Through extensive experimentation and analysis, we evaluate the performance of different selection methods, assess the solution quality of the DRLH approach, investigate the robustness of selection methods, examine heuristic selection frequency, and analyze solution convergence. Our results demonstrate the efficacy of the DRLH approach in tackling the CTP-VC, offering promising insights for future research in the interface of combinatorial optimization and reinforcement learning methodologies.</div></div>","PeriodicalId":10542,"journal":{"name":"Computers & Operations Research","volume":"174 ","pages":"Article 106881"},"PeriodicalIF":4.1000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Operations Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0305054824003538","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Covering Tour Problem (CTP) is a combinatorial optimization problem in which the objective is to identify a minimum-cost tour that satisfies the coverage of a certain subset of nodes in a graph. The Covering Tour Problem with Varying Coverage (CTP-VC) is an extension of this problem in which the coverage radius is dependent on the amount of time spent at each node. In this paper, we propose a novel approach to address the CTP-VC using a Deep Reinforcement Learning Hyperheuristic (DRLH). This study includes experiments on the existing Adaptive Metaheuristic to solve CTP-VC, to enhance its solution quality. Further, new heuristics and three selection methods, namely Uniform Random Selection (URS), adaptive Metaheuristic (AMH), and the proposed DRLH are introduced. We detail the computational setup, including the instance sets utilized, the training process for the DRLH agent, and the validation procedures for model selection. Through extensive experimentation and analysis, we evaluate the performance of different selection methods, assess the solution quality of the DRLH approach, investigate the robustness of selection methods, examine heuristic selection frequency, and analyze solution convergence. Our results demonstrate the efficacy of the DRLH approach in tackling the CTP-VC, offering promising insights for future research in the interface of combinatorial optimization and reinforcement learning methodologies.
针对不同覆盖率的覆盖巡游问题的深度强化学习超寻优方法
覆盖游问题(Covering Tour Problem,CTP)是一个组合优化问题,其目标是找出一个最小成本的游程,以满足覆盖图中某个节点子集的要求。覆盖范围可变的巡回问题(CTP-VC)是这一问题的扩展,其中的覆盖半径取决于在每个节点花费的时间。在本文中,我们提出了一种利用深度强化学习超启发式(DRLH)解决 CTP-VC 问题的新方法。本研究包括对现有自适应元启发式求解 CTP-VC 的实验,以提高其求解质量。此外,还介绍了新的启发式和三种选择方法,即统一随机选择法(URS)、自适应元启发式(AMH)和所提议的 DRLH。我们详细介绍了计算设置,包括使用的实例集、DRLH 代理的训练过程以及模型选择的验证程序。通过大量实验和分析,我们评估了不同选择方法的性能,评估了 DRLH 方法的解决方案质量,研究了选择方法的鲁棒性,检查了启发式选择频率,并分析了解决方案的收敛性。我们的研究结果证明了 DRLH 方法在处理 CTP-VC 方面的有效性,并为未来在组合优化和强化学习方法接口方面的研究提供了很有前景的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers & Operations Research
Computers & Operations Research 工程技术-工程:工业
CiteScore
8.60
自引率
8.70%
发文量
292
审稿时长
8.5 months
期刊介绍: Operations research and computers meet in a large number of scientific fields, many of which are of vital current concern to our troubled society. These include, among others, ecology, transportation, safety, reliability, urban planning, economics, inventory control, investment strategy and logistics (including reverse logistics). Computers & Operations Research provides an international forum for the application of computers and operations research techniques to problems in these and related fields.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信