Andoni I. Garmendia, Quentin Cappart, Josu Ceberio, Alexander Mendiburu
{"title":"MARCO: A Memory-Augmented Reinforcement Framework for Combinatorial Optimization","authors":"Andoni I. Garmendia, Quentin Cappart, Josu Ceberio, Alexander Mendiburu","doi":"arxiv-2408.02207","DOIUrl":null,"url":null,"abstract":"Neural Combinatorial Optimization (NCO) is an emerging domain where deep\nlearning techniques are employed to address combinatorial optimization problems\nas a standalone solver. Despite their potential, existing NCO methods often\nsuffer from inefficient search space exploration, frequently leading to local\noptima entrapment or redundant exploration of previously visited states. This\npaper introduces a versatile framework, referred to as Memory-Augmented\nReinforcement for Combinatorial Optimization (MARCO), that can be used to\nenhance both constructive and improvement methods in NCO through an innovative\nmemory module. MARCO stores data collected throughout the optimization\ntrajectory and retrieves contextually relevant information at each state. This\nway, the search is guided by two competing criteria: making the best decision\nin terms of the quality of the solution and avoiding revisiting already\nexplored solutions. This approach promotes a more efficient use of the\navailable optimization budget. Moreover, thanks to the parallel nature of NCO\nmodels, several search threads can run simultaneously, all sharing the same\nmemory module, enabling an efficient collaborative exploration. Empirical\nevaluations, carried out on the maximum cut, maximum independent set and\ntravelling salesman problems, reveal that the memory module effectively\nincreases the exploration, enabling the model to discover diverse,\nhigher-quality solutions. MARCO achieves good performance in a low\ncomputational cost, establishing a promising new direction in the field of NCO.","PeriodicalId":501347,"journal":{"name":"arXiv - CS - Neural and Evolutionary Computing","volume":"51 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Neural and Evolutionary Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.02207","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Neural Combinatorial Optimization (NCO) is an emerging domain where deep
learning techniques are employed to address combinatorial optimization problems
as a standalone solver. Despite their potential, existing NCO methods often
suffer from inefficient search space exploration, frequently leading to local
optima entrapment or redundant exploration of previously visited states. This
paper introduces a versatile framework, referred to as Memory-Augmented
Reinforcement for Combinatorial Optimization (MARCO), that can be used to
enhance both constructive and improvement methods in NCO through an innovative
memory module. MARCO stores data collected throughout the optimization
trajectory and retrieves contextually relevant information at each state. This
way, the search is guided by two competing criteria: making the best decision
in terms of the quality of the solution and avoiding revisiting already
explored solutions. This approach promotes a more efficient use of the
available optimization budget. Moreover, thanks to the parallel nature of NCO
models, several search threads can run simultaneously, all sharing the same
memory module, enabling an efficient collaborative exploration. Empirical
evaluations, carried out on the maximum cut, maximum independent set and
travelling salesman problems, reveal that the memory module effectively
increases the exploration, enabling the model to discover diverse,
higher-quality solutions. MARCO achieves good performance in a low
computational cost, establishing a promising new direction in the field of NCO.