Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem

arXiv - CS - Artificial Intelligence Pub Date : 2024-09-04 DOI:arxiv-2409.02697

Constantin Waubert de Puiseau, Fabian Wolz, Merlin Montag, Jannik Peters, Hasan Tercan, Tobias Meisen

{"title":"Decision Transformer for Enhancing Neural Local Search on the Job Shop Scheduling Problem","authors":"Constantin Waubert de Puiseau, Fabian Wolz, Merlin Montag, Jannik Peters, Hasan Tercan, Tobias Meisen","doi":"arxiv-2409.02697","DOIUrl":null,"url":null,"abstract":"The job shop scheduling problem (JSSP) and its solution algorithms have been\nof enduring interest in both academia and industry for decades. In recent\nyears, machine learning (ML) is playing an increasingly important role in\nadvancing existing and building new heuristic solutions for the JSSP, aiming to\nfind better solutions in shorter computation times. In this paper we build on\ntop of a state-of-the-art deep reinforcement learning (DRL) agent, called\nNeural Local Search (NLS), which can efficiently and effectively control a\nlarge local neighborhood search on the JSSP. In particular, we develop a method\nfor training the decision transformer (DT) algorithm on search trajectories\ntaken by a trained NLS agent to further improve upon the learned\ndecision-making sequences. Our experiments show that the DT successfully learns\nlocal search strategies that are different and, in many cases, more effective\nthan those of the NLS agent itself. In terms of the tradeoff between solution\nquality and acceptable computational time needed for the search, the DT is\nparticularly superior in application scenarios where longer computational times\nare acceptable. In this case, it makes up for the longer inference times\nrequired per search step, which are caused by the larger neural network\narchitecture, through better quality decisions per step. Thereby, the DT\nachieves state-of-the-art results for solving the JSSP with ML-enhanced search.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"45 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.02697","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The job shop scheduling problem (JSSP) and its solution algorithms have been of enduring interest in both academia and industry for decades. In recent years, machine learning (ML) is playing an increasingly important role in advancing existing and building new heuristic solutions for the JSSP, aiming to find better solutions in shorter computation times. In this paper we build on top of a state-of-the-art deep reinforcement learning (DRL) agent, called Neural Local Search (NLS), which can efficiently and effectively control a large local neighborhood search on the JSSP. In particular, we develop a method for training the decision transformer (DT) algorithm on search trajectories taken by a trained NLS agent to further improve upon the learned decision-making sequences. Our experiments show that the DT successfully learns local search strategies that are different and, in many cases, more effective than those of the NLS agent itself. In terms of the tradeoff between solution quality and acceptable computational time needed for the search, the DT is particularly superior in application scenarios where longer computational times are acceptable. In this case, it makes up for the longer inference times required per search step, which are caused by the larger neural network architecture, through better quality decisions per step. Thereby, the DT achieves state-of-the-art results for solving the JSSP with ML-enhanced search.

查看原文本刊更多论文

在工作车间调度问题上增强神经局部搜索的决策变换器

几十年来，作业车间调度问题（JSSP）及其求解算法一直受到学术界和工业界的广泛关注。近年来，机器学习（ML）在改进现有 JSSP 解决方案和构建新的启发式解决方案方面发挥着越来越重要的作用，其目的是在更短的计算时间内找到更好的解决方案。在本文中，我们在最先进的深度强化学习（DRL）代理（称为神经局部搜索（NLS））的基础上进行了改进，该代理可以高效地控制 JSSP 上的大型局部邻域搜索。特别是，我们开发了一种在训练有素的 NLS 代理的搜索轨迹上训练决策转换器（DT）算法的方法，以进一步改进所学的决策序列。我们的实验表明，DT 成功地学习到了不同于 NLS 代理本身的本地搜索策略，而且在很多情况下，比 NLS 代理本身的策略更加有效。在解决方案质量与搜索所需的可接受计算时间之间的权衡方面，DT 在可接受较长计算时间的应用场景中尤为出色。在这种情况下，它通过每一步更高质量的决策，弥补了因更大的神经网络架构而导致的每一步搜索所需的更长推理时间。因此，在利用 ML 增强搜索求解 JSSP 时，DT 达到了最先进的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Artificial Intelligence

自引率

0.00%

发文量