Minimizing the weighted number of tardy jobs: data-driven heuristic for single-machine scheduling

IF 4.3 2区工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Computers & Operations Research Pub Date : 2025-09-26 DOI:10.1016/j.cor.2025.107281

Nikolai Antonov , Přemysl Šůcha , Mikoláš Janota , Jan Hůla

{"title":"Minimizing the weighted number of tardy jobs: data-driven heuristic for single-machine scheduling","authors":"Nikolai Antonov , Přemysl Šůcha , Mikoláš Janota , Jan Hůla","doi":"10.1016/j.cor.2025.107281","DOIUrl":null,"url":null,"abstract":"<div><div>Existing research on single-machine scheduling is largely focused on exact algorithms, which perform well on typical instances but can significantly deteriorate on certain regions of the problem space. In contrast, data-driven approaches provide strong and scalable performance when tailored to the structure of specific datasets. Leveraging this idea, we focus on a single-machine scheduling problem where each job is defined by its weight, duration, due date, and deadline, aiming to minimize the total weight of tardy jobs. We introduce a novel data-driven scheduling heuristic that combines machine learning with problem-specific characteristics, ensuring feasible solutions, which is a common challenge for ML-based algorithms. Experimental results demonstrate that our approach significantly outperforms the state-of-the-art in terms of optimality gap, number of optimal solutions, and adaptability across varied data scenarios, highlighting its flexibility for practical applications. In addition, we conduct a systematic exploration of ML models, addressing a common gap in similar studies by offering a detailed model selection process and demonstrating why the chosen model is the best fit.</div></div>","PeriodicalId":10542,"journal":{"name":"Computers & Operations Research","volume":"185 ","pages":"Article 107281"},"PeriodicalIF":4.3000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Operations Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0305054825003107","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Existing research on single-machine scheduling is largely focused on exact algorithms, which perform well on typical instances but can significantly deteriorate on certain regions of the problem space. In contrast, data-driven approaches provide strong and scalable performance when tailored to the structure of specific datasets. Leveraging this idea, we focus on a single-machine scheduling problem where each job is defined by its weight, duration, due date, and deadline, aiming to minimize the total weight of tardy jobs. We introduce a novel data-driven scheduling heuristic that combines machine learning with problem-specific characteristics, ensuring feasible solutions, which is a common challenge for ML-based algorithms. Experimental results demonstrate that our approach significantly outperforms the state-of-the-art in terms of optimality gap, number of optimal solutions, and adaptability across varied data scenarios, highlighting its flexibility for practical applications. In addition, we conduct a systematic exploration of ML models, addressing a common gap in similar studies by offering a detailed model selection process and demonstrating why the chosen model is the best fit.

查看原文本刊更多论文

最小化延迟作业的加权数：单机调度的数据驱动启发式算法

现有的单机调度研究主要集中在精确算法上，这些算法在典型情况下表现良好，但在问题空间的某些区域会严重恶化。相反，数据驱动的方法在针对特定数据集的结构进行定制时提供强大的可伸缩性能。利用这一思想，我们将重点放在单机调度问题上，其中每个作业由其权重、持续时间、到期日期和截止日期定义，旨在最小化延迟作业的总权重。我们引入了一种新颖的数据驱动调度启发式算法，将机器学习与问题特定特征相结合，确保可行的解决方案，这是基于ml的算法面临的共同挑战。实验结果表明，我们的方法在最优性差距、最优解的数量和不同数据场景的适应性方面明显优于最先进的方法，突出了其实际应用的灵活性。此外，我们对机器学习模型进行了系统的探索，通过提供详细的模型选择过程并演示为什么所选择的模型是最适合的，从而解决了类似研究中的常见差距。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computers & Operations Research 工程技术-工程：工业

CiteScore

8.60

自引率

8.70%

发文量

292

审稿时长

8.5 months

期刊介绍： Operations research and computers meet in a large number of scientific fields, many of which are of vital current concern to our troubled society. These include, among others, ecology, transportation, safety, reliability, urban planning, economics, inventory control, investment strategy and logistics (including reverse logistics). Computers & Operations Research provides an international forum for the application of computers and operations research techniques to problems in these and related fields.