A comparative study of multi-algorithm optimization for inventory analytics in supply chains

Supply Chain Analytics Pub Date : 2025-08-21 DOI:10.1016/j.sca.2025.100154

Oussama Zabraoui, Yahya Hmamou , Anas Chafi , Salaheddine Kammouri Alami

{"title":"A comparative study of multi-algorithm optimization for inventory analytics in supply chains","authors":"Oussama Zabraoui, Yahya Hmamou , Anas Chafi , Salaheddine Kammouri Alami","doi":"10.1016/j.sca.2025.100154","DOIUrl":null,"url":null,"abstract":"<div><div>Effective management of inventory is essential for achieving high service levels, minimizing costs, and maintaining the overall resilience of retail supply chains—particularly in complex, real-world environments. Conventional strategies often prove inadequate because they rely on rigid assumptions or single-technique models that fail to accommodate practical challenges such as fluctuating demand, unpredictable lead times, and disruptions in supply.</div><div>To bridge this gap, our research undertakes a comprehensive comparison of multiple approaches — including Reinforcement Learning (RL), Genetic Algorithms (GA), Deep Learning (DL), Machine Learning (ML), and heuristic techniques — evaluated within a consistent and realistic testing framework based on the Walmart M5 dataset. This dataset offers a robust benchmark, containing multi-store, multi-item sales data that captures seasonal trends, event-driven demand variations, and price sensitivity. We introduce and evaluate an innovative hybrid methodology that combines a Genetic Algorithm with a Deep Q-Network (GA–DQN). The GA component conducts a broad, global search to optimize static inventory parameters such as reorder points and safety stock, while the DQN module learns adaptive, state-aware ordering strategies that can respond to dynamic, uncertain conditions. Our results show that this hybrid GA–DQN model achieves a significant improvement over a standalone DQN baseline—raising the service level from 61% to 94% and simultaneously lowering overall inventory costs. The framework we propose is modular and includes three key components: demand forecasting using Long Short-Term Memory (LSTM) networks to capture temporal sales patterns; GA-based optimization to fine-tune static policy parameters; and RL-driven adaptive control to support responsive, real-time ordering decisions. This integrated approach delivers a scalable, data-driven solution well-suited to the demands of modern retail supply chains, effectively addressing issues such as supplier unreliability, demand uncertainty, and the management of perishable goods.</div></div>","PeriodicalId":101186,"journal":{"name":"Supply Chain Analytics","volume":"12 ","pages":"Article 100154"},"PeriodicalIF":0.0000,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Supply Chain Analytics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949863525000548","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Effective management of inventory is essential for achieving high service levels, minimizing costs, and maintaining the overall resilience of retail supply chains—particularly in complex, real-world environments. Conventional strategies often prove inadequate because they rely on rigid assumptions or single-technique models that fail to accommodate practical challenges such as fluctuating demand, unpredictable lead times, and disruptions in supply.

To bridge this gap, our research undertakes a comprehensive comparison of multiple approaches — including Reinforcement Learning (RL), Genetic Algorithms (GA), Deep Learning (DL), Machine Learning (ML), and heuristic techniques — evaluated within a consistent and realistic testing framework based on the Walmart M5 dataset. This dataset offers a robust benchmark, containing multi-store, multi-item sales data that captures seasonal trends, event-driven demand variations, and price sensitivity. We introduce and evaluate an innovative hybrid methodology that combines a Genetic Algorithm with a Deep Q-Network (GA–DQN). The GA component conducts a broad, global search to optimize static inventory parameters such as reorder points and safety stock, while the DQN module learns adaptive, state-aware ordering strategies that can respond to dynamic, uncertain conditions. Our results show that this hybrid GA–DQN model achieves a significant improvement over a standalone DQN baseline—raising the service level from 61% to 94% and simultaneously lowering overall inventory costs. The framework we propose is modular and includes three key components: demand forecasting using Long Short-Term Memory (LSTM) networks to capture temporal sales patterns; GA-based optimization to fine-tune static policy parameters; and RL-driven adaptive control to support responsive, real-time ordering decisions. This integrated approach delivers a scalable, data-driven solution well-suited to the demands of modern retail supply chains, effectively addressing issues such as supplier unreliability, demand uncertainty, and the management of perishable goods.

Abstract Image

查看原文本刊更多论文

供应链库存分析的多算法优化比较研究

有效的库存管理对于实现高服务水平、最小化成本和保持零售供应链的整体弹性至关重要，特别是在复杂的现实环境中。传统战略往往被证明是不够的，因为它们依赖于僵化的假设或单一技术模型，无法适应需求波动、不可预测的交货时间和供应中断等实际挑战。为了弥补这一差距，我们的研究对多种方法进行了全面比较-包括强化学习（RL），遗传算法（GA），深度学习（DL），机器学习（ML）和启发式技术-在基于沃尔玛M5数据集的一致和现实的测试框架内进行评估。该数据集提供了一个强大的基准，包含多商店、多项目销售数据，这些数据捕获了季节性趋势、事件驱动的需求变化和价格敏感性。我们介绍并评估了一种结合遗传算法和深度q -网络（GA-DQN）的创新混合方法。GA组件进行广泛的全局搜索，以优化静态库存参数，如再订货点和安全库存，而DQN模块学习自适应、状态感知的订购策略，可以响应动态、不确定的条件。我们的研究结果表明，与单独的DQN基线相比，这种混合GA-DQN模型实现了显著的改进——将服务水平从61%提高到94%，同时降低了总体库存成本。我们提出的框架是模块化的，包括三个关键组成部分：使用长短期记忆（LSTM）网络进行需求预测，以捕捉时间销售模式；基于遗传算法的静态策略参数优化和rl驱动的自适应控制，以支持响应，实时订购决策。这种集成方法提供了一种可扩展的、数据驱动的解决方案，非常适合现代零售供应链的需求，有效地解决了供应商不可靠性、需求不确定性和易腐货物管理等问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Supply Chain Analytics

自引率

0.00%

发文量