{"title":"基于q学习的多仓库联合补货配送优化算法","authors":"Lu Peng , Sirui Wang","doi":"10.1016/j.asoc.2025.113307","DOIUrl":null,"url":null,"abstract":"<div><div>The joint replenishment and delivery (JRD) strategy is critical for increasing operational management efficiency and reducing costs. This study introduces a new multi-warehouse JRD model for heterogeneous products. The main goal of the JRD strategy is to minimize costs by determining the optimal replenishment intervals, delivery frequency, and basic replenishment cycle time. To address the difficulties of the JRD optimization issue, we propose the Q-learning-based arithmetic optimization algorithm (QAOA). In the QAOA framework, Q-learning is the guiding principle, making decisions based on current conditions and constantly refining search strategy via feedback mechanisms. Furthermore, an escape mechanism has been implemented to reduce the possibility of algorithmic stagnation in local optima. Experiments show that QAOA exceeds eight popular benchmark algorithms. By employing the QAOA technique, the practical JRD model has been effectively handled, resulting in considerable cost reductions in supply chain management.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"178 ","pages":"Article 113307"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Q-learning based arithmetic optimization algorithm for a multi-warehouse joint replenishment and delivery problem\",\"authors\":\"Lu Peng , Sirui Wang\",\"doi\":\"10.1016/j.asoc.2025.113307\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The joint replenishment and delivery (JRD) strategy is critical for increasing operational management efficiency and reducing costs. This study introduces a new multi-warehouse JRD model for heterogeneous products. The main goal of the JRD strategy is to minimize costs by determining the optimal replenishment intervals, delivery frequency, and basic replenishment cycle time. To address the difficulties of the JRD optimization issue, we propose the Q-learning-based arithmetic optimization algorithm (QAOA). In the QAOA framework, Q-learning is the guiding principle, making decisions based on current conditions and constantly refining search strategy via feedback mechanisms. Furthermore, an escape mechanism has been implemented to reduce the possibility of algorithmic stagnation in local optima. Experiments show that QAOA exceeds eight popular benchmark algorithms. By employing the QAOA technique, the practical JRD model has been effectively handled, resulting in considerable cost reductions in supply chain management.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"178 \",\"pages\":\"Article 113307\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-05-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625006180\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625006180","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
A Q-learning based arithmetic optimization algorithm for a multi-warehouse joint replenishment and delivery problem
The joint replenishment and delivery (JRD) strategy is critical for increasing operational management efficiency and reducing costs. This study introduces a new multi-warehouse JRD model for heterogeneous products. The main goal of the JRD strategy is to minimize costs by determining the optimal replenishment intervals, delivery frequency, and basic replenishment cycle time. To address the difficulties of the JRD optimization issue, we propose the Q-learning-based arithmetic optimization algorithm (QAOA). In the QAOA framework, Q-learning is the guiding principle, making decisions based on current conditions and constantly refining search strategy via feedback mechanisms. Furthermore, an escape mechanism has been implemented to reduce the possibility of algorithmic stagnation in local optima. Experiments show that QAOA exceeds eight popular benchmark algorithms. By employing the QAOA technique, the practical JRD model has been effectively handled, resulting in considerable cost reductions in supply chain management.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.