Juan Zhou , Qianwang Deng , Yinwen Ma , Rui Pan , Jingxing Zhang , Mao Tan
{"title":"基于双层q学习的NSGA-II多产品订单综合生产调度与库存决策","authors":"Juan Zhou , Qianwang Deng , Yinwen Ma , Rui Pan , Jingxing Zhang , Mao Tan","doi":"10.1016/j.eswa.2025.130008","DOIUrl":null,"url":null,"abstract":"<div><div>High-quality spare parts production and supply are critical for establishing sustained competitive advantages. However, limited manufacturing resources and tight delivery deadlines challenge enterprises in balancing delivery reliability and inventory capital occupation. The widespread existence of multi-product spare parts orders, characterized by interdependent delivery constraints, further complicates production resource allocation and operational coordination. To address these issues, this study treats production and inventory as dual sources for order fulfillment and investigates the integrated production scheduling and inventory decision optimization problem considering multi-product orders (IPSID-MPO), aiming to simultaneously minimize total inventory capital occupation and total order delay penalty. We propose a novel double-layer Q-learning-guided non-dominated sorting genetic algorithm-II (DQ-NSGA), incorporating key innovations: (i) a decoding strategy that considers product delivery constraints, (ii) a hybrid initialization mechanism integrating four problem-specific heuristics, (iii) knowledge-driven local search strategies, and (iv) a self-adaptive adjustment mechanism via double-layer Q-learning. The outer layer dynamically tunes crossover/mutation probabilities based on population evolution, while the inner layer guides individual-specific search strategies. Comprehensive experiments on 180 benchmark instances demonstrate DQ-NSGA’s superiority over mainstream comparative algorithms. Comparisons with common simplified models that consider single-product orders demonstrates the necessity of incorporating multi-product orders in production and inventory decision-making. Furthermore, compared to Make-to-Stock and Make-to-Order paradigms, the proposed IPSID-MPO model not only reduces inventory carrying costs by 28.7% but also effectively enhances the flexibility of the manufacturing system.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"299 ","pages":"Article 130008"},"PeriodicalIF":7.5000,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Double-layer Q-learning guided NSGA-II for integrated production scheduling and inventory decision considering multi-product orders\",\"authors\":\"Juan Zhou , Qianwang Deng , Yinwen Ma , Rui Pan , Jingxing Zhang , Mao Tan\",\"doi\":\"10.1016/j.eswa.2025.130008\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>High-quality spare parts production and supply are critical for establishing sustained competitive advantages. However, limited manufacturing resources and tight delivery deadlines challenge enterprises in balancing delivery reliability and inventory capital occupation. The widespread existence of multi-product spare parts orders, characterized by interdependent delivery constraints, further complicates production resource allocation and operational coordination. To address these issues, this study treats production and inventory as dual sources for order fulfillment and investigates the integrated production scheduling and inventory decision optimization problem considering multi-product orders (IPSID-MPO), aiming to simultaneously minimize total inventory capital occupation and total order delay penalty. We propose a novel double-layer Q-learning-guided non-dominated sorting genetic algorithm-II (DQ-NSGA), incorporating key innovations: (i) a decoding strategy that considers product delivery constraints, (ii) a hybrid initialization mechanism integrating four problem-specific heuristics, (iii) knowledge-driven local search strategies, and (iv) a self-adaptive adjustment mechanism via double-layer Q-learning. The outer layer dynamically tunes crossover/mutation probabilities based on population evolution, while the inner layer guides individual-specific search strategies. Comprehensive experiments on 180 benchmark instances demonstrate DQ-NSGA’s superiority over mainstream comparative algorithms. Comparisons with common simplified models that consider single-product orders demonstrates the necessity of incorporating multi-product orders in production and inventory decision-making. Furthermore, compared to Make-to-Stock and Make-to-Order paradigms, the proposed IPSID-MPO model not only reduces inventory carrying costs by 28.7% but also effectively enhances the flexibility of the manufacturing system.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"299 \",\"pages\":\"Article 130008\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425036243\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425036243","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
高质量的备件生产和供应是建立持续竞争优势的关键。然而,有限的制造资源和紧迫的交货期限对企业在交货可靠性和库存资金占用之间的平衡提出了挑战。多产品备件订单的普遍存在,其特点是相互依赖的交货约束,进一步复杂化了生产资源分配和操作协调。为了解决这些问题,本研究将生产和库存作为订单履行的双重来源,研究了考虑多产品订单的集成生产调度和库存决策优化问题(IPSID-MPO),旨在同时最小化总库存资本占用和总订单延迟惩罚。我们提出了一种新的双层q学习引导的非支配排序遗传算法- ii (DQ-NSGA),其中包含以下关键创新:(i)考虑产品交付约束的解码策略,(ii)集成四种问题特定启发式的混合初始化机制,(iii)知识驱动的局部搜索策略,以及(iv)通过双层q学习的自适应调整机制。外层基于种群进化动态调整交叉/突变概率,而内层指导个体特定的搜索策略。在180个基准实例上的综合实验表明,DQ-NSGA优于主流比较算法。通过与考虑单产品订单的简化模型的比较,证明了在生产和库存决策中纳入多产品订单的必要性。此外,与库存制造模式和订单制造模式相比,IPSID-MPO模型不仅降低了28.7%的库存持有成本,而且有效地提高了制造系统的灵活性。
Double-layer Q-learning guided NSGA-II for integrated production scheduling and inventory decision considering multi-product orders
High-quality spare parts production and supply are critical for establishing sustained competitive advantages. However, limited manufacturing resources and tight delivery deadlines challenge enterprises in balancing delivery reliability and inventory capital occupation. The widespread existence of multi-product spare parts orders, characterized by interdependent delivery constraints, further complicates production resource allocation and operational coordination. To address these issues, this study treats production and inventory as dual sources for order fulfillment and investigates the integrated production scheduling and inventory decision optimization problem considering multi-product orders (IPSID-MPO), aiming to simultaneously minimize total inventory capital occupation and total order delay penalty. We propose a novel double-layer Q-learning-guided non-dominated sorting genetic algorithm-II (DQ-NSGA), incorporating key innovations: (i) a decoding strategy that considers product delivery constraints, (ii) a hybrid initialization mechanism integrating four problem-specific heuristics, (iii) knowledge-driven local search strategies, and (iv) a self-adaptive adjustment mechanism via double-layer Q-learning. The outer layer dynamically tunes crossover/mutation probabilities based on population evolution, while the inner layer guides individual-specific search strategies. Comprehensive experiments on 180 benchmark instances demonstrate DQ-NSGA’s superiority over mainstream comparative algorithms. Comparisons with common simplified models that consider single-product orders demonstrates the necessity of incorporating multi-product orders in production and inventory decision-making. Furthermore, compared to Make-to-Stock and Make-to-Order paradigms, the proposed IPSID-MPO model not only reduces inventory carrying costs by 28.7% but also effectively enhances the flexibility of the manufacturing system.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.