Title: Dynamic lead time promising
Authors: Matthew J. Reindorp, M. Fu
Venue: 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Published: 2011-04-11
DOI: https://doi.org/10.1109/ADPRL.2011.5967376
Citations: 4
Abstract
We consider a make-to-order business that serves customers in multiple priority classes. Orders from customers in higher classes bring greater revenue, but they expect shorter lead times than customers in lower classes. In making lead time promises, the firm must recognize preexisting order commitments, uncertainty over future demand from each class, and the possibility of supply chain disruptions. We model this scenario as a Markov decision problem and use reinforcement learning to determine the firm's lead time policy. In order to achieve tractability on large problems, we utilize a sequential decision-making approach that effectively allows us to eliminate one dimension from the state space of the system. Initial numerical results from the sequential dynamic approach suggest that the resulting policies more closely approximate optimal policies than static optimization approaches.
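The abstract's setting can be made concrete with a small sketch. The following is an illustrative tabular Q-learning toy, not the paper's model: all classes, revenues, penalties, backlog dynamics, and parameters are invented for demonstration, and it does not implement the paper's sequential dimension-eliminating approach. The state is (priority class of the arriving order, current backlog of committed work); the action is the quoted lead time; the reward trades off revenue, a class-dependent dislike of long quotes, and a class-dependent penalty for quotes shorter than the queued work.

```python
import random

random.seed(0)

# All numbers below are illustrative assumptions, not from the paper.
CLASSES = [0, 1]                  # 0 = low priority, 1 = high priority
REVENUE = {0: 2.0, 1: 6.0}        # revenue per accepted order, by class
QUOTE_COST = {0: 0.2, 1: 1.0}     # high class dislikes long quotes more
LATE_PENALTY = {0: 0.5, 1: 3.0}   # and punishes broken promises harder
QUOTES = [1, 2, 3]                # candidate lead-time quotes (actions)
MAX_BACKLOG = 5
ALPHA, GAMMA, EPS = 0.1, 0.9, 0.1

# Tabular Q-function over (class, backlog) states and quote actions.
Q = {(c, b): {q: 0.0 for q in QUOTES}
     for c in CLASSES for b in range(MAX_BACKLOG + 1)}

def reward(cls, quote, backlog):
    # Revenue, minus a cost for quoting long, minus a penalty when the
    # quote is shorter than the work already queued ahead of the order.
    lateness = max(0, backlog - quote)
    return REVENUE[cls] - QUOTE_COST[cls] * quote - LATE_PENALTY[cls] * lateness

backlog, cls = 0, random.choice(CLASSES)
for _ in range(50000):
    state = (cls, backlog)
    # Epsilon-greedy action selection over the Q-table.
    if random.random() < EPS:
        quote = random.choice(QUOTES)
    else:
        quote = max(Q[state], key=Q[state].get)
    r = reward(cls, quote, backlog)
    # One order joins the backlog; 1-2 units of work complete per period.
    backlog = min(MAX_BACKLOG, max(0, backlog + 1 - random.randint(1, 2)))
    cls = random.choice(CLASSES)  # class of the next arriving order
    next_state = (cls, backlog)
    # Standard Q-learning update toward the one-step bootstrapped target.
    Q[state][quote] += ALPHA * (r + GAMMA * max(Q[next_state].values())
                                - Q[state][quote])

# Greedy lead-time policy: one quoted lead time per (class, backlog) state.
policy = {s: max(acts, key=acts.get) for s, acts in Q.items()}
print(policy)
```

In this toy the learned policy tends to quote short lead times when the backlog is low and longer ones as the backlog grows, with the trade-off differing by class; the paper's actual contribution, a sequential decision-making formulation that removes one state-space dimension, is beyond this sketch.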