Dynamic Programs with Shared Resources and Signals: Dynamic Fluid Policies and Asymptotic Optimality

IO: Empirical Studies of Firms & Markets eJournal Pub Date : 2020-11-10 DOI:10.2139/ssrn.3728111

David B. Brown, Jingwei Zhang

{"title":"Dynamic Programs with Shared Resources and Signals: Dynamic Fluid Policies and Asymptotic Optimality","authors":"David B. Brown, Jingwei Zhang","doi":"10.2139/ssrn.3728111","DOIUrl":null,"url":null,"abstract":"Allocating Resources Across Systems Coupled by Shared Information Many sequential decision problems involve repeatedly allocating a limited resource across subsystems that are jointly affected by randomly evolving exogenous factors. For example, in adaptive clinical trials, a decision maker needs to allocate patients to treatments in an effort to learn about the efficacy of treatments, but the number of available patients may vary randomly over time. In capital budgeting problems, firms may allocate resources to conduct R&D on new products, but funding budgets may evolve randomly. In many inventory management problems, firms need to allocate limited production capacity to satisfy uncertain demands at multiple locations, and these demands may be correlated due to vagaries in shared market conditions. In this paper, we develop a model involving “shared resources and signals” that captures these and potentially many other applications. The framework is naturally described as a stochastic dynamic program, but this problem is quite difficult to solve. We develop an approximation method based on a “dynamic fluid relaxation”: in this approximation, the subsystem state evolution is approximated by a deterministic fluid model, but the exogenous states (the signals) retain their stochastic evolution. We develop an algorithm for solving the dynamic fluid relaxation. We analyze the corresponding feasible policies and performance bounds from the dynamic fluid relaxation and show that these are asymptotically optimal as the number of subsystems grows large. We show that competing state-of-the-art approaches used in the literature on weakly coupled dynamic programs in general fail to provide asymptotic optimality. Finally, we illustrate the approach on the aforementioned dynamic capital budgeting and multilocation inventory management problems.","PeriodicalId":430354,"journal":{"name":"IO: Empirical Studies of Firms & Markets eJournal","volume":"139 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IO: Empirical Studies of Firms & Markets eJournal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3728111","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

Allocating Resources Across Systems Coupled by Shared Information Many sequential decision problems involve repeatedly allocating a limited resource across subsystems that are jointly affected by randomly evolving exogenous factors. For example, in adaptive clinical trials, a decision maker needs to allocate patients to treatments in an effort to learn about the efficacy of treatments, but the number of available patients may vary randomly over time. In capital budgeting problems, firms may allocate resources to conduct R&D on new products, but funding budgets may evolve randomly. In many inventory management problems, firms need to allocate limited production capacity to satisfy uncertain demands at multiple locations, and these demands may be correlated due to vagaries in shared market conditions. In this paper, we develop a model involving “shared resources and signals” that captures these and potentially many other applications. The framework is naturally described as a stochastic dynamic program, but this problem is quite difficult to solve. We develop an approximation method based on a “dynamic fluid relaxation”: in this approximation, the subsystem state evolution is approximated by a deterministic fluid model, but the exogenous states (the signals) retain their stochastic evolution. We develop an algorithm for solving the dynamic fluid relaxation. We analyze the corresponding feasible policies and performance bounds from the dynamic fluid relaxation and show that these are asymptotically optimal as the number of subsystems grows large. We show that competing state-of-the-art approaches used in the literature on weakly coupled dynamic programs in general fail to provide asymptotic optimality. Finally, we illustrate the approach on the aforementioned dynamic capital budgeting and multilocation inventory management problems.

查看原文本刊更多论文

具有共享资源和信号的动态规划:动态流体策略和渐近最优性

许多顺序决策问题涉及在子系统之间重复分配有限的资源，这些子系统受随机演化的外生因素的共同影响。例如，在适应性临床试验中，决策者需要将患者分配到治疗中，以努力了解治疗的疗效，但可用患者的数量可能随时间随机变化。在资本预算问题中，企业可能会分配资源进行新产品的研发，但资金预算可能会随机演变。在许多库存管理问题中，企业需要分配有限的生产能力来满足多个地点的不确定需求，而这些需求可能由于共享市场条件的变幻莫测而相互关联。在本文中，我们开发了一个涉及“共享资源和信号”的模型，该模型可以捕获这些和潜在的许多其他应用程序。该框架自然被描述为随机动态规划，但这个问题很难解决。我们开发了一种基于“动态流体松弛”的近似方法:在这种近似中，子系统状态演化由确定性流体模型近似，但外源状态(信号)保持其随机演化。提出了一种求解动态流体松弛的算法。从动态流体松弛的角度分析了相应的可行策略和性能界，并证明了随着子系统数量的增加，这些策略和性能界是渐近最优的。我们表明，在文献中使用的关于弱耦合动态规划的竞争的最先进的方法一般不能提供渐近最优性。最后，针对上述动态资金预算和多地点库存管理问题，给出了具体的解决方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IO: Empirical Studies of Firms & Markets eJournal

自引率

0.00%

发文量