Online Resource Allocation with Personalized Learning

Oper. Res. Pub Date : 2022-05-10 DOI:10.2139/ssrn.3538509

M. Zhalechian, Esmaeil Keyvanshokooh, Cong Shi, M. P. Oyen

引用次数: 4

Abstract

Joint online learning and resource allocation is a fundamental problem inherent in many applications. In a general setting, heterogeneous customers arrive sequentially, each of which can be allocated to a resource in an online fashion. Customers stochastically consume the resources, allocations yield stochastic rewards, and the system receives feedback outcomes with delay. In “Online Resource Allocation with Personalized Learning,” Zhalechian, Keyvanshokooh, Shi, and Van Oyen introduce a generic framework to solve this problem. It judiciously synergizes online learning with a broad class of online resource allocation mechanisms, where the sequence of customer contexts is adversarial, and the customer reward and resource consumption are stochastic and unknown. They propose online algorithms that strike a three-way balance between exploration, exploitation, and hedging against adversarial arrival sequence. A performance guarantee is provided for each online algorithm, and the efficacy of their algorithms is demonstrated using clinical data from a health system.

查看原文本刊更多论文

个性化学习的在线资源分配

在线联合学习和资源分配是许多应用程序中固有的一个基本问题。在一般情况下，异构客户按顺序到达，每个客户都可以以在线方式分配给资源。客户随机消耗资源，分配产生随机奖励，系统接收有延迟的反馈结果。Zhalechian, Keyvanshokooh, Shi和Van Oyen在“个性化学习的在线资源分配”一文中介绍了一个解决这个问题的通用框架。它明智地将在线学习与广泛的在线资源分配机制协同起来，其中客户上下文的顺序是对抗性的，客户奖励和资源消耗是随机的和未知的。他们提出了一种在线算法，在探索、利用和对冲对抗到达序列之间取得了三方面的平衡。为每个在线算法提供性能保证，并使用来自卫生系统的临床数据证明其算法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Oper. Res.

自引率

0.00%

发文量