Contextual Ranking and Selection with Gaussian Processes and OCBA

IF 0.7 4区计算机科学 Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

ACM Transactions on Modeling and Computer Simulation Pub Date : 2023-11-20 DOI:10.1145/3633456

Sait Cakmak, Yuhao Wang, Siyang Gao, Enlu Zhou

引用次数: 0

Abstract

In many real world problems, we are faced with the problem of selecting the best among a finite number of alternatives, where the best alternative is determined based on context specific information. In this work, we study the contextual Ranking and Selection problem under a finite-alternative-finite-context setting, where we aim to find the best alternative for each context. We use a separate Gaussian process to model the reward for each alternative, and derive the large deviations rate function for both the expected and worst-case contextual probability of correct selection. We propose the GP-C-OCBA sampling policy, which uses the Gaussian process posterior to iteratively allocate observations to maximize the rate function. We prove its consistency and show that it achieves the optimal convergence rate under the assumption of a non-informative prior. Numerical experiments show that our algorithm is highly competitive in terms of sampling efficiency, while having significantly smaller computational overhead.

查看原文本刊更多论文

基于高斯过程和OCBA的上下文排序与选择

在现实世界的许多问题中，我们都面临着在有限数量的备选方案中选择最佳方案的问题，其中最佳方案是基于特定于上下文的信息确定的。在这项工作中，我们研究了有限选择-有限上下文设置下的上下文排序和选择问题，我们的目标是为每个上下文找到最佳选择。我们使用一个单独的高斯过程来模拟每种选择的奖励，并推导出正确选择的期望和最坏情况上下文概率的大偏差率函数。我们提出了GP-C-OCBA抽样策略，该策略使用高斯后验过程迭代分配观测值以最大化速率函数。我们证明了它的一致性，并表明在无信息先验假设下，它达到了最优的收敛速度。数值实验表明，该算法在采样效率方面具有很强的竞争力，同时计算开销也显著减少。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Modeling and Computer Simulation 工程技术-计算机：跨学科应用

CiteScore

2.50

自引率

22.20%

发文量

审稿时长

>12 weeks

期刊介绍： The ACM Transactions on Modeling and Computer Simulation (TOMACS) provides a single archival source for the publication of high-quality research and developmental results referring to all phases of the modeling and simulation life cycle. The subjects of emphasis are discrete event simulation, combined discrete and continuous simulation, as well as Monte Carlo methods. The use of simulation techniques is pervasive, extending to virtually all the sciences. TOMACS serves to enhance the understanding, improve the practice, and increase the utilization of computer simulation. Submissions should contribute to the realization of these objectives, and papers treating applications should stress their contributions vis-á-vis these objectives.