A Bayesian ranking and selection problem with pairwise comparisons

2017 Winter Simulation Conference (WSC) Pub Date : 2017-12-03 DOI:10.1109/WSC.2017.8247947

L. Priekule, Stephan Meisel

引用次数: 4

Abstract

We consider a ranking and selection problem where sampling of two alternatives at once is required for learning about the true performances of the individual alternatives. The true performance of an alternative is defined as its average probability of outperforming the other alternatives. We derive and numerically compare four different solution approaches. Two Knowledge Gradient sampling policies are compared with a pure exploration policy and with a knockout tournament. The knockout tournament serves as a natural benchmarking approach with respect to pairwise comparisons, and determines the sampling budget provided to the other approaches. Our numerical results show that the Knowledge Gradient policies outperform both knockout tournament and pure exploration, and that they lead to significant improvements already at a very small number of pairwise comparisons. In particular we find that a nonstationary Knowledge Gradient policy is the best of the considered approaches for ranking and selection with pairwise comparisons.

查看原文本刊更多论文

具有两两比较的贝叶斯排序和选择问题

我们考虑一个排序和选择问题，其中需要一次对两个备选方案进行抽样，以了解单个备选方案的真实性能。一个备选方案的真实性能被定义为其优于其他备选方案的平均概率。我们推导并数值比较了四种不同的解决方法。将两种知识梯度采样策略与纯探索策略和淘汰赛策略进行了比较。对于两两比较，淘汰赛是一种自然的基准测试方法，它决定了提供给其他方法的抽样预算。我们的数值结果表明，知识梯度策略优于淘汰赛和纯粹的探索，并且它们在非常少量的两两比较中已经导致了显着的改进。特别地，我们发现非平稳的知识梯度策略是两两比较中最好的排序和选择方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 Winter Simulation Conference (WSC)

自引率

0.00%

发文量