Active Learning for Contextual Search with Binary Feedback

Manag. Sci. Pub Date : 2022-07-12 DOI:10.1287/mnsc.2022.4473

Xi Chen, Quanquan C. Liu, Yining Wang

引用次数: 1

Abstract

In this paper, we study the learning problem in contextual search, which is motivated by applications such as crowdsourcing and personalized medicine experiments. In particular, for a sequence of arriving context vectors, with each context associated with an underlying value, the decision maker either makes a query at a certain point or skips the context. The decision maker will only observe the binary feedback on the relationship between the query point and the value associated with the context. We study a probably approximately correct learning setting, where the goal is to learn the underlying mean value function in context with a minimum number of queries. To address this challenge, we propose a trisection search approach combined with a margin-based active learning method. We show that the algorithm only needs to make [Formula: see text] queries to achieve an ε-estimation accuracy. This sample complexity significantly reduces the required sample complexity in the passive setting where neither sample skipping nor query selection is allowed, which is at least [Formula: see text]. This paper was accepted by J. George Shanthikumar, data science.

查看原文本刊更多论文

基于二元反馈的主动学习上下文搜索

在本文中，我们研究了上下文搜索中的学习问题，这是由众包和个性化医学实验等应用驱动的。特别是，对于到达的上下文向量序列，每个上下文都与一个底层值相关联，决策者要么在某个点进行查询，要么跳过上下文。决策者将只观察关于查询点和与上下文关联的值之间关系的二元反馈。我们研究了一个可能近似正确的学习设置，其目标是使用最少的查询次数来学习上下文中的底层均值函数。为了解决这一挑战，我们提出了一种结合基于边缘的主动学习方法的三切分搜索方法。我们表明，该算法只需要进行[Formula: see text]查询即可达到ε-估计精度。这种样本复杂度显著降低了被动设置中所需的样本复杂度，在被动设置中，既不允许跳过样本，也不允许查询选择，这至少是[公式:见文本]。这篇论文被数据科学的J. George Shanthikumar接受。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Manag. Sci.

自引率

0.00%

发文量