{"title":"Learning a Better Negative Sampling Policy with Deep Neural Networks for Search","authors":"Daniel Cohen, Scott M. Jordan, W. Bruce Croft","doi":"10.1145/3341981.3344220","DOIUrl":null,"url":null,"abstract":"In information retrieval, sampling methods used to select documents for neural models must often deal with large class imbalances during training. This issue necessitates careful selection of negative instances when training neural models to avoid the risk of overfitting. For most work, heuristic sampling approaches, or policies, are created based off of domain experts, such as choosing samples with high BM25 scores or a random process over candidate documents. However, these sampling approaches are done with the test distribution in mind. In this paper, we demonstrate that the method chosen to sample negative documents during training plays a critical role in both the stability of training, as well as overall performance. Furthermore, we establish that using reinforcement learning to optimize a policy over a set of sampling functions can significantly improve performance over standard training practices with respect to IR metrics and is robust to hyperparameters and random seeds.","PeriodicalId":173154,"journal":{"name":"Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval","volume":"176 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3341981.3344220","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 14
Abstract
In information retrieval, sampling methods used to select documents for neural models must often deal with large class imbalances during training. This imbalance necessitates careful selection of negative instances when training neural models to avoid the risk of overfitting. In most prior work, heuristic sampling approaches, or policies, are created based on domain expertise, such as choosing samples with high BM25 scores or drawing candidate documents at random. However, these sampling approaches are designed with only the test distribution in mind. In this paper, we demonstrate that the method chosen to sample negative documents during training plays a critical role in both the stability of training and overall performance. Furthermore, we establish that using reinforcement learning to optimize a policy over a set of sampling functions can significantly improve performance over standard training practices with respect to IR metrics, and that it is robust to hyperparameters and random seeds.
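To make the central idea concrete, the sketch below shows one way a policy over a small set of negative-sampling functions could be learned with a REINFORCE-style update. This is a minimal illustration, not the authors' implementation: the candidate samplers, the reward signal standing in for the neural ranker's feedback, and all names are hypothetical placeholders.

```python
# Minimal sketch (assumed, not the paper's code): learn a softmax policy over
# a set of negative-sampling functions using REINFORCE with a running baseline.
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sampling functions: each takes (doc_id, bm25_score) pairs
# and returns one negative document id.
def sample_random(candidates):
    return candidates[rng.integers(len(candidates))][0]

def sample_top_bm25(candidates):
    return max(candidates, key=lambda c: c[1])[0]

def sample_bm25_weighted(candidates):
    scores = np.array([c[1] for c in candidates], dtype=float)
    probs = np.exp(scores - scores.max())
    probs /= probs.sum()
    return candidates[rng.choice(len(candidates), p=probs)][0]

SAMPLERS = [sample_random, sample_top_bm25, sample_bm25_weighted]

theta = np.zeros(len(SAMPLERS))  # policy preferences over samplers
alpha = 0.1                      # policy learning rate
baseline = 0.0                   # running reward baseline (variance reduction)

def policy_probs(theta):
    z = np.exp(theta - theta.max())
    return z / z.sum()

def reward_from_ranker(neg_doc_id):
    # Placeholder reward: in the paper's setting this would come from the
    # neural ranker's training signal / IR metric on the sampled negative.
    return rng.normal(loc=0.5 if neg_doc_id % 2 == 0 else 0.0, scale=0.1)

for step in range(1000):
    # Fake candidate pool for one query: (doc_id, bm25_score).
    candidates = [(i, float(rng.random())) for i in range(20)]

    probs = policy_probs(theta)
    a = rng.choice(len(SAMPLERS), p=probs)   # choose a sampling function
    neg = SAMPLERS[a](candidates)            # draw a negative document
    r = reward_from_ranker(neg)              # observe the training reward

    # REINFORCE update on the softmax policy.
    baseline += 0.01 * (r - baseline)
    grad = -probs
    grad[a] += 1.0
    theta += alpha * (r - baseline) * grad

print("learned sampler probabilities:", policy_probs(theta))
```

Under this sketch, the policy gradually shifts probability mass toward whichever sampling function yields the higher reward, which is the same mechanism the abstract describes for optimizing a policy over heuristic sampling strategies during training.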