High accuracy retrieval with multiple nested ranker

Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval Pub Date : 2006-08-06 DOI:10.1145/1148170.1148246

Irina Matveeva, C. Burges, Timo Burkard, Andy Laucius, Leon Wong

引用次数: 130

Abstract

High precision at the top ranks has become a new focus of research in information retrieval. This paper presents the multiple nested ranker approach that improves the accuracy at the top ranks by iteratively re-ranking the top scoring documents. At each iteration, this approach uses the RankNet learning algorithm to re-rank a subset of the results. This splits the problem into smaller and easier tasks and generates a new distribution of the results to be learned by the algorithm. We evaluate this approach using different settings on a data set labeled with several degrees of relevance. We use the normalized discounted cumulative gain (NDCG) to measure the performance because it depends not only on the position but also on the relevance score of the document in the ranked list. Our experiments show that making the learning algorithm concentrate on the top scoring results improves precision at the top ten documents in terms of the NDCG score.

查看原文本刊更多论文

具有多个嵌套排序器的高精度检索

高检索精度已成为信息检索研究的新热点。本文提出了一种多嵌套排序方法，通过对得分最高的文档进行迭代重新排序，提高了最高排名的准确性。在每次迭代中，该方法使用RankNet学习算法对结果子集进行重新排序。这将问题分解为更小更简单的任务，并生成算法要学习的结果的新分布。我们使用不同的设置来评估这种方法，这些设置在标记了几个相关度的数据集上。我们使用归一化贴现累积增益(NDCG)来衡量性能，因为它不仅取决于位置，还取决于文档在排名列表中的相关性得分。我们的实验表明，使学习算法集中在得分最高的结果上，可以提高NDCG得分前十位文档的精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval

自引率

0.00%

发文量