A comparative study of methods for estimating query language models with pseudo feedback

Proceedings of the 18th ACM conference on Information and knowledge management Pub Date : 2009-11-02 DOI:10.1145/1645953.1646259

Yuanhua Lv, ChengXiang Zhai

引用次数: 196

Abstract

We systematically compare five representative state-of-the-art methods for estimating query language models with pseudo feedback in ad hoc information retrieval, including two variants of the relevance language model, two variants of the mixture feedback model, and the divergence minimization estimation method. Our experiment results show that a variant of relevance model and a variant of the mixture model tend to outperform other methods. We further propose several heuristics that are intuitively related to the good retrieval performance of an estimation method, and show that the variations in how these heuristics are implemented in different methods provide a good explanation of many empirical observations.

查看原文本刊更多论文

基于伪反馈的查询语言模型估计方法的比较研究

本文系统比较了自组织信息检索中具有代表性的五种具有伪反馈的查询语言模型估计方法，包括两种变体的关联语言模型、两种变体的混合反馈模型和最小散度估计方法。我们的实验结果表明，一种变体的关联模型和一种变体的混合模型往往优于其他方法。我们进一步提出了几种直观地与估计方法的良好检索性能相关的启发式，并表明这些启发式在不同方法中如何实现的变化为许多经验观察提供了很好的解释。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 18th ACM conference on Information and knowledge management

自引率

0.00%

发文量