A comparison of time-aware ranking methods

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval Pub Date : 2011-07-24 DOI:10.1145/2009916.2010147

Nattiya Kanhabua, K. Nørvåg

引用次数: 20

Abstract

When searching a temporal document collection, e.g., news archives or blogs, the time dimension must be explicitly incorporated into a retrieval model in order to improve relevance ranking. Previous work has followed one of two main approaches: 1) a mixture model linearly combining textual similarity and temporal similarity, or 2) a probabilistic model generating a query from the textual and temporal part of a document independently. In this paper, we compare the effectiveness of different time-aware ranking methods by using a mixture model applied to all methods. Extensive evaluation is conducted using the New York Times Annotated Corpus, queries and relevance judgments obtained using the Amazon Mechanical Turk.

查看原文本刊更多论文

时间感知排序方法的比较

当搜索一个时间文档集合时，例如，新闻档案或博客，时间维度必须明确地合并到检索模型中，以提高相关性排名。以前的工作遵循两种主要方法之一:1)线性组合文本相似性和时间相似性的混合模型，或2)从文档的文本和时间部分独立生成查询的概率模型。在本文中，我们通过一个适用于所有方法的混合模型，比较了不同时间感知排序方法的有效性。使用《纽约时报》注释语料库进行广泛的评估，使用亚马逊机械土耳其人获得查询和相关性判断。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval

自引率

0.00%

发文量