Building a filtering test collection for TREC 2002

I. Soboroff, S. Robertson
{"title":"Building a filtering test collection for TREC 2002","authors":"I. Soboroff, S. Robertson","doi":"10.1145/860435.860481","DOIUrl":null,"url":null,"abstract":"Test collections for the filtering track in TREC have typically used either past sets of relevance judgments, or categorized collections such as Reuters Corpus Volume 1 or OHSUMED, because filtering systems need relevance judgments during the experiment for training and adaptation. For TREC 2002, we constructed an entirely new set of search topics for the Reuters Corpus for measuring filtering systems. Our method for building the topics involved multiple iterations of feedback from assessors, and fusion of results from multiple search systems using different search algorithms. We also developed a second set of \"inexpensive\" topics based on categories in the document collection. We found that the initial judgments made for the experiment were sufficient; subsequent pooled judging changed system rankings very little. We also found that systems performed very differently on the category topics than on the assessor-built topics.","PeriodicalId":209809,"journal":{"name":"Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"84","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/860435.860481","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 84

Abstract

Test collections for the filtering track in TREC have typically used either past sets of relevance judgments, or categorized collections such as Reuters Corpus Volume 1 or OHSUMED, because filtering systems need relevance judgments during the experiment for training and adaptation. For TREC 2002, we constructed an entirely new set of search topics for the Reuters Corpus for measuring filtering systems. Our method for building the topics involved multiple iterations of feedback from assessors, and fusion of results from multiple search systems using different search algorithms. We also developed a second set of "inexpensive" topics based on categories in the document collection. We found that the initial judgments made for the experiment were sufficient; subsequent pooled judging changed system rankings very little. We also found that systems performed very differently on the category topics than on the assessor-built topics.
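The abstract describes building judging pools by fusing results from multiple search systems that use different search algorithms. As a rough illustration of how such fusion can feed a judging pool, the sketch below uses CombSUM-style score fusion over several ranked runs; the function name, document IDs, and the choice of CombSUM are illustrative assumptions, not the specific method used for the TREC 2002 filtering track.

```python
# Hypothetical sketch: pool documents for assessor judging by fusing
# ranked runs from several search systems (CombSUM assumed for illustration).

from collections import defaultdict

def combsum_pool(runs, pool_depth=100):
    """Fuse ranked result lists from multiple systems and return a judging pool.

    runs: dict mapping system name -> list of (doc_id, score), sorted by score desc.
    pool_depth: number of top documents from the fused ranking sent to assessors.
    """
    fused = defaultdict(float)
    for system, ranking in runs.items():
        if not ranking:
            continue
        # Min-max normalize scores within each run so systems contribute comparably.
        scores = [s for _, s in ranking]
        lo, hi = min(scores), max(scores)
        span = (hi - lo) or 1.0
        for doc_id, score in ranking:
            fused[doc_id] += (score - lo) / span
    # Rank by fused score; the top pool_depth documents form the judging pool.
    ranked = sorted(fused.items(), key=lambda kv: kv[1], reverse=True)
    return [doc_id for doc_id, _ in ranked[:pool_depth]]

# Toy example with made-up Reuters-style document IDs.
runs = {
    "bm25":   [("RCV1-001", 12.3), ("RCV1-002", 10.1), ("RCV1-003", 9.8)],
    "vector": [("RCV1-002", 0.91), ("RCV1-004", 0.84), ("RCV1-001", 0.70)],
}
print(combsum_pool(runs, pool_depth=3))
```

In this kind of setup, documents retrieved by several systems tend to rise in the fused ranking, which concentrates assessor effort on likely-relevant material; deeper pooled judging can then be compared against the initial judgments to see whether system rankings change.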