Measuring Generality of Documents

22nd International Conference on Data Engineering Workshops (ICDEW'06) · Pub Date: 2006-04-03 · DOI: 10.1109/ICDEW.2006.77

H. Shin, E. Hovy, D. McLeod, Larry Pryor

Abstract

Most traditional Information Retrieval (IR) systems, including web search engines, operationalize "relevance" as the frequency with which a set of keywords occurs in a document. Because of this limitation, traditional IR systems frequently retrieve irrelevant documents in response to a user’s request. In this paper, we propose a new criterion, "generality," that provides an additional basis on which to rank retrieved documents. Generality is a document's level of abstraction; ranking by it allows results to be retrieved at the level of generality appropriate to a user’s knowledge and interests. We compared the output of our generality quantification algorithm with human judges’ ratings and found that the two are significantly correlated.
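The keyword-frequency notion of relevance that the abstract criticizes can be illustrated with a minimal sketch. This is not the paper's algorithm and not how any particular search engine works; the function names, the tokenizer, and the sample documents are all assumptions made for illustration.

```python
import re
from collections import Counter


def keyword_frequency_score(document: str, keywords: list[str]) -> int:
    """Hypothetical relevance score: total occurrences of the query keywords."""
    # Lowercase and split on anything that is not a letter or digit.
    tokens = re.findall(r"[a-z0-9]+", document.lower())
    counts = Counter(tokens)
    # Counter returns 0 for keywords that never occur in the document.
    return sum(counts[keyword.lower()] for keyword in keywords)


def rank_by_keyword_frequency(documents: list[str], keywords: list[str]) -> list[str]:
    """Order documents from highest to lowest keyword-frequency score."""
    return sorted(
        documents,
        key=lambda doc: keyword_frequency_score(doc, keywords),
        reverse=True,
    )


# Illustrative documents (invented for this sketch, not taken from the paper).
docs = [
    "The Jaguar XK is a car with a top speed of 250 km/h.",
    "Jaguar speed: the jaguar is a fast cat, and the jaguar hunts at night.",
]
query = ["jaguar", "speed"]

ranked = rank_by_keyword_frequency(docs, query)
scores = [keyword_frequency_score(doc, query) for doc in docs]
```

In this sketch a user looking for the car would see the text about the animal ranked first, simply because it repeats the keyword more often. A purely frequency-based score carries no information about how general or specific a document is, which is the gap the proposed generality criterion is meant to address.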
