Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval: latest papers

XML retrieval: what to retrieve?

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval Pub Date: 2003-07-28 DOI: 10.1145/860435.860525

J. Kamps, Maarten Marx, M. de Rijke, Börkur Sigurbjörnsson

Citations: 43

A System for new event detection

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval Pub Date: 2003-07-28 DOI: 10.1145/860435.860495

T. Brants, Francine Chen

Citations: 335

Single n-gram stemming

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval Pub Date: 2003-07-28 DOI: 10.1145/860435.860528

J. Mayfield, Paul McNamee

Citations: 120

Stemming in the language modeling framework

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval Pub Date: 2003-07-28 DOI: 10.1145/860435.860548

James Allan, G. Kumaran

Citations: 17

Query type classification for web document retrieval

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval Pub Date: 2003-07-28 DOI: 10.1145/860435.860449

Inho Kang, Gil-Chang Kim

Abstract: The heterogeneous Web exacerbates IR problems and short user queries make them worse. The contents of web documents are not enough to find good answer documents. Link information and URL information compensates for the insufficiencies of content information. However, static combination of multiple evidences may lower the retrieval performance. We need different strategies to find target documents according to a query type. We can classify user queries as three categories, the topic relevance task, the homepage finding task, and the service finding task. In this paper, a user query classification scheme is proposed. This scheme uses the difference of distribution, mutual information, the usage rate as anchor texts, and the POS information for the classification. After we classified a user query, we apply different algorithms and information for the better results. For the topic relevance task, we emphasize the content information, on the other hand, for the homepage finding task, we emphasize the Link information and the URL information. We could get the best performance when our proposed classification method with the OKAPI scoring algorithm was used.

Citations: 300

Head/modifier pairs for everyone

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval Pub Date: 2003-07-28 DOI: 10.1145/860435.860557

C. Koster

Citations: 4

Quantitative evaluation of passage retrieval algorithms for question answering

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval Pub Date: 2003-07-28 DOI: 10.1145/860435.860445

Stefanie Tellex, Boris Katz, Jimmy J. Lin, A. Fernandes, Gregory A. Marton

Citations: 349

A repetition based measure for verification of text collections and for text categorization

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval Pub Date: 2003-07-28 DOI: 10.1145/860435.860456

D. Khmelev, W. Teahan

Citations: 101

Querying XML using structures and keywords in timber

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval Pub Date: 2003-07-28 DOI: 10.1145/860435.860554

Cong Yu, H. Jagadish, Dragomir R. Radev

Citations: 7

Automatic image annotation and retrieval using cross-media relevance models

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval Pub Date: 2003-07-28 DOI: 10.1145/860435.860459

J. Jeon, V. Lavrenko, R. Manmatha

Abstract: Libraries have traditionally used manual image annotation for indexing and then later retrieving their image collections. However, manual image annotation is an expensive and labor intensive procedure and hence there has been great interest in coming up with automatic ways to retrieve images based on content. Here, we propose an automatic approach to annotating and retrieving images based on a training set of images. We assume that regions in an image can be described using a small vocabulary of blobs. Blobs are generated from image features using clustering. Given a training set of images with annotations, we show that probabilistic models allow us to predict the probability of generating a word given the blobs in an image. This may be used to automatically annotate and retrieve images given a word as a query. We show that relevance models allow us to derive these probabilities in a natural way. Experiments show that the annotation performance of this cross-media relevance model is almost six times as good (in terms of mean precision) than a model based on word-blob co-occurrence model and twice as good as a state of the art model derived from machine translation. Our approach shows the usefulness of using formal information retrieval models for the task of image annotation and retrieval.

Citations: 1342