Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval: latest papers, page 10

Document retrieval from user-selected web sites

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval Pub Date: 2003-07-28 DOI: 10.1145/860435.860558

U. Bohnacker, Ingrid Renz

Citations: 1

Probabilistic structured query methods

Pub Date: 2003-07-28 DOI: 10.1145/860435.860497

Kareem Darwish, Douglas W. Oard

Citations: 128

On an equivalence between PLSI and LDA

Pub Date: 2003-07-28 DOI: 10.1145/860435.860537

M. Girolami, A. Kabán

Citations: 232

Stuff I've seen: a system for personal information retrieval and re-use

Pub Date: 2003-07-28 DOI: 10.1145/860435.860451

S. Dumais, Edward Cutrell, Jonathan J. Cadiz, Gavin Jancke, Raman Sarin, Daniel C. Robbins

Citations: 76

When query expansion fails

Pub Date: 2003-07-28 DOI: 10.1145/860435.860514

B. Billerbeck, J. Zobel

Citations: 24

Empirical development of an exponential probabilistic model for text retrieval: using textual analysis to build a better model

Pub Date: 2003-07-28 DOI: 10.1145/860435.860441

J. Teevan, David R Karger

Citations: 27

A personalised information retrieval tool

Pub Date: 2003-07-28 DOI: 10.1145/860435.860532

I. Martin, J. Jose

Citations: 9

User-trainable video annotation using multimodal cues

Pub Date: 2003-07-28 DOI: 10.1145/860435.860522

Ching-Yung Lin, M. Naphade, A. Natsev, C. Neti, John R. Smith, Belle L. Tseng, H. Nock, W. H. Adams

Citations: 10

Domain-independent text segmentation using anisotropic diffusion and dynamic programming

Pub Date: 2003-07-28 DOI: 10.1145/860435.860494

Xiang-Hua Ji, H. Zha

Abstract: This paper presents a novel domain-independent text segmentation method, which identifies the boundaries of topic changes in long text documents and/or text streams. The method consists of three components: As a preprocessing step, we eliminate the document-dependent stop words as well as the generic stop words before the sentence similarity is computed. This step assists in the discrimination of the sentence semantic information. Then the cohesion information of sentences in a document or a text stream is captured with a sentence-distance matrix with each entry corresponding to the similarity between a sentence pair. The distance matrix can be represented with a gray-scale image. Thus, a text segmentation problem is converted into an image segmentation problem. We apply the anisotropic diffusion technique to the image representation of the distance matrix to enhance the semantic cohesion of sentence topical groups as well as sharpen topical boundaries. At last, the dynamic programming technique is adapted to find the optimal topical boundaries and provide a zoom-in and zoom-out mechanism for topics access by segmenting text in variable numbers of sentence topical groups. Our approach involves no domain-specific training, and it can be applied to texts in a variety of domains. The experimental results show that our approach is effective in text segmentation and outperforms several state-of-the-art methods.
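The pipeline in the abstract above (stop-word removal, a sentence-pair similarity matrix, then dynamic programming over candidate boundaries) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the bag-of-words cosine similarity, the cohesion score (summed within-segment similarity divided by segment length), and the function names `similarity_matrix` and `segment` are all assumptions made for illustration, and the anisotropic diffusion step is omitted entirely.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def similarity_matrix(sentences, stop_words=frozenset()):
    """Sentence-pair similarity matrix after dropping stop words.

    Assumption: whitespace tokenization and raw term counts.
    """
    bags = [Counter(w for w in s.lower().split() if w not in stop_words)
            for s in sentences]
    n = len(bags)
    return [[cosine(bags[i], bags[j]) for j in range(n)] for i in range(n)]


def segment(sim, k):
    """Split n sentences into k contiguous groups by dynamic programming.

    Returns the start indices of groups 2..k (the topic boundaries).
    The objective is an assumed stand-in, not the paper's criterion.
    """
    n = len(sim)
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= number of sentences")
    # 2D prefix sums: P[i][j] = sum of sim[a][b] for a < i, b < j.
    P = [[0.0] * (n + 1) for _ in range(n + 1)]
    for i in range(n):
        for j in range(n):
            P[i + 1][j + 1] = sim[i][j] + P[i][j + 1] + P[i + 1][j] - P[i][j]

    def cohesion(i, j):
        # Summed similarity inside the diagonal block [i, j), per sentence.
        block = P[j][j] - P[i][j] - P[j][i] + P[i][i]
        return block / (j - i)

    neg = float("-inf")
    best = [[neg] * (n + 1) for _ in range(k + 1)]
    back = [[0] * (n + 1) for _ in range(k + 1)]
    best[0][0] = 0.0
    for m in range(1, k + 1):          # number of groups used so far
        for j in range(m, n + 1):      # end of the m-th group (exclusive)
            for i in range(m - 1, j):  # start of the m-th group
                if best[m - 1][i] == neg:
                    continue
                cand = best[m - 1][i] + cohesion(i, j)
                if cand > best[m][j]:
                    best[m][j] = cand
                    back[m][j] = i
    # Walk the back-pointers from the last group to the first.
    starts, j = [], n
    for m in range(k, 0, -1):
        j = back[m][j]
        starts.append(j)
    return starts[::-1][1:]  # drop the leading 0


sentences = [
    "cats purr softly",
    "cats sleep softly",
    "stocks fell sharply",
    "stocks rose sharply",
]
boundaries = segment(similarity_matrix(sentences), 2)
print(boundaries)
```

Calling `segment` with different values of `k` on the same matrix gives coarser or finer groupings, which is one simple way to read the "zoom-in and zoom-out" idea mentioned in the abstract.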

Citations: 69

Probabilistic term variant generator for biomedical terms

Pub Date: 2003-07-28 DOI: 10.1145/860435.860467

Yoshimasa Tsuruoka, Junichi Tsujii

Citations: 32