Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval最新文献_第4页

ReCoM: reinforcement clustering of multi-type interrelated data objects ReCoM:多类型相互关联数据对象的增强聚类

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval Pub Date : 2003-07-28 DOI: 10.1145/860435.860486

Jidong Wang, Hua-Jun Zeng, Zheng Chen, Hongjun Lu, Li Tao, Wei-Ying Ma

{"title":"ReCoM: reinforcement clustering of multi-type interrelated data objects","authors":"Jidong Wang, Hua-Jun Zeng, Zheng Chen, Hongjun Lu, Li Tao, Wei-Ying Ma","doi":"10.1145/860435.860486","DOIUrl":"https://doi.org/10.1145/860435.860486","url":null,"abstract":"Most existing clustering algorithms cluster highly related data objects such as Web pages and Web users separately. The interrelation among different types of data objects is either not considered, or represented by a static feature space and treated in the same ways as other attributes of the objects. In this paper, we propose a novel clustering approach for clustering multi-type interrelated data objects, ReCoM (Reinforcement Clustering of Multi-type Interrelated data objects). Under this approach, relationships among data objects are used to improve the cluster quality of interrelated data objects through an iterative reinforcement clustering process. At the same time, the link structure derived from relationships of the interrelated data objects is used to differentiate the importance of objects and the learned importance is also used in the clustering process to further improve the clustering results. Experimental results show that the proposed approach not only effectively overcomes the problem of data sparseness caused by the high dimensional relationship space but also significantly improves the clustering accuracy.","PeriodicalId":209809,"journal":{"name":"Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval","volume":"163 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126063527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 134

Building and applying a concept hierarchy representation of a user profile 构建和应用用户配置文件的概念层次结构表示

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval Pub Date : 2003-07-28 DOI: 10.1145/860435.860473

Nikolaos Nanas, V. Uren, A. Roeck

引用次数: 76

Experimental result analysis for a generative probabilistic image retrieval model 生成概率图像检索模型的实验结果分析

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval Pub Date : 2003-07-28 DOI: 10.1145/860435.860461

T. Westerveld, A. D. Vries

引用次数: 41

Music modeling with random fields 随机场的音乐建模

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval Pub Date : 2003-07-28 DOI: 10.1145/860435.860515

V. Lavrenko, Jeremy Pickens

{"title":"Music modeling with random fields","authors":"V. Lavrenko, Jeremy Pickens","doi":"10.1145/860435.860515","DOIUrl":"https://doi.org/10.1145/860435.860515","url":null,"abstract":"Recent interest in the area of music information retrieval is exploding. However, very few of the existing music retrieval techniques take advantage of recent developments in statistical modeling. In this report we discuss an application of Random Fields to the problem of statistical modeling of polyphonic music. With such models in hand, the challenges of developing effective searching, browsing, and organization techniques for the growing bodies of music collections may be successfully met. 1 Polyphonic music can be thought of as a two-dimensional stochastic process. Unlike text, the musical vocabulary is relatively small, containing at most several hundred discrete note symbols. What makes music so fascinating and expressive is the very rich structure inherent in musical pieces. Whereas text samples can be reasonably modeled using simple unigram or bi-gram language models, polyphonic music is characterized by numerous periodic symmetries, repetitions, and overlapping shortand long-term interactions that are beyond the capabilities of simple Markov chains. Random Fields are a generalization of Markov chains to multidimensional spatial processes. They are incredibly flexible, allowing us to model arbitrary interactions between elements of data. Recently random fields have found applications in large-vocabulary tasks, such as language modeling and information extraction. One of the most influential works in the area is the 1997 publication of Della Pietra et al. [2], which outlined the algorithms used in parts of this paper. Berger et al. [1] were the first to suggest the use of maximum entropy models for natural language processing. While our work was inspired by applications of random fields to language processing, it bears more similarity to the use of the framework by the researchers in computer vision. In most natural language applications authors start with a reasonable set of features (which are usually single words, or hand-crafted expressions), and the main challenge is to optimize the weights corresponding to these features. This works well in natural language, where words bear significant semantic content. In our case, induction of the random field is the crucial step. We will use the techniques suggested by [2] to automatically induce new high-level, salient features, such as chords and melodic progressions.","PeriodicalId":209809,"journal":{"name":"Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval","volume":"148 Pt 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126319246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

On the effectiveness of evaluating retrieval systems in the absence of relevance judgments 在缺乏相关性判断的情况下评价检索系统的有效性

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval Pub Date : 2003-07-28 DOI: 10.1145/860435.860501

J. Aslam, R. Savell

引用次数: 59

Exploiting query history for document ranking in interactive information retrieval 利用查询历史进行交互式信息检索中的文档排序

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval Pub Date : 2003-07-28 DOI: 10.1145/860435.860509

Xuehua Shen, ChengXiang Zhai

引用次数: 65

Investigating the relationship between language model perplexity and IR precision-recall measures 语言模型困惑度与IR查准率的关系研究

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval Pub Date : 2003-07-28 DOI: 10.1145/860435.860505

L. Azzopardi, M. Girolami, Keith van Risjbergen

引用次数: 151

Enhancing cross-language information retrieval by an automatic acquisition of bilingual terminology from comparable corpora 通过从可比语料库中自动获取双语术语来增强跨语言信息检索

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval Pub Date : 2003-07-28 DOI: 10.1145/860435.860519

F. Sadat, Masatoshi Yoshikawa, Shunsuke Uemura

引用次数: 10

Error analysis of difficult TREC topics TREC难题误差分析

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval Pub Date : 2003-07-28 DOI: 10.1145/860435.860524

Xiao Hu, S. Bandhakavi, ChengXiang Zhai

引用次数: 20

Using terminological feedback for web search refinement: a log-based study 使用术语反馈优化网络搜索:基于日志的研究

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval Pub Date : 2003-07-28 DOI: 10.1145/860435.860453

Peter G. Anick

引用次数: 335