Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval: latest papers, page 2

Session details: Applications II

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval Pub Date : 2010-07-19 DOI: 10.1145/3254389

David D. Lewis

Citations: 0

Query recovery of short user queries: on query expansion with stopwords

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval Pub Date : 2010-07-19 DOI: 10.1145/1835449.1835589

Johannes Leveling, G. Jones

Citations: 2

Proximity-based opinion retrieval

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval Pub Date : 2010-07-19 DOI: 10.1145/1835449.1835517

Shima Gerani, Mark James Carman, F. Crestani

Abstract: Blog post opinion retrieval aims at finding blog posts that are relevant and opinionated about a user's query. In this paper we propose a simple probabilistic model for assigning relevant opinion scores to documents. The key problem is how to capture opinion expressions in the document that are related to the query topic. Current solutions enrich general opinion lexicons by finding query-specific opinion lexicons using pseudo-relevance feedback on external corpora or the collection itself. In this paper we use a general opinion lexicon and propose using proximity information in order to capture opinion term relatedness to the query. We propose a proximity-based opinion propagation method to calculate the opinion density at each point in a document. The opinion density at the position of a query term in the document can then be considered as the probability of opinion about the query term at that position. The effect of different kernels for capturing the proximity is also discussed. Experimental results on the BLOG06 dataset show that the proposed method provides significant improvement over standard TREC baselines and achieves a 2.5% increase in MAP over the best performing run in the TREC 2008 blog track.
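The opinion propagation idea in the abstract above can be illustrated with a minimal sketch. It assumes a Gaussian kernel, a toy lexicon that maps opinion words to weights, and an unnormalized density averaged over query-term positions; the function names, the `sigma` parameter, and the scoring rule are all hypothetical and are not taken from the paper.

```python
import math


def gaussian_kernel(distance, sigma):
    """Weight that decays with token distance (assumed kernel choice)."""
    return math.exp(-(distance ** 2) / (2.0 * sigma ** 2))


def opinion_density(tokens, lexicon, sigma=2.0):
    """Propagate each opinion word's weight to every position in the document."""
    density = [0.0] * len(tokens)
    for j, token in enumerate(tokens):
        weight = lexicon.get(token, 0.0)
        if weight == 0.0:
            continue  # not an opinion word, nothing to propagate
        for i in range(len(tokens)):
            density[i] += weight * gaussian_kernel(i - j, sigma)
    return density


def query_opinion_score(tokens, query_terms, lexicon, sigma=2.0):
    """Average the (unnormalized) opinion density at query-term positions."""
    density = opinion_density(tokens, lexicon, sigma)
    positions = [i for i, token in enumerate(tokens) if token in query_terms]
    if not positions:
        return 0.0
    return sum(density[i] for i in positions) / len(positions)


tokens = "the camera is great but the battery is average".split()
lexicon = {"great": 1.0}  # hypothetical general opinion lexicon
camera_score = query_opinion_score(tokens, {"camera"}, lexicon)
battery_score = query_opinion_score(tokens, {"battery"}, lexicon)
```

In this toy document "great" sits closer to "camera" than to "battery", so `camera_score` comes out higher than `battery_score`. Swapping `gaussian_kernel` for another decaying function is one way to explore the kernel comparison the abstract mentions.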

Citations: 73

User centered story tracking

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval Pub Date : 2010-07-19 DOI: 10.1145/1835449.1835693

Ilija Subasic

Abstract: For many people, data collections available on the Internet have become the main medium for staying informed about the world. Many of these collections are dynamic in nature, evolving as the subjects they describe change. The goal of different research areas is to identify and highlight these changes to better enable readers to track stories. In this work we restrict ourselves to news collections and investigate the "real-life" effectiveness and usability of temporal text mining (TTM) story tracking methods. We propose a new story tracking method and build a tool to support it. Additionally, we investigate the effectiveness and usability of story tracking methods and define new frameworks for automatic and user-oriented evaluation. We built methods and tools which allow for understanding, discovery, and search through user interaction. Although many TTM methods have been developed, there is no common evaluation procedure. Therefore, we propose an evaluation framework for measuring how different TTM methods discover novel "facts". Apart from the automatic evaluation, we are interested in how users can interact with patterns and learn about the underlying subjects of the story they track. For this purpose we propose a user testing environment that measures the speed and accuracy with which users can use story tracking methods to discover predefined sets of ground-truth sentences.

Citations: 0

Session details: Clustering II

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval Pub Date : 2010-07-19 DOI: 10.1145/3254371

Omar Alonso

Citations: 0

A joint probabilistic classification model for resource selection

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval Pub Date : 2010-07-19 DOI: 10.1145/1835449.1835468

Dzung Hong, Luo Si, Paul J. Bracke, M. Witt, Timothy C Juchcinski

Citations: 30

Robust audio identification for MP3 popular music

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval Pub Date : 2010-07-19 DOI: 10.1145/1835449.1835554

Wei Li, Yaduo Liu, X. Xue

Abstract: Audio identification via fingerprint has been an active research field with wide applications for years. Many technical papers have been published and commercial software systems have been deployed. However, most of the previously reported methods work on the raw audio format, even though compressed audio, especially MP3 music, has become the dominant way to store music on personal computers and transmit it over the Internet. It would be useful if an unknown compressed audio fragment could be recognized directly from the database without the cumbersome and time-consuming decompression-identification-recompression procedure. So far, very few algorithms run directly in the compressed domain for music information retrieval, and most of them take advantage of MDCT coefficients or derived energy-type features. As a first attempt, we propose in this paper utilizing compressed-domain spectral entropy as the audio feature to implement a novel audio fingerprinting algorithm. The compressed songs stored in a music database and the possibly distorted compressed query excerpts are first partially decompressed to obtain the MDCT coefficients as the intermediate result. Then by grouping granules into longer blocks, remapping the MDCT coefficients into 192 new frequency lines to unify the frequency distribution of long and short windows, and defining 9 new subbands which cover the main frequency bandwidth of popular songs in accordance with the scale-factor bands of short windows, we calculate the spectral entropy of all consecutive blocks and arrive at the final fingerprint sequence by means of magnitude relationship modeling. Experiments show that such fingerprints exhibit strong robustness against various audio signal distortions such as recompression, noise interference, echo addition, equalization, band-pass filtering, pitch shifting, and slight time-scale modification. For 5s-long query examples which might be severely degraded, an average top-five retrieval precision rate of more than 90% can be obtained in our test data set composed of 1822 popular songs.
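The last two steps described in the abstract above (spectral entropy per block, then a fingerprint from magnitude relationships) can be sketched roughly as follows. The sketch assumes the per-block subband energies have already been derived from the MDCT coefficients, and it interprets "magnitude relationship modeling" as one bit per pair of adjacent blocks; both the input layout and that interpretation are assumptions, not details confirmed by the abstract.

```python
import math


def spectral_entropy(subband_energies):
    """Shannon entropy (in bits) of the normalized energy distribution of one block."""
    total = sum(subband_energies)
    if total <= 0.0:
        return 0.0  # silent block: define entropy as zero
    entropy = 0.0
    for energy in subband_energies:
        if energy > 0.0:
            p = energy / total
            entropy -= p * math.log2(p)
    return entropy


def fingerprint_bits(blocks):
    """Emit 1 when entropy rises from one block to the next, else 0 (assumed rule)."""
    entropies = [spectral_entropy(block) for block in blocks]
    return [
        1 if entropies[k + 1] > entropies[k] else 0
        for k in range(len(entropies) - 1)
    ]


# Hypothetical energies for three blocks with four subbands each
# (the paper uses 9 subbands; four keeps the example short).
blocks = [
    [1.0, 1.0, 1.0, 1.0],  # flat spectrum, highest entropy
    [4.0, 0.0, 0.0, 0.0],  # all energy in one subband, zero entropy
    [1.0, 1.0, 0.0, 0.0],  # energy split over two subbands
]
bits = fingerprint_bits(blocks)
```

Comparing neighbouring blocks rather than storing raw entropy values is one common way to make a fingerprint less sensitive to overall level changes, which is why it is used in this illustration.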

Citations: 18

Closed form solution of similarity algorithms

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval Pub Date : 2010-07-19 DOI: 10.1145/1835449.1835577

Yuanzhe Cai, Miao Zhang, C. Ding, Sharma Chakravarthy

Citations: 8

Exploring reductions for long web queries

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval Pub Date : 2010-07-19 DOI: 10.1145/1835449.1835545

Niranjan Balasubramanian, G. Kumaran, Vitor R. Carvalho

Abstract: Long queries form a difficult, but increasingly important segment for web search engines. Query reduction, a technique for dropping unnecessary query terms from long queries, improves performance of ad-hoc retrieval on TREC collections. Also, it has great potential for improving long web queries (up to 25% improvement in NDCG@5). However, query reduction on the web is hampered by the lack of accurate query performance predictors and the constraints imposed by search engine architectures and ranking algorithms. In this paper, we present query reduction techniques for long web queries that leverage effective and efficient query performance predictors. We propose three learning formulations that combine these predictors to perform automatic query reduction. These formulations enable trading of average improvements for the number of queries impacted, and enable easy integration into the search engine's architecture for rank-time query reduction. Experiments on a large collection of long queries issued to a commercial search engine show that the proposed techniques significantly outperform baselines, with more than 12% improvement in NDCG@5 in the impacted set of queries. Extensions to the formulations, such as result interleaving, further improve results. We find that the proposed techniques deliver consistent retrieval gains where it matters most: poorly performing long web queries.
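The general shape of predictor-driven query reduction described in the abstract above might look like the sketch below. It is not one of the paper's three learning formulations: it simply tries every reduction that drops a single term and keeps the original query unless a reduction beats it by a margin. The `predict_quality` callable, the `margin` parameter, and the toy predictor are hypothetical stand-ins for a learned query performance predictor.

```python
def single_term_reductions(query_terms):
    """All reduced queries obtained by dropping exactly one term."""
    return [
        query_terms[:i] + query_terms[i + 1:]
        for i in range(len(query_terms))
    ]


def choose_query(query_terms, predict_quality, margin=0.0):
    """Pick the best single-term reduction, or keep the original query.

    A reduction is accepted only if its predicted quality exceeds the
    original's by more than `margin` (assumed acceptance rule).
    """
    best = list(query_terms)
    best_score = predict_quality(best) + margin
    for candidate in single_term_reductions(list(query_terms)):
        score = predict_quality(candidate)
        if score > best_score:
            best, best_score = candidate, score
    return best


def toy_predictor(terms):
    """Hypothetical predictor: penalize filler words that rarely help retrieval."""
    filler = {"please", "me"}
    return -sum(1 for term in terms if term in filler)


query = ["please", "find", "cheap", "flights"]
reduced = choose_query(query, toy_predictor)
unchanged = choose_query(query, toy_predictor, margin=2.0)
```

Raising `margin` makes the sketch reduce fewer queries but only when the predicted gain is larger, which loosely mirrors the trade-off between average improvement and number of queries impacted that the abstract mentions.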

Citations: 89

Session details: Applications I

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval Pub Date : 2010-07-19 DOI: 10.1145/3254367

Luo Si

Citations: 0