{"title":"Relevance-driven Clustering for Visual Information Retrieval on Twitter","authors":"Mohamed Reda Bouadjenek, S. Sanner","doi":"10.1145/3295750.3298914","DOIUrl":null,"url":null,"abstract":"Geo-temporal visualization of Twitter search results is a challenging task since the simultaneous display of all matching tweets would result in a saturated and unreadable display. In such settings, clustering search results can assist users to scan only a few coherent groups of related tweets rather than many individual tweets. However, in practice, the use of unsupervised clustering methods such as K -Means does not necessarily guarantee that the clusters themselves are relevant. Therefore, we develop a novel method of relevance-driven clustering for visual information retrieval to supply users with highly relevant clusters representing different information perspectives of their queries. We specifically propose a Visual Twitter Information Retrieval (Viz-TIR) tool for relevance-driven clustering and ranking of Twitter search results. At the heart of Viz-TIR is a fast greedy algorithm that optimizes an approximation of an expected F1-Score metric to generate these clusters. We demonstrate its effectiveness w.r.t. K -Means and a baseline method that shows all top matching results on a scenario related to searching natural disasters in US-based Twitter data spanning 2013 and 2014. Our demo shows that Viz-TIR is easy to use and more precise in extracting geo-temporally coherent clusters given search queries in comparison to K-Means, thus aiding the user in visually searching and browsing social network content. Overall, we believe this work enables new opportunities for the synthesis of information retrieval as well as combined relevance and display-aware optimization techniques to support query-adaptive visual information exploration interfaces.","PeriodicalId":187771,"journal":{"name":"Proceedings of the 2019 Conference on Human Information Interaction and Retrieval","volume":"110 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 Conference on Human Information Interaction and Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3295750.3298914","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
Geo-temporal visualization of Twitter search results is a challenging task since the simultaneous display of all matching tweets would result in a saturated and unreadable display. In such settings, clustering search results can assist users to scan only a few coherent groups of related tweets rather than many individual tweets. However, in practice, the use of unsupervised clustering methods such as K -Means does not necessarily guarantee that the clusters themselves are relevant. Therefore, we develop a novel method of relevance-driven clustering for visual information retrieval to supply users with highly relevant clusters representing different information perspectives of their queries. We specifically propose a Visual Twitter Information Retrieval (Viz-TIR) tool for relevance-driven clustering and ranking of Twitter search results. At the heart of Viz-TIR is a fast greedy algorithm that optimizes an approximation of an expected F1-Score metric to generate these clusters. We demonstrate its effectiveness w.r.t. K -Means and a baseline method that shows all top matching results on a scenario related to searching natural disasters in US-based Twitter data spanning 2013 and 2014. Our demo shows that Viz-TIR is easy to use and more precise in extracting geo-temporally coherent clusters given search queries in comparison to K-Means, thus aiding the user in visually searching and browsing social network content. Overall, we believe this work enables new opportunities for the synthesis of information retrieval as well as combined relevance and display-aware optimization techniques to support query-adaptive visual information exploration interfaces.
Twitter搜索结果的地理时间可视化是一项具有挑战性的任务,因为同时显示所有匹配的tweet将导致饱和且不可读的显示。在这种设置中,聚类搜索结果可以帮助用户只扫描几组连贯的相关tweet,而不是许多单独的tweet。然而,在实践中,使用K -Means等无监督聚类方法并不一定保证聚类本身是相关的。因此,我们开发了一种新的关联驱动聚类方法用于视觉信息检索,为用户提供代表其查询的不同信息视角的高度相关聚类。我们特别提出了一个可视化Twitter信息检索(Viz-TIR)工具,用于Twitter搜索结果的相关性驱动聚类和排名。Viz-TIR的核心是一个快速贪婪算法,它优化了预期F1-Score指标的近似值来生成这些聚类。我们展示了它的有效性w.r.t. K -Means和一个基线方法,该方法显示了在2013年和2014年美国Twitter数据中搜索自然灾害相关场景的所有顶级匹配结果。我们的演示表明,与K-Means相比,Viz-TIR易于使用,并且在提取给定搜索查询的地理时间相干簇方面更精确,从而帮助用户在视觉上搜索和浏览社交网络内容。总的来说,我们相信这项工作为综合信息检索以及结合相关性和显示感知优化技术提供了新的机会,以支持查询自适应的视觉信息探索界面。