Worbel: Aggregating Point Labels into Word Clouds

IF 1.2 Q4 REMOTE SENSING

ACM Transactions on Spatial Algorithms and Systems Pub Date : 2023-08-16 DOI:10.1145/3603376

Sujoy Bhore, Robert Ganian, Guangping Li, Martin Nöllenburg, Jules Wulms

{"title":"Worbel: Aggregating Point Labels into Word Clouds","authors":"Sujoy Bhore, Robert Ganian, Guangping Li, Martin Nöllenburg, Jules Wulms","doi":"10.1145/3603376","DOIUrl":null,"url":null,"abstract":"Point feature labeling is a classical problem in cartography and GIS that has been extensively studied for geospatial point data. At the same time, word clouds are a popular visualization tool to show the most important words in text data which has also been extended to visualize geospatial data (Buchin et al. PacificVis 2016). In this article, we study a hybrid visualization, which combines aspects of word clouds and point labeling. In the considered setting, the input data consist of a set of points grouped into categories and our aim is to place multiple disjoint and axis-aligned rectangles, each representing a category, such that they cover points of (mostly) the same category under some natural quality constraints. In our visualization, we then place category names inside the computed rectangles to produce a labeling of the covered points which summarizes the predominant categories globally (in a word-cloud-like fashion) while locally avoiding excessive misrepresentation of points (i.e., retaining the precision of point labeling). We show that computing a minimum set of such rectangles is NP -hard. Hence, we turn our attention to developing a heuristic with (optional) exact components using SAT models to compute our visualizations. We evaluate our algorithms quantitatively, measuring running time and quality of the produced solutions, on several synthetic and real-world data sets. Our experiments show that the fully heuristic approach produces solutions of comparable quality to heuristics combined with exact SAT models, while running much faster.","PeriodicalId":43641,"journal":{"name":"ACM Transactions on Spatial Algorithms and Systems","volume":"53 1","pages":"0"},"PeriodicalIF":1.2000,"publicationDate":"2023-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Spatial Algorithms and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3603376","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"REMOTE SENSING","Score":null,"Total":0}

引用次数: 0

Abstract

Point feature labeling is a classical problem in cartography and GIS that has been extensively studied for geospatial point data. At the same time, word clouds are a popular visualization tool to show the most important words in text data which has also been extended to visualize geospatial data (Buchin et al. PacificVis 2016). In this article, we study a hybrid visualization, which combines aspects of word clouds and point labeling. In the considered setting, the input data consist of a set of points grouped into categories and our aim is to place multiple disjoint and axis-aligned rectangles, each representing a category, such that they cover points of (mostly) the same category under some natural quality constraints. In our visualization, we then place category names inside the computed rectangles to produce a labeling of the covered points which summarizes the predominant categories globally (in a word-cloud-like fashion) while locally avoiding excessive misrepresentation of points (i.e., retaining the precision of point labeling). We show that computing a minimum set of such rectangles is NP -hard. Hence, we turn our attention to developing a heuristic with (optional) exact components using SAT models to compute our visualizations. We evaluate our algorithms quantitatively, measuring running time and quality of the produced solutions, on several synthetic and real-world data sets. Our experiments show that the fully heuristic approach produces solutions of comparable quality to heuristics combined with exact SAT models, while running much faster.

查看原文本刊更多论文

将点标签聚合到词云中

点特征标注是地图学和GIS中的一个经典问题，对地理空间点数据进行了广泛的研究。同时，词云是一种流行的可视化工具，用于显示文本数据中最重要的词，也已扩展到可视化地理空间数据(Buchin等)。PacificVis 2016)。在本文中，我们研究了一种混合可视化，它结合了词云和点标记的各个方面。在考虑的设置中，输入数据由一组分组到类别中的点组成，我们的目标是放置多个不相交且轴对齐的矩形，每个矩形代表一个类别，这样它们在一些自然质量约束下覆盖(大多数)相同类别的点。在我们的可视化中，我们将类别名称放置在计算的矩形内，以生成覆盖点的标签，该标签在全局(以类似单词云的方式)总结了主要类别，同时在局部避免了对点的过度错误表示(即，保留了点标记的精度)。我们证明了计算这些矩形的最小集合是NP困难的。因此，我们将注意力转向开发一个启发式的(可选的)精确组件，使用SAT模型来计算我们的可视化。我们定量地评估我们的算法，测量运行时间和产生的解决方案的质量，在几个合成和现实世界的数据集上。我们的实验表明，完全启发式方法产生的解决方案与启发式与精确SAT模型相结合的解决方案质量相当，同时运行速度快得多。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Spatial Algorithms and Systems REMOTE SENSING-

CiteScore

4.40

自引率

5.30%

发文量

期刊介绍： ACM Transactions on Spatial Algorithms and Systems (TSAS) is a scholarly journal that publishes the highest quality papers on all aspects of spatial algorithms and systems and closely related disciplines. It has a multi-disciplinary perspective in that it spans a large number of areas where spatial data is manipulated or visualized (regardless of how it is specified - i.e., geometrically or textually) such as geography, geographic information systems (GIS), geospatial and spatiotemporal databases, spatial and metric indexing, location-based services, web-based spatial applications, geographic information retrieval (GIR), spatial reasoning and mining, security and privacy, as well as the related visual computing areas of computer graphics, computer vision, geometric modeling, and visualization where the spatial, geospatial, and spatiotemporal data is central.