{"title":"Density-aware Stratified Sampling for Visualizing Large Volume Geo-Spatial Data","authors":"Liming Dong, Bin Feng, Weidong Liu","doi":"10.1145/3406971.3406991","DOIUrl":null,"url":null,"abstract":"Sampling is a popular approach in big data visualization, however, current sampling approaches don't work well when visualization type is scatter plot, and are even worse in supporting keyword search queries. In this paper, we present an approach of density-aware stratified sampling, it first probing the density of record in different areas of the visualization, then taking the density data to guide the stratified sampling. We conducted an extensively user study to show the efficiency and efficacy of our approach, the experiment shows that our approach can provide very close scatter plots of keyword search queries of a 200 million record dataset within 0.2 second, and the construction time is only 1/4 of an alternative method.","PeriodicalId":111905,"journal":{"name":"Proceedings of the 4th International Conference on Graphics and Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Graphics and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3406971.3406991","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Sampling is a popular approach in big data visualization, however, current sampling approaches don't work well when visualization type is scatter plot, and are even worse in supporting keyword search queries. In this paper, we present an approach of density-aware stratified sampling, it first probing the density of record in different areas of the visualization, then taking the density data to guide the stratified sampling. We conducted an extensively user study to show the efficiency and efficacy of our approach, the experiment shows that our approach can provide very close scatter plots of keyword search queries of a 200 million record dataset within 0.2 second, and the construction time is only 1/4 of an alternative method.