{"title":"Where's Waldo?: Geosocial Search over Myriad Geotagged Posts","authors":"Barak Pat, Y. Kanza","doi":"10.1145/3139958.3139962","DOIUrl":null,"url":null,"abstract":"The myriad geotagged posts in the social media constitute a vibrant information source that can be used to support geosocial search, that is, a search for geographic locations based on user activities in online social networks and microblogging platforms. Unlike a traditional geographic search, the results of a geosocial search are not restricted to predefined entities, and may reflect events, sentiments, and other matters that are expressed in the social media. A search for \"jogging\", for instance, will indicate popular jogging places. A search for \"4-th of July Fireworks\" would point out places where people watch the spectacle and tweet about it. Yet, geosocial search is different from ordinary Web search because there is no natural partition of the space into documents. There is a need to find new ways to effectively rank, filter, and present results. In this paper, we introduce a novel two-step search process of first, quickly finding relevant areas by using an arbitrarily indexed partition of the space, and second, applying clustering to the geotagged posts in the discovered areas, to present more accurate results. We propose and compare four different ranking measures for evaluating the relevance of an area to a given query. Our experiments, over a dataset of more than 40 million geotagged posts, illustrate the effectiveness of geosocial search, e.g., for finding events, or in a search based on a sentiment, in comparison to ordinary geographic search. Online search is supported by a partition-aware inverted index. Using the index, results are retrieved in a fraction of a second over millions of posts, even on a single standard machine.","PeriodicalId":270649,"journal":{"name":"Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","volume":"83 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3139958.3139962","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
The myriad geotagged posts in the social media constitute a vibrant information source that can be used to support geosocial search, that is, a search for geographic locations based on user activities in online social networks and microblogging platforms. Unlike a traditional geographic search, the results of a geosocial search are not restricted to predefined entities, and may reflect events, sentiments, and other matters that are expressed in the social media. A search for "jogging", for instance, will indicate popular jogging places. A search for "4-th of July Fireworks" would point out places where people watch the spectacle and tweet about it. Yet, geosocial search is different from ordinary Web search because there is no natural partition of the space into documents. There is a need to find new ways to effectively rank, filter, and present results. In this paper, we introduce a novel two-step search process of first, quickly finding relevant areas by using an arbitrarily indexed partition of the space, and second, applying clustering to the geotagged posts in the discovered areas, to present more accurate results. We propose and compare four different ranking measures for evaluating the relevance of an area to a given query. Our experiments, over a dataset of more than 40 million geotagged posts, illustrate the effectiveness of geosocial search, e.g., for finding events, or in a search based on a sentiment, in comparison to ordinary geographic search. Online search is supported by a partition-aware inverted index. Using the index, results are retrieved in a fraction of a second over millions of posts, even on a single standard machine.