{"title":"基于异构标记点模式的热点相关标签排序与提取的探索性分析","authors":"M. Ruocco, H. Ramampiaro","doi":"10.1145/2442796.2442802","DOIUrl":null,"url":null,"abstract":"The availability of a huge amount of geotagged resources on the web can be exploited to extract new useful information. We propose a set of estimators that are able to evaluate the degree of clustering of the spatial distribution of terms used to tag such geotagged resources. We introduce the concept of tag point pattern to derive indexes from the exploratory analysis by taking advantage of the second order Ripley's K-function, previously used in epidemiology, geo-statistics and ecology. The derived model estimates the degree of aggregation of the geotagged resources, taking into account the heterogeneity of the spatial distribution of the underlying population. Further, thanks to subsampling techniques, our approach is able to handle large datasets. Without losing of generality, we perform our experiments on a dataset derived Flickr pictures, as a use case. This dataset consists of tags that were extracted from a set of 1.2 million of pictures. We evaluate our proposed indexes with respect to their ability to extract tags related to geographical landmarks and hotspots. Our experiments show that we get good results using our estimators.","PeriodicalId":107369,"journal":{"name":"Workshop on Location-based Social Networks","volume":"82 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Exploratory analysis on heterogeneous tag-point patterns for ranking and extracting hot-spot related tags\",\"authors\":\"M. Ruocco, H. Ramampiaro\",\"doi\":\"10.1145/2442796.2442802\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The availability of a huge amount of geotagged resources on the web can be exploited to extract new useful information. We propose a set of estimators that are able to evaluate the degree of clustering of the spatial distribution of terms used to tag such geotagged resources. We introduce the concept of tag point pattern to derive indexes from the exploratory analysis by taking advantage of the second order Ripley's K-function, previously used in epidemiology, geo-statistics and ecology. The derived model estimates the degree of aggregation of the geotagged resources, taking into account the heterogeneity of the spatial distribution of the underlying population. Further, thanks to subsampling techniques, our approach is able to handle large datasets. Without losing of generality, we perform our experiments on a dataset derived Flickr pictures, as a use case. This dataset consists of tags that were extracted from a set of 1.2 million of pictures. We evaluate our proposed indexes with respect to their ability to extract tags related to geographical landmarks and hotspots. Our experiments show that we get good results using our estimators.\",\"PeriodicalId\":107369,\"journal\":{\"name\":\"Workshop on Location-based Social Networks\",\"volume\":\"82 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Workshop on Location-based Social Networks\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2442796.2442802\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Workshop on Location-based Social Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2442796.2442802","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Exploratory analysis on heterogeneous tag-point patterns for ranking and extracting hot-spot related tags
The availability of a huge amount of geotagged resources on the web can be exploited to extract new useful information. We propose a set of estimators that are able to evaluate the degree of clustering of the spatial distribution of terms used to tag such geotagged resources. We introduce the concept of tag point pattern to derive indexes from the exploratory analysis by taking advantage of the second order Ripley's K-function, previously used in epidemiology, geo-statistics and ecology. The derived model estimates the degree of aggregation of the geotagged resources, taking into account the heterogeneity of the spatial distribution of the underlying population. Further, thanks to subsampling techniques, our approach is able to handle large datasets. Without losing of generality, we perform our experiments on a dataset derived Flickr pictures, as a use case. This dataset consists of tags that were extracted from a set of 1.2 million of pictures. We evaluate our proposed indexes with respect to their ability to extract tags related to geographical landmarks and hotspots. Our experiments show that we get good results using our estimators.