Shohei Yokoyama, Ágnes Bogárdi-Mészöly, H. Ishikawa
{"title":"EBSCAN: An Entanglement-based Algorithm for Discovering Dense Regions in Large Geo-social Data Streams with Noise","authors":"Shohei Yokoyama, Ágnes Bogárdi-Mészöly, H. Ishikawa","doi":"10.1145/2830657.2830661","DOIUrl":null,"url":null,"abstract":"The remarkable growth of social networking services on global positioning system (GPS)-enabled handheld devices has produced enormous amounts of georeferenced big data. Given a large spatial dataset, the challenge is to effectively discover dense regions from the dataset. Dense regions might be the most attractive area in a city or the most dangerous zone of a town. A solution to this problem can be useful in many applications, including marketing, tourism, and social research. Density-based clustering methods, such as DBSCAN, are often used for this purpose. Nevertheless, current spatial clustering methods emphasize density while neglecting human behavior derived from geographical features. In this paper, we propose EBSCAN, which is based on the novel idea of an entanglement-based approach. Our method considers not only spatial information but also human behavior derived from geographical features. Another problem is that competing methods such as DBSCAN have two input parameters. Thus, it is difficult to determine optimal values. EBSCAN requires only a single intuitive parameter, tooFar, to discover dense regions. Finally, we evaluate the effectiveness of the proposed method using both toy examples and real datasets. Our experimentally obtained results reveal the properties of EBSCAN and show that it is >10 times faster than the competitor.","PeriodicalId":198109,"journal":{"name":"Proceedings of the 8th ACM SIGSPATIAL International Workshop on Location-Based Social Networks","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 8th ACM SIGSPATIAL International Workshop on Location-Based Social Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2830657.2830661","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
The remarkable growth of social networking services on global positioning system (GPS)-enabled handheld devices has produced enormous amounts of georeferenced big data. Given a large spatial dataset, the challenge is to effectively discover dense regions from the dataset. Dense regions might be the most attractive area in a city or the most dangerous zone of a town. A solution to this problem can be useful in many applications, including marketing, tourism, and social research. Density-based clustering methods, such as DBSCAN, are often used for this purpose. Nevertheless, current spatial clustering methods emphasize density while neglecting human behavior derived from geographical features. In this paper, we propose EBSCAN, which is based on the novel idea of an entanglement-based approach. Our method considers not only spatial information but also human behavior derived from geographical features. Another problem is that competing methods such as DBSCAN have two input parameters. Thus, it is difficult to determine optimal values. EBSCAN requires only a single intuitive parameter, tooFar, to discover dense regions. Finally, we evaluate the effectiveness of the proposed method using both toy examples and real datasets. Our experimentally obtained results reveal the properties of EBSCAN and show that it is >10 times faster than the competitor.