{"title":"Efficient Distributed Spatial Semijoins and Their Application in Multiple-Site Queries","authors":"Nawshad Farruque, W. Osborn","doi":"10.1109/AINA.2014.132","DOIUrl":null,"url":null,"abstract":"Applications exist today that require the management of distributed spatial data. Since spatial data is more complex than non-spatial data, performing distributed queries on it requires the consideration of both local processing (i.e. CPU and I/O) time and data transmission cost. To reduce these costs, one can use a distributed spatial semi join as it eliminates unnecessary objects before their transmission to other sites and the query site. In this paper, we propose both new approaches for representing the spatial semi join in a distributed setting, and their use for processing distributed queries consisting of any number of sites. We have tested our algorithms for four sites, which are a part of an actual working distributed system. We compare our algorithms with respect to data transmission cost, CPU time, I/O time and false positive results. We show that our algorithms are superior in many cases at optimizing the above criteria.","PeriodicalId":316052,"journal":{"name":"2014 IEEE 28th International Conference on Advanced Information Networking and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE 28th International Conference on Advanced Information Networking and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AINA.2014.132","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Applications exist today that require the management of distributed spatial data. Since spatial data is more complex than non-spatial data, performing distributed queries on it requires the consideration of both local processing (i.e. CPU and I/O) time and data transmission cost. To reduce these costs, one can use a distributed spatial semi join as it eliminates unnecessary objects before their transmission to other sites and the query site. In this paper, we propose both new approaches for representing the spatial semi join in a distributed setting, and their use for processing distributed queries consisting of any number of sites. We have tested our algorithms for four sites, which are a part of an actual working distributed system. We compare our algorithms with respect to data transmission cost, CPU time, I/O time and false positive results. We show that our algorithms are superior in many cases at optimizing the above criteria.