{"title":"A distributed geospatial data storage and processing framework for large-scale WebGIS","authors":"Yunqin Zhong, Jizhong Han, Tieying Zhang, Jinyun Fang","doi":"10.1109/Geoinformatics.2012.6270347","DOIUrl":null,"url":null,"abstract":"With the rapid growth of geospatial data and concurrent users, the state-of-the-art WebGIS cannot support massive data storage and processing due to poor scalability of underlying centralized systems (e.g., native file systems and SDBMS). In this paper, we propose a novel distributed geospatial data storage and processing framework for large-scale WebGIS. Our proposal contains three significant characteristics. Firstly, a scalable cloud-based architecture is designed to provide elastic storage and computation resources of shared-nothing commodity cluster for WebGIS. Secondly, we present efficient geospatial data placement and geospatial data access refinement schemes to improve I/O efficiency. Thirdly, we propose MapReduce based localized geospatial computing model for parallel processing of massive geospatial data, which improves geospatial computation performance. We have implemented a prototype named VegaCI on top of the emerging Hadoop cloud platform. Comprehensive experiments demonstrate that our proposal is efficient and applicable in practical large-scale WebGIS.","PeriodicalId":259976,"journal":{"name":"2012 20th International Conference on Geoinformatics","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 20th International Conference on Geoinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/Geoinformatics.2012.6270347","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 22
Abstract
With the rapid growth of geospatial data and concurrent users, the state-of-the-art WebGIS cannot support massive data storage and processing due to poor scalability of underlying centralized systems (e.g., native file systems and SDBMS). In this paper, we propose a novel distributed geospatial data storage and processing framework for large-scale WebGIS. Our proposal contains three significant characteristics. Firstly, a scalable cloud-based architecture is designed to provide elastic storage and computation resources of shared-nothing commodity cluster for WebGIS. Secondly, we present efficient geospatial data placement and geospatial data access refinement schemes to improve I/O efficiency. Thirdly, we propose MapReduce based localized geospatial computing model for parallel processing of massive geospatial data, which improves geospatial computation performance. We have implemented a prototype named VegaCI on top of the emerging Hadoop cloud platform. Comprehensive experiments demonstrate that our proposal is efficient and applicable in practical large-scale WebGIS.