{"title":"A General Method for Resampling Autocorrelated Spatial Data","authors":"Rudy Arthur","doi":"10.1111/gean.12417","DOIUrl":null,"url":null,"abstract":"<p>Comparing spatial data sets is a ubiquitous task in data analysis, however the presence of spatial autocorrelation means that standard estimates of variance will be wrong and tend to over-estimate the statistical significance of correlations and other observations. While there are a number of existing approaches to this problem, none are ideal, requiring detailed analytical calculations, which are hard to generalize or detailed modeling of the data generating process, which may not be straightforward. In this work we propose an approach based on permuting or resampling at fixed spatial autocorrelation, measured by Moran's I, in order to generate a null model that accounts for spatial dependence. Testing on real and synthetic data, we find that, as long as the spatial autocorrelation is not too strong, this approach works just as well as if we knew the data generating process exactly and allows us to compute <i>P</i>-values with the correct Type-I error rate.</p>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"57 2","pages":"302-319"},"PeriodicalIF":3.3000,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/gean.12417","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geographical Analysis","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/gean.12417","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY","Score":null,"Total":0}
引用次数: 0
Abstract
Comparing spatial data sets is a ubiquitous task in data analysis, however the presence of spatial autocorrelation means that standard estimates of variance will be wrong and tend to over-estimate the statistical significance of correlations and other observations. While there are a number of existing approaches to this problem, none are ideal, requiring detailed analytical calculations, which are hard to generalize or detailed modeling of the data generating process, which may not be straightforward. In this work we propose an approach based on permuting or resampling at fixed spatial autocorrelation, measured by Moran's I, in order to generate a null model that accounts for spatial dependence. Testing on real and synthetic data, we find that, as long as the spatial autocorrelation is not too strong, this approach works just as well as if we knew the data generating process exactly and allows us to compute P-values with the correct Type-I error rate.
期刊介绍:
First in its specialty area and one of the most frequently cited publications in geography, Geographical Analysis has, since 1969, presented significant advances in geographical theory, model building, and quantitative methods to geographers and scholars in a wide spectrum of related fields. Traditionally, mathematical and nonmathematical articulations of geographical theory, and statements and discussions of the analytic paradigm are published in the journal. Spatial data analyses and spatial econometrics and statistics are strongly represented.