Statistical Analysis in the Presence of Spatial Autocorrelation: Selected Sampling Strategy Effects

IF 0.9 Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Stats Pub Date : 2022-12-16 DOI:10.3390/stats5040081

D. Griffith, R. Plant

{"title":"Statistical Analysis in the Presence of Spatial Autocorrelation: Selected Sampling Strategy Effects","authors":"D. Griffith, R. Plant","doi":"10.3390/stats5040081","DOIUrl":null,"url":null,"abstract":"Fundamental to most classical data collection sampling theory development is the random drawings assumption requiring that each targeted population member has a known sample selection (i.e., inclusion) probability. Frequently, however, unrestricted random sampling of spatially autocorrelated data is impractical and/or inefficient. Instead, randomly choosing a population subset accounts for its exhibited spatial pattern by utilizing a grid, which often provides improved parameter estimates, such as the geographic landscape mean, at least via its precision. Unfortunately, spatial autocorrelation latent in these data can produce a questionable mean and/or standard error estimate because each sampled population member contains information about its nearby members, a data feature explicitly acknowledged in model-based inference, but ignored in design-based inference. This autocorrelation effect prompted the development of formulae for calculating an effective sample size (i.e., the equivalent number of sample selections from a geographically randomly distributed population that would yield the same sampling error) estimate. Some researchers recently challenged this and other aspects of spatial statistics as being incorrect/invalid/misleading. This paper seeks to address this category of misconceptions, demonstrating that the effective geographic sample size is a valid and useful concept regardless of the inferential basis invoked. Its spatial statistical methodology builds upon the preceding ingredients.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stats","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/stats5040081","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 3

Abstract

Fundamental to most classical data collection sampling theory development is the random drawings assumption requiring that each targeted population member has a known sample selection (i.e., inclusion) probability. Frequently, however, unrestricted random sampling of spatially autocorrelated data is impractical and/or inefficient. Instead, randomly choosing a population subset accounts for its exhibited spatial pattern by utilizing a grid, which often provides improved parameter estimates, such as the geographic landscape mean, at least via its precision. Unfortunately, spatial autocorrelation latent in these data can produce a questionable mean and/or standard error estimate because each sampled population member contains information about its nearby members, a data feature explicitly acknowledged in model-based inference, but ignored in design-based inference. This autocorrelation effect prompted the development of formulae for calculating an effective sample size (i.e., the equivalent number of sample selections from a geographically randomly distributed population that would yield the same sampling error) estimate. Some researchers recently challenged this and other aspects of spatial statistics as being incorrect/invalid/misleading. This paper seeks to address this category of misconceptions, demonstrating that the effective geographic sample size is a valid and useful concept regardless of the inferential basis invoked. Its spatial statistical methodology builds upon the preceding ingredients.

查看原文本刊更多论文

存在空间自相关的统计分析：选定的采样策略效果

大多数经典数据收集抽样理论发展的基础是随机绘图假设，要求每个目标人口成员具有已知的样本选择(即纳入)概率。然而，空间自相关数据的不受限制的随机抽样通常是不切实际和/或低效的。相反，通过使用网格随机选择一个人口子集来解释其展示的空间格局，这通常提供改进的参数估计，例如地理景观平均值，至少通过其精度。不幸的是，这些数据中潜在的空间自相关可能会产生可疑的平均值和/或标准误差估计，因为每个抽样总体成员包含有关其附近成员的信息，这是基于模型的推理中明确承认的数据特征，但在基于设计的推理中被忽略。这种自相关效应促使了计算有效样本量(即，从地理上随机分布的人口中选出的产生相同抽样误差的相同数量的样本)估计公式的发展。一些研究人员最近质疑这一点以及空间统计的其他方面是不正确的/无效的/误导性的。本文试图解决这类误解，证明有效的地理样本量是一个有效和有用的概念，无论援引的推理基础如何。其空间统计方法建立在上述成分的基础上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Stats

CiteScore

0.60

自引率

0.00%

发文量

审稿时长

7 weeks