{"title":"Pivotal discrepancy measures for Bayesian modelling of spatio-temporal data","authors":"Lindsay R. Morris, Nokuthaba Sibanda","doi":"10.1007/s10651-022-00529-4","DOIUrl":null,"url":null,"abstract":"<p>Within the field of geostatistics, Gaussian processes are a staple for modelling spatial and spatio-temporal data. Statistical literature is rich with estimation methods for the mean and covariance of such processes (in both frequentist and Bayesian contexts). Considerably less attention has been paid to developing goodness-of-fit tests for assessment of model adequacy. Jun et al. (Environmetrics 25(8):584–595, 2014) introduced a statistical test that uses pivotal discrepancy measures to assess goodness-of-fit in the Bayesian context. We present a modification and generalization of their statistical test. The initial method involves spatial partitioning of the data, followed by evaluation of a pivotal discrepancy measure at each posterior draw to obtain a posterior distribution of pivotal statistics. Order statistics from this distribution are used to obtain approximate p-values. Jun et al. (Environmetrics 25(8):584–595, 2014) use arbitrary partitions based on pre-existing spatial boundaries. The partitions are made to be of equal size. Our contribution is two-fold. We use K-means clustering to create the spatial partitions and we generalise Jun et al.’s approach to incorporate unequal partition sizes. Observations from a spatial or spatio-temporal process are partitioned using an appropriate feature vector that incorporates the geographic location of the observations into subsets (not necessarily of the same size). The method’s viability is illustrated in a simulation study, and in an application to hoki (<i>Macruronus novaezelandiae</i>) catch data from a survey of the sub-Antarctic region.</p>","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"32 1","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2022-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental and Ecological Statistics","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s10651-022-00529-4","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 1
Abstract
Within the field of geostatistics, Gaussian processes are a staple for modelling spatial and spatio-temporal data. Statistical literature is rich with estimation methods for the mean and covariance of such processes (in both frequentist and Bayesian contexts). Considerably less attention has been paid to developing goodness-of-fit tests for assessment of model adequacy. Jun et al. (Environmetrics 25(8):584–595, 2014) introduced a statistical test that uses pivotal discrepancy measures to assess goodness-of-fit in the Bayesian context. We present a modification and generalization of their statistical test. The initial method involves spatial partitioning of the data, followed by evaluation of a pivotal discrepancy measure at each posterior draw to obtain a posterior distribution of pivotal statistics. Order statistics from this distribution are used to obtain approximate p-values. Jun et al. (Environmetrics 25(8):584–595, 2014) use arbitrary partitions based on pre-existing spatial boundaries. The partitions are made to be of equal size. Our contribution is two-fold. We use K-means clustering to create the spatial partitions and we generalise Jun et al.’s approach to incorporate unequal partition sizes. Observations from a spatial or spatio-temporal process are partitioned using an appropriate feature vector that incorporates the geographic location of the observations into subsets (not necessarily of the same size). The method’s viability is illustrated in a simulation study, and in an application to hoki (Macruronus novaezelandiae) catch data from a survey of the sub-Antarctic region.
期刊介绍:
Environmental and Ecological Statistics publishes papers on practical applications of statistics and related quantitative methods to environmental science addressing contemporary issues.
Emphasis is on applied mathematical statistics, statistical methodology, and data interpretation and improvement for future use, with a view to advance statistics for environment, ecology and environmental health, and to advance environmental theory and practice using valid statistics.
Besides clarity of exposition, a single most important criterion for publication is the appropriateness of the statistical method to the particular environmental problem. The Journal covers all aspects of the collection, analysis, presentation and interpretation of environmental data for research, policy and regulation. The Journal is cross-disciplinary within the context of contemporary environmental issues and the associated statistical tools, concepts and methods. The Journal broadly covers theory and methods, case studies and applications, environmental change and statistical ecology, environmental health statistics and stochastics, and related areas. Special features include invited discussion papers; research communications; technical notes and consultation corner; mini-reviews; letters to the Editor; news, views and announcements; hardware and software reviews; data management etc.