{"title":"Geographical Gaussian Process Regression: A Spatial Machine-Learning Model Based on Spatial Similarity","authors":"Zhenzhi Jiao, Ran Tao","doi":"10.1111/gean.12423","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This study proposes a new spatial machine-learning model called geographical Gaussian process regression (GGPR). GGPR is extended from Gaussian process regression (GPR) by adopting the principle of spatial similarity for calibration, and it is designed to conduct spatial prediction and exploratory spatial data analysis (ESDA). GGPR addresses several key challenges in spatial machine learning. First, as a probabilistic model, GGPR avoids the conflict between spatial autocorrelation and the assumption of independent and identically distributed (i.i.d.), thus enhancing the model's objectivity and reliability in spatial prediction. Second, GGPR is suitable for small-sample prediction, a task that most existing models struggle with. Finally, when integrated with GeoShapley, GGPR is an explainable model that can measure spatial effects and explain the outcomes. Evaluated on two distinct datasets, GGPR demonstrates superior predictive performance compared to other popular machine-learning models across various sampling ratios, with its advantage becoming especially evident with smaller sample sizes. As an ESDA model, GGPR demonstrates enhanced accuracy, better computational efficiency, and a comparable ability to measure spatial effects against both multiscale geographically weighted regression and geographical random forests. In short, GGPR offers spatial data scientists a new method for predicting and exploring complex geographical processes.</p>\n </div>","PeriodicalId":12533,"journal":{"name":"Geographical Analysis","volume":"57 3","pages":"507-520"},"PeriodicalIF":4.3000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geographical Analysis","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/gean.12423","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY","Score":null,"Total":0}
引用次数: 0
Abstract
This study proposes a new spatial machine-learning model called geographical Gaussian process regression (GGPR). GGPR is extended from Gaussian process regression (GPR) by adopting the principle of spatial similarity for calibration, and it is designed to conduct spatial prediction and exploratory spatial data analysis (ESDA). GGPR addresses several key challenges in spatial machine learning. First, as a probabilistic model, GGPR avoids the conflict between spatial autocorrelation and the assumption of independent and identically distributed (i.i.d.), thus enhancing the model's objectivity and reliability in spatial prediction. Second, GGPR is suitable for small-sample prediction, a task that most existing models struggle with. Finally, when integrated with GeoShapley, GGPR is an explainable model that can measure spatial effects and explain the outcomes. Evaluated on two distinct datasets, GGPR demonstrates superior predictive performance compared to other popular machine-learning models across various sampling ratios, with its advantage becoming especially evident with smaller sample sizes. As an ESDA model, GGPR demonstrates enhanced accuracy, better computational efficiency, and a comparable ability to measure spatial effects against both multiscale geographically weighted regression and geographical random forests. In short, GGPR offers spatial data scientists a new method for predicting and exploring complex geographical processes.
期刊介绍:
First in its specialty area and one of the most frequently cited publications in geography, Geographical Analysis has, since 1969, presented significant advances in geographical theory, model building, and quantitative methods to geographers and scholars in a wide spectrum of related fields. Traditionally, mathematical and nonmathematical articulations of geographical theory, and statements and discussions of the analytic paradigm are published in the journal. Spatial data analyses and spatial econometrics and statistics are strongly represented.