在空间概率模型中寻找正确的规范。经典方法与梯度提升算法

IF 2.5 2区数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY

Spatial Statistics Pub Date : 2024-04-06 DOI:10.1016/j.spasta.2024.100815

Miguel De la Llave , Fernando A. López

{"title":"在空间概率模型中寻找正确的规范。经典方法与梯度提升算法","authors":"Miguel De la Llave , Fernando A. López","doi":"10.1016/j.spasta.2024.100815","DOIUrl":null,"url":null,"abstract":"<div><p>Selecting correct specification in spatial model frameworks is a relevant research topic in spatial econometrics. The purpose of this paper is to examine and contrast two well-known model selection strategies, Specific-to-General, Stge, and General-to-Specific, Gets, in the context of spatial probit models. The results obtained from these classical methods are juxtaposed with those generated through the utilization of a powerful machine learning algorithm: Gradient Boosting. The paper includes an extensive Monte Carlo experiment to compare the performance of these three strategies with small and medium sample sizes. The results show that under ideal conditions, both classical strategies obtain similar results for medium-sized samples, but for small samples, Stge performs slightly better than Gets. The Gradient Boosting algorithm obtains slightly higher success rates than the classical strategies, especially with small samples sizes. Finally, the flow of both strategies is illustrated using a well-known dataset on the probability of businesses reopening in New Orleans in the aftermath of Hurricane Katrina.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100815"},"PeriodicalIF":2.5000,"publicationDate":"2024-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Searching for correct specification in spatial probit models. Classical approaches versus Gradient Boosting algorithm\",\"authors\":\"Miguel De la Llave , Fernando A. López\",\"doi\":\"10.1016/j.spasta.2024.100815\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Selecting correct specification in spatial model frameworks is a relevant research topic in spatial econometrics. The purpose of this paper is to examine and contrast two well-known model selection strategies, Specific-to-General, Stge, and General-to-Specific, Gets, in the context of spatial probit models. The results obtained from these classical methods are juxtaposed with those generated through the utilization of a powerful machine learning algorithm: Gradient Boosting. The paper includes an extensive Monte Carlo experiment to compare the performance of these three strategies with small and medium sample sizes. The results show that under ideal conditions, both classical strategies obtain similar results for medium-sized samples, but for small samples, Stge performs slightly better than Gets. The Gradient Boosting algorithm obtains slightly higher success rates than the classical strategies, especially with small samples sizes. Finally, the flow of both strategies is illustrated using a well-known dataset on the probability of businesses reopening in New Orleans in the aftermath of Hurricane Katrina.</p></div>\",\"PeriodicalId\":48771,\"journal\":{\"name\":\"Spatial Statistics\",\"volume\":\"61 \",\"pages\":\"Article 100815\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2024-04-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Spatial Statistics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S221167532400006X\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S221167532400006X","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

在空间模型框架中选择正确的规范是空间计量经济学的一个相关研究课题。本文旨在以空间 probit 模型为背景，研究和对比两种著名的模型选择策略：Specific-to-General（Stge）和 General-to-Specific（Gets）。本文将这些经典方法得出的结果与利用强大的机器学习算法得出的结果进行对比：梯度提升。论文包括一个广泛的蒙特卡罗实验，以比较这三种策略在中小样本量下的性能。结果表明，在理想条件下，两种经典策略在中等样本量时都能获得相似的结果，但在小样本量时，Stge 的表现略好于 Gets。梯度提升算法的成功率略高于经典策略，尤其是在样本量较小的情况下。最后，我们使用一个著名的数据集来说明这两种策略的流程，该数据集涉及卡特里娜飓风过后新奥尔良企业重新开业的概率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Searching for correct specification in spatial probit models. Classical approaches versus Gradient Boosting algorithm

Selecting correct specification in spatial model frameworks is a relevant research topic in spatial econometrics. The purpose of this paper is to examine and contrast two well-known model selection strategies, Specific-to-General, Stge, and General-to-Specific, Gets, in the context of spatial probit models. The results obtained from these classical methods are juxtaposed with those generated through the utilization of a powerful machine learning algorithm: Gradient Boosting. The paper includes an extensive Monte Carlo experiment to compare the performance of these three strategies with small and medium sample sizes. The results show that under ideal conditions, both classical strategies obtain similar results for medium-sized samples, but for small samples, Stge performs slightly better than Gets. The Gradient Boosting algorithm obtains slightly higher success rates than the classical strategies, especially with small samples sizes. Finally, the flow of both strategies is illustrated using a well-known dataset on the probability of businesses reopening in New Orleans in the aftermath of Hurricane Katrina.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Spatial Statistics GEOSCIENCES, MULTIDISCIPLINARY-MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

CiteScore

4.00

自引率

21.70%

发文量

审稿时长

55 days

期刊介绍： Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication. Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.