Searching for correct specification in spatial probit models. Classical approaches versus Gradient Boosting algorithm

IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY
Miguel De la Llave , Fernando A. López
{"title":"Searching for correct specification in spatial probit models. Classical approaches versus Gradient Boosting algorithm","authors":"Miguel De la Llave ,&nbsp;Fernando A. López","doi":"10.1016/j.spasta.2024.100815","DOIUrl":null,"url":null,"abstract":"<div><p>Selecting correct specification in spatial model frameworks is a relevant research topic in spatial econometrics. The purpose of this paper is to examine and contrast two well-known model selection strategies, Specific-to-General, Stge, and General-to-Specific, Gets, in the context of spatial probit models. The results obtained from these classical methods are juxtaposed with those generated through the utilization of a powerful machine learning algorithm: Gradient Boosting. The paper includes an extensive Monte Carlo experiment to compare the performance of these three strategies with small and medium sample sizes. The results show that under ideal conditions, both classical strategies obtain similar results for medium-sized samples, but for small samples, Stge performs slightly better than Gets. The Gradient Boosting algorithm obtains slightly higher success rates than the classical strategies, especially with small samples sizes. Finally, the flow of both strategies is illustrated using a well-known dataset on the probability of businesses reopening in New Orleans in the aftermath of Hurricane Katrina.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2024-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S221167532400006X","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Selecting correct specification in spatial model frameworks is a relevant research topic in spatial econometrics. The purpose of this paper is to examine and contrast two well-known model selection strategies, Specific-to-General, Stge, and General-to-Specific, Gets, in the context of spatial probit models. The results obtained from these classical methods are juxtaposed with those generated through the utilization of a powerful machine learning algorithm: Gradient Boosting. The paper includes an extensive Monte Carlo experiment to compare the performance of these three strategies with small and medium sample sizes. The results show that under ideal conditions, both classical strategies obtain similar results for medium-sized samples, but for small samples, Stge performs slightly better than Gets. The Gradient Boosting algorithm obtains slightly higher success rates than the classical strategies, especially with small samples sizes. Finally, the flow of both strategies is illustrated using a well-known dataset on the probability of businesses reopening in New Orleans in the aftermath of Hurricane Katrina.

在空间概率模型中寻找正确的规范。经典方法与梯度提升算法
在空间模型框架中选择正确的规范是空间计量经济学的一个相关研究课题。本文旨在以空间 probit 模型为背景,研究和对比两种著名的模型选择策略:Specific-to-General(Stge)和 General-to-Specific(Gets)。本文将这些经典方法得出的结果与利用强大的机器学习算法得出的结果进行对比:梯度提升。论文包括一个广泛的蒙特卡罗实验,以比较这三种策略在中小样本量下的性能。结果表明,在理想条件下,两种经典策略在中等样本量时都能获得相似的结果,但在小样本量时,Stge 的表现略好于 Gets。梯度提升算法的成功率略高于经典策略,尤其是在样本量较小的情况下。最后,我们使用一个著名的数据集来说明这两种策略的流程,该数据集涉及卡特里娜飓风过后新奥尔良企业重新开业的概率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Spatial Statistics
Spatial Statistics GEOSCIENCES, MULTIDISCIPLINARY-MATHEMATICS, INTERDISCIPLINARY APPLICATIONS
CiteScore
4.00
自引率
21.70%
发文量
89
审稿时长
55 days
期刊介绍: Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication. Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信