Interpretable spatial machine learning for understanding spatial heterogeneity in factors affecting street theft crime

IF 4 2区 地球科学 Q1 GEOGRAPHY
Han Yue, Jianguo Chen
{"title":"Interpretable spatial machine learning for understanding spatial heterogeneity in factors affecting street theft crime","authors":"Han Yue,&nbsp;Jianguo Chen","doi":"10.1016/j.apgeog.2024.103503","DOIUrl":null,"url":null,"abstract":"<div><div>Machine learning techniques, such as random forest (RF), are increasingly utilized in geographical studies, including crime analysis. While RF excels in predictive accuracy and reduces overfitting, it does not account for spatial heterogeneity, where relationships between crime and its determinants vary by location. To address this limitation, this study utilizes a geographically weighted random forest (GWRF), which breaks down RF into local sub-models. A comprehensive set of variables is integrated to provide a thorough understanding of the factors influencing street theft crimes, including risk populations, streetscape environments, social disorganization, crime attractors and generators, and transportation accessibility. The results of the machine learning analysis are interpreted through an interpretability framework. Findings indicate that GWRF achieves greater predictive accuracy than traditional methods, reaching up to 80% accuracy on unseen data. Notably, the on-street population, derived from street view images, is the most significant contributor to street theft. Additionally, the results facilitate the examination of spatial non-stationarity concerning the importance of explanatory variables. GWRF effectively bridges machine learning and geographical models, enhancing our understanding of street theft occurrences. The local importance results enable the development of targeted safety interventions, rather than relying on general guidelines applicable to the entire region.</div></div>","PeriodicalId":48396,"journal":{"name":"Applied Geography","volume":"175 ","pages":"Article 103503"},"PeriodicalIF":4.0000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Geography","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0143622824003084","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY","Score":null,"Total":0}
引用次数: 0

Abstract

Machine learning techniques, such as random forest (RF), are increasingly utilized in geographical studies, including crime analysis. While RF excels in predictive accuracy and reduces overfitting, it does not account for spatial heterogeneity, where relationships between crime and its determinants vary by location. To address this limitation, this study utilizes a geographically weighted random forest (GWRF), which breaks down RF into local sub-models. A comprehensive set of variables is integrated to provide a thorough understanding of the factors influencing street theft crimes, including risk populations, streetscape environments, social disorganization, crime attractors and generators, and transportation accessibility. The results of the machine learning analysis are interpreted through an interpretability framework. Findings indicate that GWRF achieves greater predictive accuracy than traditional methods, reaching up to 80% accuracy on unseen data. Notably, the on-street population, derived from street view images, is the most significant contributor to street theft. Additionally, the results facilitate the examination of spatial non-stationarity concerning the importance of explanatory variables. GWRF effectively bridges machine learning and geographical models, enhancing our understanding of street theft occurrences. The local importance results enable the development of targeted safety interventions, rather than relying on general guidelines applicable to the entire region.
基于可解释空间机器学习的街头盗窃犯罪影响因素空间异质性研究
机器学习技术,如随机森林(RF),越来越多地用于地理研究,包括犯罪分析。虽然RF在预测准确性和减少过度拟合方面表现出色,但它没有考虑到空间异质性,即犯罪与其决定因素之间的关系因地点而异。为了解决这一限制,本研究利用地理加权随机森林(GWRF),将RF分解为局部子模型。综合了一套全面的变量,以全面了解影响街头盗窃犯罪的因素,包括风险人群、街道景观环境、社会混乱、犯罪吸引和产生因素以及交通可达性。机器学习分析的结果通过可解释性框架进行解释。研究结果表明,与传统方法相比,GWRF的预测精度更高,在未见数据上的准确率高达80%。值得注意的是,从街景图像中得出的街道人口是街头盗窃的最重要贡献者。此外,研究结果有助于检验解释变量重要性的空间非平稳性。GWRF有效地连接了机器学习和地理模型,增强了我们对街头盗窃事件的理解。当地重要性的结果使我们能够制定有针对性的安全干预措施,而不是依赖于适用于整个地区的一般准则。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Applied Geography
Applied Geography GEOGRAPHY-
CiteScore
8.00
自引率
2.00%
发文量
134
期刊介绍: Applied Geography is a journal devoted to the publication of research which utilizes geographic approaches (human, physical, nature-society and GIScience) to resolve human problems that have a spatial dimension. These problems may be related to the assessment, management and allocation of the world physical and/or human resources. The underlying rationale of the journal is that only through a clear understanding of the relevant societal, physical, and coupled natural-humans systems can we resolve such problems. Papers are invited on any theme involving the application of geographical theory and methodology in the resolution of human problems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信