{"title":"Interpretable spatial machine learning for understanding spatial heterogeneity in factors affecting street theft crime","authors":"Han Yue, Jianguo Chen","doi":"10.1016/j.apgeog.2024.103503","DOIUrl":null,"url":null,"abstract":"<div><div>Machine learning techniques, such as random forest (RF), are increasingly utilized in geographical studies, including crime analysis. While RF excels in predictive accuracy and reduces overfitting, it does not account for spatial heterogeneity, where relationships between crime and its determinants vary by location. To address this limitation, this study utilizes a geographically weighted random forest (GWRF), which breaks down RF into local sub-models. A comprehensive set of variables is integrated to provide a thorough understanding of the factors influencing street theft crimes, including risk populations, streetscape environments, social disorganization, crime attractors and generators, and transportation accessibility. The results of the machine learning analysis are interpreted through an interpretability framework. Findings indicate that GWRF achieves greater predictive accuracy than traditional methods, reaching up to 80% accuracy on unseen data. Notably, the on-street population, derived from street view images, is the most significant contributor to street theft. Additionally, the results facilitate the examination of spatial non-stationarity concerning the importance of explanatory variables. GWRF effectively bridges machine learning and geographical models, enhancing our understanding of street theft occurrences. The local importance results enable the development of targeted safety interventions, rather than relying on general guidelines applicable to the entire region.</div></div>","PeriodicalId":48396,"journal":{"name":"Applied Geography","volume":"175 ","pages":"Article 103503"},"PeriodicalIF":4.0000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Geography","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0143622824003084","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY","Score":null,"Total":0}
引用次数: 0
Abstract
Machine learning techniques, such as random forest (RF), are increasingly utilized in geographical studies, including crime analysis. While RF excels in predictive accuracy and reduces overfitting, it does not account for spatial heterogeneity, where relationships between crime and its determinants vary by location. To address this limitation, this study utilizes a geographically weighted random forest (GWRF), which breaks down RF into local sub-models. A comprehensive set of variables is integrated to provide a thorough understanding of the factors influencing street theft crimes, including risk populations, streetscape environments, social disorganization, crime attractors and generators, and transportation accessibility. The results of the machine learning analysis are interpreted through an interpretability framework. Findings indicate that GWRF achieves greater predictive accuracy than traditional methods, reaching up to 80% accuracy on unseen data. Notably, the on-street population, derived from street view images, is the most significant contributor to street theft. Additionally, the results facilitate the examination of spatial non-stationarity concerning the importance of explanatory variables. GWRF effectively bridges machine learning and geographical models, enhancing our understanding of street theft occurrences. The local importance results enable the development of targeted safety interventions, rather than relying on general guidelines applicable to the entire region.
期刊介绍:
Applied Geography is a journal devoted to the publication of research which utilizes geographic approaches (human, physical, nature-society and GIScience) to resolve human problems that have a spatial dimension. These problems may be related to the assessment, management and allocation of the world physical and/or human resources. The underlying rationale of the journal is that only through a clear understanding of the relevant societal, physical, and coupled natural-humans systems can we resolve such problems. Papers are invited on any theme involving the application of geographical theory and methodology in the resolution of human problems.