使用深度聚类方法的空间引导:空间机器学习在伦巴第高科技企业中的应用

IF 2.1 Q2 GEOGRAPHY
Bumbea Alessio , Mazzitelli Andrea , Giuffrida Annamaria , Espa Giuseppe
{"title":"使用深度聚类方法的空间引导:空间机器学习在伦巴第高科技企业中的应用","authors":"Bumbea Alessio ,&nbsp;Mazzitelli Andrea ,&nbsp;Giuffrida Annamaria ,&nbsp;Espa Giuseppe","doi":"10.1016/j.rspp.2025.100242","DOIUrl":null,"url":null,"abstract":"<div><div>Bootstrap and clustering techniques are foundational tools across scientific disciplines, playing a particularly important role in spatial analysis. However, traditional bootstrap methods often fall short in preserving spatial dependencies and complex attribute relationships during resampling. In this work, we introduce a novel framework in the Spatial Machine Learning domain that leverages deep learning techniques to enhance stratified bootstrap procedures for spatial data. Deep learning has already revolutionized prediction and classification tasks in data with temporal and spatial dependencies. In this work we want to extend the scope of application to bootstrap analysis by using tools like entity embeddings and autoencoders. By encoding high-cardinality categorical variables into continuous representations, entity embeddings facilitate the discovery of meaningful spatial and attribute-based cluster. These embeddings are then passed to a Deep Embedded Clustering (DEC) algorithm that can use them to create clusters. This algorithm is able to handle high-dimensional big data using an autoencoder based architecture that performs dimensionality reduction and clustering simultaneously to avoid loss of information. These clusters can be finally used as strata that guide a stratified bootstrap approach which preserves spatial autocorrelation and heterogeneity. We demonstrate the utility of our framework by performing a bootstrap analysis of high-tech firm productivity in the Lombardy region. Our approach is able to analyze efficiently large amounts of high dimensional data with complex attributes.</div></div>","PeriodicalId":45520,"journal":{"name":"Regional Science Policy and Practice","volume":"17 12","pages":"Article 100242"},"PeriodicalIF":2.1000,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Spatial bootstrapping using deep clustering methods: Spatial machine learning applied to Lombardy high-tech businesses\",\"authors\":\"Bumbea Alessio ,&nbsp;Mazzitelli Andrea ,&nbsp;Giuffrida Annamaria ,&nbsp;Espa Giuseppe\",\"doi\":\"10.1016/j.rspp.2025.100242\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Bootstrap and clustering techniques are foundational tools across scientific disciplines, playing a particularly important role in spatial analysis. However, traditional bootstrap methods often fall short in preserving spatial dependencies and complex attribute relationships during resampling. In this work, we introduce a novel framework in the Spatial Machine Learning domain that leverages deep learning techniques to enhance stratified bootstrap procedures for spatial data. Deep learning has already revolutionized prediction and classification tasks in data with temporal and spatial dependencies. In this work we want to extend the scope of application to bootstrap analysis by using tools like entity embeddings and autoencoders. By encoding high-cardinality categorical variables into continuous representations, entity embeddings facilitate the discovery of meaningful spatial and attribute-based cluster. These embeddings are then passed to a Deep Embedded Clustering (DEC) algorithm that can use them to create clusters. This algorithm is able to handle high-dimensional big data using an autoencoder based architecture that performs dimensionality reduction and clustering simultaneously to avoid loss of information. These clusters can be finally used as strata that guide a stratified bootstrap approach which preserves spatial autocorrelation and heterogeneity. We demonstrate the utility of our framework by performing a bootstrap analysis of high-tech firm productivity in the Lombardy region. Our approach is able to analyze efficiently large amounts of high dimensional data with complex attributes.</div></div>\",\"PeriodicalId\":45520,\"journal\":{\"name\":\"Regional Science Policy and Practice\",\"volume\":\"17 12\",\"pages\":\"Article 100242\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Regional Science Policy and Practice\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1757780225000721\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GEOGRAPHY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Regional Science Policy and Practice","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1757780225000721","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOGRAPHY","Score":null,"Total":0}
引用次数: 0

摘要

Bootstrap和聚类技术是跨科学学科的基础工具,在空间分析中起着特别重要的作用。然而,传统的自举方法在重采样过程中往往无法保持空间依赖关系和复杂的属性关系。在这项工作中,我们在空间机器学习领域引入了一个新的框架,该框架利用深度学习技术来增强空间数据的分层自举过程。深度学习已经彻底改变了具有时间和空间依赖性的数据的预测和分类任务。在这项工作中,我们希望通过使用实体嵌入和自动编码器等工具,将应用范围扩展到自举分析。通过将高基数分类变量编码为连续表示,实体嵌入有助于发现有意义的空间和基于属性的聚类。然后将这些嵌入传递给深度嵌入聚类(DEC)算法,该算法可以使用它们来创建聚类。该算法使用基于自编码器的架构来处理高维大数据,该架构同时进行降维和聚类,以避免信息丢失。这些聚类最终可以用作地层,指导分层自举方法,从而保持空间自相关和异质性。我们通过对伦巴第地区的高科技企业生产率进行自举分析来证明我们的框架的实用性。我们的方法能够有效地分析具有复杂属性的大量高维数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Spatial bootstrapping using deep clustering methods: Spatial machine learning applied to Lombardy high-tech businesses
Bootstrap and clustering techniques are foundational tools across scientific disciplines, playing a particularly important role in spatial analysis. However, traditional bootstrap methods often fall short in preserving spatial dependencies and complex attribute relationships during resampling. In this work, we introduce a novel framework in the Spatial Machine Learning domain that leverages deep learning techniques to enhance stratified bootstrap procedures for spatial data. Deep learning has already revolutionized prediction and classification tasks in data with temporal and spatial dependencies. In this work we want to extend the scope of application to bootstrap analysis by using tools like entity embeddings and autoencoders. By encoding high-cardinality categorical variables into continuous representations, entity embeddings facilitate the discovery of meaningful spatial and attribute-based cluster. These embeddings are then passed to a Deep Embedded Clustering (DEC) algorithm that can use them to create clusters. This algorithm is able to handle high-dimensional big data using an autoencoder based architecture that performs dimensionality reduction and clustering simultaneously to avoid loss of information. These clusters can be finally used as strata that guide a stratified bootstrap approach which preserves spatial autocorrelation and heterogeneity. We demonstrate the utility of our framework by performing a bootstrap analysis of high-tech firm productivity in the Lombardy region. Our approach is able to analyze efficiently large amounts of high dimensional data with complex attributes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.60
自引率
5.90%
发文量
92
期刊介绍: Regional Science Policy & Practice (RSPP) is the official policy and practitioner orientated journal of the Regional Science Association International. It is an international journal that publishes high quality papers in applied regional science that explore policy and practice issues in regional and local development. It welcomes papers from a range of academic disciplines and practitioners including planning, public policy, geography, economics and environmental science and related fields. Papers should address the interface between academic debates and policy development and application. RSPP provides an opportunity for academics and policy makers to develop a dialogue to identify and explore many of the challenges facing local and regional economies.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信