Transfer learning for high dimensional spatial autoregressive model

IF 2.5 2区数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY

Spatial Statistics Pub Date : 2025-06-13 DOI:10.1016/j.spasta.2025.100908

Yunquan Song, Xuan Chen, Rui Yang, Yijun Li

{"title":"Transfer learning for high dimensional spatial autoregressive model","authors":"Yunquan Song, Xuan Chen, Rui Yang, Yijun Li","doi":"10.1016/j.spasta.2025.100908","DOIUrl":null,"url":null,"abstract":"<div><div>Transfer learning is a learning process that applies models learned in old domains to new domains by utilizing similarities between data, tasks, or models. At present, transfer learning has been widely applied, such as natural language processing, recommendation systems, drug analysis, etc. Research in statistical models mostly focuses on classic linear models such as classification and regression. It is still unclear how transfer learning affects spatial data. Spatial data is an important type of data and has been a hot research topic in statistics and econometrics in recent years. However, in reality, its collection and labeling are expensive and labor-intensive, and there may not be enough data to train a robust model. Therefore, this article considers using auxiliary sample sets that are different from the target dataset but have some similarity to help us estimate and predict the target model, and specifies criteria for determining similarity. We propose transfer learning algorithms based on spatial autoregressive models, which can transfer knowledge from auxiliary datasets to target models of interest to us. Its performance has been demonstrated in numerical simulations and real housing price datasets.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"68 ","pages":"Article 100908"},"PeriodicalIF":2.5000,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211675325000302","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Transfer learning is a learning process that applies models learned in old domains to new domains by utilizing similarities between data, tasks, or models. At present, transfer learning has been widely applied, such as natural language processing, recommendation systems, drug analysis, etc. Research in statistical models mostly focuses on classic linear models such as classification and regression. It is still unclear how transfer learning affects spatial data. Spatial data is an important type of data and has been a hot research topic in statistics and econometrics in recent years. However, in reality, its collection and labeling are expensive and labor-intensive, and there may not be enough data to train a robust model. Therefore, this article considers using auxiliary sample sets that are different from the target dataset but have some similarity to help us estimate and predict the target model, and specifies criteria for determining similarity. We propose transfer learning algorithms based on spatial autoregressive models, which can transfer knowledge from auxiliary datasets to target models of interest to us. Its performance has been demonstrated in numerical simulations and real housing price datasets.

查看原文本刊更多论文

高维空间自回归模型的迁移学习

迁移学习是一种学习过程，通过利用数据、任务或模型之间的相似性，将在旧领域学习到的模型应用到新领域。目前，迁移学习已经得到了广泛的应用，如自然语言处理、推荐系统、药物分析等。统计模型的研究主要集中在分类、回归等经典线性模型上。目前尚不清楚迁移学习如何影响空间数据。空间数据是一种重要的数据类型，是近年来统计学和计量经济学研究的热点。然而，在现实中，它的收集和标记是昂贵和劳动密集型的，并且可能没有足够的数据来训练一个鲁棒模型。因此，本文考虑使用与目标数据集不同但具有一定相似性的辅助样本集来帮助我们估计和预测目标模型，并规定了确定相似性的标准。我们提出了基于空间自回归模型的迁移学习算法，该算法可以将知识从辅助数据集转移到我们感兴趣的目标模型。通过数值模拟和实际房价数据集验证了其性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Spatial Statistics GEOSCIENCES, MULTIDISCIPLINARY-MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

CiteScore

4.00

自引率

21.70%

发文量

审稿时长

55 days

期刊介绍： Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication. Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.