{"title":"Transfer learning for high dimensional spatial autoregressive model","authors":"Yunquan Song, Xuan Chen, Rui Yang, Yijun Li","doi":"10.1016/j.spasta.2025.100908","DOIUrl":null,"url":null,"abstract":"<div><div>Transfer learning is a learning process that applies models learned in old domains to new domains by utilizing similarities between data, tasks, or models. At present, transfer learning has been widely applied, such as natural language processing, recommendation systems, drug analysis, etc. Research in statistical models mostly focuses on classic linear models such as classification and regression. It is still unclear how transfer learning affects spatial data. Spatial data is an important type of data and has been a hot research topic in statistics and econometrics in recent years. However, in reality, its collection and labeling are expensive and labor-intensive, and there may not be enough data to train a robust model. Therefore, this article considers using auxiliary sample sets that are different from the target dataset but have some similarity to help us estimate and predict the target model, and specifies criteria for determining similarity. We propose transfer learning algorithms based on spatial autoregressive models, which can transfer knowledge from auxiliary datasets to target models of interest to us. Its performance has been demonstrated in numerical simulations and real housing price datasets.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"68 ","pages":"Article 100908"},"PeriodicalIF":2.1000,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211675325000302","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Transfer learning is a learning process that applies models learned in old domains to new domains by utilizing similarities between data, tasks, or models. At present, transfer learning has been widely applied, such as natural language processing, recommendation systems, drug analysis, etc. Research in statistical models mostly focuses on classic linear models such as classification and regression. It is still unclear how transfer learning affects spatial data. Spatial data is an important type of data and has been a hot research topic in statistics and econometrics in recent years. However, in reality, its collection and labeling are expensive and labor-intensive, and there may not be enough data to train a robust model. Therefore, this article considers using auxiliary sample sets that are different from the target dataset but have some similarity to help us estimate and predict the target model, and specifies criteria for determining similarity. We propose transfer learning algorithms based on spatial autoregressive models, which can transfer knowledge from auxiliary datasets to target models of interest to us. Its performance has been demonstrated in numerical simulations and real housing price datasets.
期刊介绍:
Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication.
Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.