{"title":"高维鲁棒回归的迁移学习","authors":"Xiaohui Yuan, Shujie Ren","doi":"arxiv-2406.17567","DOIUrl":null,"url":null,"abstract":"Transfer learning has become an essential technique for utilizing information\nfrom source datasets to improve the performance of the target task. However, in\nthe context of high-dimensional data, heterogeneity arises due to\nheteroscedastic variance or inhomogeneous covariate effects. To solve this\nproblem, this paper proposes a robust transfer learning based on the Huber\nregression, specifically designed for scenarios where the transferable source\ndata set is known. This method effectively mitigates the impact of data\nheteroscedasticity, leading to improvements in estimation and prediction\naccuracy. Moreover, when the transferable source data set is unknown, the paper\nintroduces an efficient detection algorithm to identify informative sources.\nThe effectiveness of the proposed method is proved through numerical simulation\nand empirical analysis using superconductor data.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":"60 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Transfer Learning for High Dimensional Robust Regression\",\"authors\":\"Xiaohui Yuan, Shujie Ren\",\"doi\":\"arxiv-2406.17567\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Transfer learning has become an essential technique for utilizing information\\nfrom source datasets to improve the performance of the target task. However, in\\nthe context of high-dimensional data, heterogeneity arises due to\\nheteroscedastic variance or inhomogeneous covariate effects. To solve this\\nproblem, this paper proposes a robust transfer learning based on the Huber\\nregression, specifically designed for scenarios where the transferable source\\ndata set is known. This method effectively mitigates the impact of data\\nheteroscedasticity, leading to improvements in estimation and prediction\\naccuracy. Moreover, when the transferable source data set is unknown, the paper\\nintroduces an efficient detection algorithm to identify informative sources.\\nThe effectiveness of the proposed method is proved through numerical simulation\\nand empirical analysis using superconductor data.\",\"PeriodicalId\":501323,\"journal\":{\"name\":\"arXiv - STAT - Other Statistics\",\"volume\":\"60 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Other Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2406.17567\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Other Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.17567","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Transfer Learning for High Dimensional Robust Regression
Transfer learning has become an essential technique for utilizing information
from source datasets to improve the performance of the target task. However, in
the context of high-dimensional data, heterogeneity arises due to
heteroscedastic variance or inhomogeneous covariate effects. To solve this
problem, this paper proposes a robust transfer learning based on the Huber
regression, specifically designed for scenarios where the transferable source
data set is known. This method effectively mitigates the impact of data
heteroscedasticity, leading to improvements in estimation and prediction
accuracy. Moreover, when the transferable source data set is unknown, the paper
introduces an efficient detection algorithm to identify informative sources.
The effectiveness of the proposed method is proved through numerical simulation
and empirical analysis using superconductor data.