{"title":"Variable selection via penalized ridge regression with error-prone variables","authors":"Li-Pang Chen","doi":"10.1007/s10463-025-00940-1","DOIUrl":null,"url":null,"abstract":"<div><p>Variable selection is a fundamental topic in statistical analysis and data science. Regularization methods have been widely employed to identify informative variables related to the response. However, challenges such as collinearity and measurement error often arise in real-world datasets. In this paper, we address variable selection and estimation for linear models and focus parameters. To simultaneously handle collinearity and measurement error, we propose a valid correction strategy for error-prone continuous, binary, and discrete covariates, and develop a penalized ridge regression method to perform variable selection and estimation. We establish the theoretical properties of the proposed method, including variable selection consistency and asymptotic normality. Numerical studies are conducted to evaluate its performance, and the results demonstrate that the proposed method outperforms existing approaches.</p></div>","PeriodicalId":55511,"journal":{"name":"Annals of the Institute of Statistical Mathematics","volume":"78 2","pages":"225 - 261"},"PeriodicalIF":0.6000,"publicationDate":"2025-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of the Institute of Statistical Mathematics","FirstCategoryId":"100","ListUrlMain":"https://link.springer.com/article/10.1007/s10463-025-00940-1","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0
Abstract
Variable selection is a fundamental topic in statistical analysis and data science. Regularization methods have been widely employed to identify informative variables related to the response. However, challenges such as collinearity and measurement error often arise in real-world datasets. In this paper, we address variable selection and estimation for linear models and focus parameters. To simultaneously handle collinearity and measurement error, we propose a valid correction strategy for error-prone continuous, binary, and discrete covariates, and develop a penalized ridge regression method to perform variable selection and estimation. We establish the theoretical properties of the proposed method, including variable selection consistency and asymptotic normality. Numerical studies are conducted to evaluate its performance, and the results demonstrate that the proposed method outperforms existing approaches.
期刊介绍:
Annals of the Institute of Statistical Mathematics (AISM) aims to provide a forum for open communication among statisticians, and to contribute to the advancement of statistics as a science to enable humans to handle information in order to cope with uncertainties. It publishes high-quality papers that shed new light on the theoretical, computational and/or methodological aspects of statistical science. Emphasis is placed on (a) development of new methodologies motivated by real data, (b) development of unifying theories, and (c) analysis and improvement of existing methodologies and theories.