{"title":"Regularization Parameter Selection via Cross-Validation in the Presence of Dependent Regressors: A Simulation Study","authors":"Yoshimasa Uematsu, Shinya Tanaka","doi":"10.2139/ssrn.2700945","DOIUrl":null,"url":null,"abstract":"This letter reveals using simulation studies that regularization parameter selection via cross-validation (CV) in penalized regressions (e.g., Lasso) is valid even if the regressors are weakly dependent. In CV procedure, the time series structure of the data set is broken, meaning that there may occur a fatal problem unless the sample is i.i.d.; the estimation accuracy in the training step could be worse due to corruption of data continuity, which may furthermore lead to a bad choice of the regularization parameter. Even in such a situation, we find that CV works well as long as the sample size grows. These findings encourage us to apply the selection procedure via CV to macroeconomic empirical analyses with dependent regressors.","PeriodicalId":364869,"journal":{"name":"ERN: Simulation Methods (Topic)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERN: Simulation Methods (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.2700945","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This letter reveals using simulation studies that regularization parameter selection via cross-validation (CV) in penalized regressions (e.g., Lasso) is valid even if the regressors are weakly dependent. In CV procedure, the time series structure of the data set is broken, meaning that there may occur a fatal problem unless the sample is i.i.d.; the estimation accuracy in the training step could be worse due to corruption of data continuity, which may furthermore lead to a bad choice of the regularization parameter. Even in such a situation, we find that CV works well as long as the sample size grows. These findings encourage us to apply the selection procedure via CV to macroeconomic empirical analyses with dependent regressors.