{"title":"Variable selection in convex nonparametric least squares via structured Lasso: An application to the Swedish electricity market","authors":"Zhiqiang Liao","doi":"arxiv-2409.01911","DOIUrl":null,"url":null,"abstract":"We study the problem of variable selection in convex nonparametric least\nsquares (CNLS). Whereas the least absolute shrinkage and selection operator\n(Lasso) is a popular technique for least squares, its variable selection\nperformance is unknown in CNLS problems. In this work, we investigate the\nperformance of the Lasso CNLS estimator and find out it is usually unable to\nselect variables efficiently. Exploiting the unique structure of the\nsubgradients in CNLS, we develop a structured Lasso by combining $\\ell_1$-norm\nand $\\ell_{\\infty}$-norm. To improve its predictive performance, we propose a\nrelaxed version of the structured Lasso where we can control the two\neffects--variable selection and model shrinkage--using an additional tuning\nparameter. A Monte Carlo study is implemented to verify the finite sample\nperformances of the proposed approaches. In the application of Swedish\nelectricity distribution networks, when the regression model is assumed to be\nsemi-nonparametric, our methods are extended to the doubly penalized CNLS\nestimators. The results from the simulation and application confirm that the\nproposed structured Lasso performs favorably, generally leading to sparser and\nmore accurate predictive models, relative to the other variable selection\nmethods in the literature.","PeriodicalId":501293,"journal":{"name":"arXiv - ECON - Econometrics","volume":"141 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - ECON - Econometrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We study the problem of variable selection in convex nonparametric least
squares (CNLS). Whereas the least absolute shrinkage and selection operator
(Lasso) is a popular technique for least squares, its variable selection
performance is unknown in CNLS problems. In this work, we investigate the
performance of the Lasso CNLS estimator and find out it is usually unable to
select variables efficiently. Exploiting the unique structure of the
subgradients in CNLS, we develop a structured Lasso by combining $\ell_1$-norm
and $\ell_{\infty}$-norm. To improve its predictive performance, we propose a
relaxed version of the structured Lasso where we can control the two
effects--variable selection and model shrinkage--using an additional tuning
parameter. A Monte Carlo study is implemented to verify the finite sample
performances of the proposed approaches. In the application of Swedish
electricity distribution networks, when the regression model is assumed to be
semi-nonparametric, our methods are extended to the doubly penalized CNLS
estimators. The results from the simulation and application confirm that the
proposed structured Lasso performs favorably, generally leading to sparser and
more accurate predictive models, relative to the other variable selection
methods in the literature.