LASSO 回归解的数值特性

IF 2.4 2区 数学 Q1 MATHEMATICS, APPLIED
Mayur V. Lakshmi , Joab R. Winkler
{"title":"LASSO 回归解的数值特性","authors":"Mayur V. Lakshmi ,&nbsp;Joab R. Winkler","doi":"10.1016/j.apnum.2024.03.010","DOIUrl":null,"url":null,"abstract":"<div><div>The determination of a concise model of a linear system when there are fewer samples <em>m</em> than predictors <em>n</em> requires the solution of the equation <span><math><mi>A</mi><mi>x</mi><mo>=</mo><mi>b</mi></math></span>, where <span><math><mi>A</mi><mo>∈</mo><msup><mrow><mi>R</mi></mrow><mrow><mi>m</mi><mo>×</mo><mi>n</mi></mrow></msup></math></span> and <span><math><mtext>rank</mtext><mspace></mspace><mi>A</mi><mo>=</mo><mi>m</mi></math></span>, such that the selected solution from the infinite number of solutions is sparse, that is, many of its components are zero. This leads to the minimisation with respect to <em>x</em> of <span><math><mi>f</mi><mo>(</mo><mi>x</mi><mo>,</mo><mi>λ</mi><mo>)</mo><mo>=</mo><msubsup><mrow><mo>‖</mo><mi>A</mi><mi>x</mi><mo>−</mo><mi>b</mi><mo>‖</mo></mrow><mrow><mn>2</mn></mrow><mrow><mn>2</mn></mrow></msubsup><mo>+</mo><mi>λ</mi><msub><mrow><mo>‖</mo><mi>x</mi><mo>‖</mo></mrow><mrow><mn>1</mn></mrow></msub></math></span>, where <em>λ</em> is the regularisation parameter. This problem, which is called LASSO regression, yields a family of functions <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo></math></span> and it is necessary to determine the optimal value of <em>λ</em>, that is, the value of <em>λ</em> that balances the fidelity of the model, <span><math><mrow><mo>‖</mo><mi>A</mi><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo><mo>−</mo><mi>b</mi><mo>‖</mo></mrow><mo>≈</mo><mn>0</mn></math></span>, and the satisfaction of the constraint that <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo></math></span> be sparse. The aim of this paper is an investigation of the numerical properties of <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo></math></span>, and the main conclusion of this investigation is the incompatibility of sparsity and stability, that is, a sparse solution <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo></math></span> that preserves the fidelity of the model exists if the least squares (LS) solution <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>ls</mtext></mrow></msub><mo>=</mo><msup><mrow><mi>A</mi></mrow><mrow><mi>†</mi></mrow></msup><mi>b</mi></math></span> is unstable. Two methods, cross validation and the L-curve, for the computation of the optimal value of <em>λ</em> are compared and it is shown that the L-curve yields significantly better results. This difference between stable and unstable solutions <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>ls</mtext></mrow></msub></math></span> of the LS problem manifests itself in the very different forms of the L-curve for these two solutions. The paper includes examples of stable and unstable solutions <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>ls</mtext></mrow></msub></math></span> that demonstrate the theory.</div></div>","PeriodicalId":8199,"journal":{"name":"Applied Numerical Mathematics","volume":"208 ","pages":"Pages 297-309"},"PeriodicalIF":2.4000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Numerical properties of solutions of LASSO regression\",\"authors\":\"Mayur V. Lakshmi ,&nbsp;Joab R. Winkler\",\"doi\":\"10.1016/j.apnum.2024.03.010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The determination of a concise model of a linear system when there are fewer samples <em>m</em> than predictors <em>n</em> requires the solution of the equation <span><math><mi>A</mi><mi>x</mi><mo>=</mo><mi>b</mi></math></span>, where <span><math><mi>A</mi><mo>∈</mo><msup><mrow><mi>R</mi></mrow><mrow><mi>m</mi><mo>×</mo><mi>n</mi></mrow></msup></math></span> and <span><math><mtext>rank</mtext><mspace></mspace><mi>A</mi><mo>=</mo><mi>m</mi></math></span>, such that the selected solution from the infinite number of solutions is sparse, that is, many of its components are zero. This leads to the minimisation with respect to <em>x</em> of <span><math><mi>f</mi><mo>(</mo><mi>x</mi><mo>,</mo><mi>λ</mi><mo>)</mo><mo>=</mo><msubsup><mrow><mo>‖</mo><mi>A</mi><mi>x</mi><mo>−</mo><mi>b</mi><mo>‖</mo></mrow><mrow><mn>2</mn></mrow><mrow><mn>2</mn></mrow></msubsup><mo>+</mo><mi>λ</mi><msub><mrow><mo>‖</mo><mi>x</mi><mo>‖</mo></mrow><mrow><mn>1</mn></mrow></msub></math></span>, where <em>λ</em> is the regularisation parameter. This problem, which is called LASSO regression, yields a family of functions <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo></math></span> and it is necessary to determine the optimal value of <em>λ</em>, that is, the value of <em>λ</em> that balances the fidelity of the model, <span><math><mrow><mo>‖</mo><mi>A</mi><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo><mo>−</mo><mi>b</mi><mo>‖</mo></mrow><mo>≈</mo><mn>0</mn></math></span>, and the satisfaction of the constraint that <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo></math></span> be sparse. The aim of this paper is an investigation of the numerical properties of <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo></math></span>, and the main conclusion of this investigation is the incompatibility of sparsity and stability, that is, a sparse solution <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>lasso</mtext></mrow></msub><mo>(</mo><mi>λ</mi><mo>)</mo></math></span> that preserves the fidelity of the model exists if the least squares (LS) solution <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>ls</mtext></mrow></msub><mo>=</mo><msup><mrow><mi>A</mi></mrow><mrow><mi>†</mi></mrow></msup><mi>b</mi></math></span> is unstable. Two methods, cross validation and the L-curve, for the computation of the optimal value of <em>λ</em> are compared and it is shown that the L-curve yields significantly better results. This difference between stable and unstable solutions <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>ls</mtext></mrow></msub></math></span> of the LS problem manifests itself in the very different forms of the L-curve for these two solutions. The paper includes examples of stable and unstable solutions <span><math><msub><mrow><mi>x</mi></mrow><mrow><mtext>ls</mtext></mrow></msub></math></span> that demonstrate the theory.</div></div>\",\"PeriodicalId\":8199,\"journal\":{\"name\":\"Applied Numerical Mathematics\",\"volume\":\"208 \",\"pages\":\"Pages 297-309\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Numerical Mathematics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168927424000576\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATHEMATICS, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Numerical Mathematics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168927424000576","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0

摘要

当样本数m小于预测数n时,确定线性系统的简明模型需要方程Ax=b的解,其中a∈Rm×n, rankA=m,使得从无穷多个解中选择的解是稀疏的,即它的许多分量为零。这导致f(x,λ)=‖Ax−b‖22+λ‖x‖1相对于x的最小化,其中λ是正则化参数。这个问题被称为LASSO回归,它产生了一系列函数xlasso(λ),需要确定λ的最优值,即λ的值平衡模型的保真度,‖Axlasso(λ)−b‖≈0,以及xlasso(λ)是稀疏的约束的满足。本文的目的是研究xlasso(λ)的数值性质,研究的主要结论是稀疏性与稳定性的不相容,即当最小二乘(LS)解xls= a†b不稳定时,存在一个保持模型保真度的稀疏解xlasso(λ)。比较了交叉验证法和l曲线法两种计算λ最优值的方法,结果表明,l曲线法的结果明显优于交叉验证法。LS问题的稳定解和不稳定解之间的差异表现在这两个解的l曲线的非常不同的形式上。文中给出了稳定解和不稳定解的实例来证明这一理论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Numerical properties of solutions of LASSO regression
The determination of a concise model of a linear system when there are fewer samples m than predictors n requires the solution of the equation Ax=b, where ARm×n and rankA=m, such that the selected solution from the infinite number of solutions is sparse, that is, many of its components are zero. This leads to the minimisation with respect to x of f(x,λ)=Axb22+λx1, where λ is the regularisation parameter. This problem, which is called LASSO regression, yields a family of functions xlasso(λ) and it is necessary to determine the optimal value of λ, that is, the value of λ that balances the fidelity of the model, Axlasso(λ)b0, and the satisfaction of the constraint that xlasso(λ) be sparse. The aim of this paper is an investigation of the numerical properties of xlasso(λ), and the main conclusion of this investigation is the incompatibility of sparsity and stability, that is, a sparse solution xlasso(λ) that preserves the fidelity of the model exists if the least squares (LS) solution xls=Ab is unstable. Two methods, cross validation and the L-curve, for the computation of the optimal value of λ are compared and it is shown that the L-curve yields significantly better results. This difference between stable and unstable solutions xls of the LS problem manifests itself in the very different forms of the L-curve for these two solutions. The paper includes examples of stable and unstable solutions xls that demonstrate the theory.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Applied Numerical Mathematics
Applied Numerical Mathematics 数学-应用数学
CiteScore
5.60
自引率
7.10%
发文量
225
审稿时长
7.2 months
期刊介绍: The purpose of the journal is to provide a forum for the publication of high quality research and tutorial papers in computational mathematics. In addition to the traditional issues and problems in numerical analysis, the journal also publishes papers describing relevant applications in such fields as physics, fluid dynamics, engineering and other branches of applied science with a computational mathematics component. The journal strives to be flexible in the type of papers it publishes and their format. Equally desirable are: (i) Full papers, which should be complete and relatively self-contained original contributions with an introduction that can be understood by the broad computational mathematics community. Both rigorous and heuristic styles are acceptable. Of particular interest are papers about new areas of research, in which other than strictly mathematical arguments may be important in establishing a basis for further developments. (ii) Tutorial review papers, covering some of the important issues in Numerical Mathematics, Scientific Computing and their Applications. The journal will occasionally publish contributions which are larger than the usual format for regular papers. (iii) Short notes, which present specific new results and techniques in a brief communication.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信