稀疏高维线性回归。估计平方误差和相变

D. Gamarnik, Ilias Zadik
{"title":"稀疏高维线性回归。估计平方误差和相变","authors":"D. Gamarnik, Ilias Zadik","doi":"10.1214/21-aos2130","DOIUrl":null,"url":null,"abstract":"We consider a sparse high dimensional regression model where the goal is to recover a k-sparse unknown binary vector β∗ from n noisy linear observations of the form Y = Xβ∗+W ∈ R where X ∈ Rn×p has i.i.d. N(0, 1) entries and W ∈ R has i.i.d. N(0, σ) entries. In the high signal-to-noise ratio regime and sublinear sparsity regime, while the order of the sample size needed to recover the unknown vector information-theoretially is known to be n∗ := 2k log p/ log(k/σ + 1), no polynomial-time algorithm is known to succeed unless n > nalg := (2k + σ) log p. In this work, we offer a series of results investigating multiple computational and statistical aspects of the recovery task in the regime n ∈ [n∗, nalg]. First, we establish a novel information-theoretic property of the MLE of the problem happening around n = n∗ samples, which we coin as an “all-or-nothing behavior”: when n > n∗ it recovers almost perfectly the support of β∗, while if n < n∗ it fails to recover any fraction of it correctly. Second, at an attempt to understand the computational hardness in the regime n ∈ [n∗, nalg] we prove that at order nalg samples there is an Overlap Gap Property (OGP) phase transition occurring at the landscape of the MLE: for constants c, C > 0 when n < cnalg OGP appears in the landscape of MLE while if n > Cnalg OGP disappears. OGP is a geometric “disconnectivity” property which initially appeared in the theory of spin glasses and is known to suggest algorithmic hardness when it occurs. Finally, using certain technical results obtained to establish the OGP phase transition, we additionally establish various novel positive and negative algorithmic results for the recovery task of interest, including the failure of LASSO with access to n < cnalg samples and the success of a simple Local Search method with access to n > Cnalg samples.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"2000 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Sparse high-dimensional linear regression. Estimating squared error and a phase transition\",\"authors\":\"D. Gamarnik, Ilias Zadik\",\"doi\":\"10.1214/21-aos2130\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider a sparse high dimensional regression model where the goal is to recover a k-sparse unknown binary vector β∗ from n noisy linear observations of the form Y = Xβ∗+W ∈ R where X ∈ Rn×p has i.i.d. N(0, 1) entries and W ∈ R has i.i.d. N(0, σ) entries. In the high signal-to-noise ratio regime and sublinear sparsity regime, while the order of the sample size needed to recover the unknown vector information-theoretially is known to be n∗ := 2k log p/ log(k/σ + 1), no polynomial-time algorithm is known to succeed unless n > nalg := (2k + σ) log p. In this work, we offer a series of results investigating multiple computational and statistical aspects of the recovery task in the regime n ∈ [n∗, nalg]. First, we establish a novel information-theoretic property of the MLE of the problem happening around n = n∗ samples, which we coin as an “all-or-nothing behavior”: when n > n∗ it recovers almost perfectly the support of β∗, while if n < n∗ it fails to recover any fraction of it correctly. Second, at an attempt to understand the computational hardness in the regime n ∈ [n∗, nalg] we prove that at order nalg samples there is an Overlap Gap Property (OGP) phase transition occurring at the landscape of the MLE: for constants c, C > 0 when n < cnalg OGP appears in the landscape of MLE while if n > Cnalg OGP disappears. OGP is a geometric “disconnectivity” property which initially appeared in the theory of spin glasses and is known to suggest algorithmic hardness when it occurs. Finally, using certain technical results obtained to establish the OGP phase transition, we additionally establish various novel positive and negative algorithmic results for the recovery task of interest, including the failure of LASSO with access to n < cnalg samples and the success of a simple Local Search method with access to n > Cnalg samples.\",\"PeriodicalId\":22375,\"journal\":{\"name\":\"The Annals of Statistics\",\"volume\":\"2000 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Annals of Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1214/21-aos2130\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Annals of Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1214/21-aos2130","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

我们考虑一个稀疏的高维回归模型,其目标是从n个形式为Y = Xβ∗+W∈R的噪声线性观测中恢复k稀疏未知二进制向量β∗,其中X∈Rn×p有i.i.d.n(0,1)个条目,W∈R有i.i.d.n (0, σ)个条目。在高信噪比政权和次线性稀疏的政权,而恢复所需的样本大小的顺序已知未知向量information-theoretially n∗:= 2 k日志p /日志(k /σ+ 1),没有多项式时间算法被成功除非n > nalg:日志p = (2 k +σ)。在这工作,我们提供一系列的结果调查多个计算和统计方面的恢复任务的政权n∈(n∗,nalg)。首先,我们建立了发生在n = n∗样本周围的问题的MLE的一个新的信息理论性质,我们称之为“全有或全无行为”:当n > n∗时,它几乎完美地恢复β∗的支持,而如果n < n∗,它不能正确地恢复它的任何部分。其次,在试图理解n∈[n∗,nalg]区域的计算硬度时,我们证明了在nalg阶样本中,在MLE的横向上发生重叠间隙属性(OGP)相变:对于常数c, c > 0,当n < cnalg OGP出现在MLE的横向上,而如果n > cnalg OGP消失。OGP是一种几何“不连通性”性质,最初出现在自旋玻璃理论中,当它发生时,已知表明算法硬度。最后,利用获得的某些技术结果来建立OGP相变,我们还为感兴趣的恢复任务建立了各种新的正负算法结果,包括访问n < cnalg样本的LASSO失败以及访问n > cnalg样本的简单局部搜索方法的成功。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Sparse high-dimensional linear regression. Estimating squared error and a phase transition
We consider a sparse high dimensional regression model where the goal is to recover a k-sparse unknown binary vector β∗ from n noisy linear observations of the form Y = Xβ∗+W ∈ R where X ∈ Rn×p has i.i.d. N(0, 1) entries and W ∈ R has i.i.d. N(0, σ) entries. In the high signal-to-noise ratio regime and sublinear sparsity regime, while the order of the sample size needed to recover the unknown vector information-theoretially is known to be n∗ := 2k log p/ log(k/σ + 1), no polynomial-time algorithm is known to succeed unless n > nalg := (2k + σ) log p. In this work, we offer a series of results investigating multiple computational and statistical aspects of the recovery task in the regime n ∈ [n∗, nalg]. First, we establish a novel information-theoretic property of the MLE of the problem happening around n = n∗ samples, which we coin as an “all-or-nothing behavior”: when n > n∗ it recovers almost perfectly the support of β∗, while if n < n∗ it fails to recover any fraction of it correctly. Second, at an attempt to understand the computational hardness in the regime n ∈ [n∗, nalg] we prove that at order nalg samples there is an Overlap Gap Property (OGP) phase transition occurring at the landscape of the MLE: for constants c, C > 0 when n < cnalg OGP appears in the landscape of MLE while if n > Cnalg OGP disappears. OGP is a geometric “disconnectivity” property which initially appeared in the theory of spin glasses and is known to suggest algorithmic hardness when it occurs. Finally, using certain technical results obtained to establish the OGP phase transition, we additionally establish various novel positive and negative algorithmic results for the recovery task of interest, including the failure of LASSO with access to n < cnalg samples and the success of a simple Local Search method with access to n > Cnalg samples.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信