Variance-reduced first-order methods for deterministically constrained stochastic nonconvex optimization with strong convergence guarantees

Zhaosong Lu, Sanyou Mei, Yifeng Xiao
{"title":"Variance-reduced first-order methods for deterministically constrained stochastic nonconvex optimization with strong convergence guarantees","authors":"Zhaosong Lu, Sanyou Mei, Yifeng Xiao","doi":"arxiv-2409.09906","DOIUrl":null,"url":null,"abstract":"In this paper, we study a class of deterministically constrained stochastic\noptimization problems. Existing methods typically aim to find an\n$\\epsilon$-stochastic stationary point, where the expected violations of both\nthe constraints and first-order stationarity are within a prescribed accuracy\nof $\\epsilon$. However, in many practical applications, it is crucial that the\nconstraints be nearly satisfied with certainty, making such an\n$\\epsilon$-stochastic stationary point potentially undesirable due to the risk\nof significant constraint violations. To address this issue, we propose\nsingle-loop variance-reduced stochastic first-order methods, where the\nstochastic gradient of the stochastic component is computed using either a\ntruncated recursive momentum scheme or a truncated Polyak momentum scheme for\nvariance reduction, while the gradient of the deterministic component is\ncomputed exactly. Under the error bound condition with a parameter $\\theta \\geq\n1$ and other suitable assumptions, we establish that the proposed methods\nachieve a sample complexity and first-order operation complexity of $\\widetilde\nO(\\epsilon^{-\\max\\{4, 2\\theta\\}})$ for finding a stronger $\\epsilon$-stochastic\nstationary point, where the constraint violation is within $\\epsilon$ with\ncertainty, and the expected violation of first-order stationarity is within\n$\\epsilon$. To the best of our knowledge, this is the first work to develop\nmethods with provable complexity guarantees for finding an approximate\nstochastic stationary point of such problems that nearly satisfies all\nconstraints with certainty.","PeriodicalId":501340,"journal":{"name":"arXiv - STAT - Machine Learning","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09906","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, we study a class of deterministically constrained stochastic optimization problems. Existing methods typically aim to find an $\epsilon$-stochastic stationary point, where the expected violations of both the constraints and first-order stationarity are within a prescribed accuracy of $\epsilon$. However, in many practical applications, it is crucial that the constraints be nearly satisfied with certainty, making such an $\epsilon$-stochastic stationary point potentially undesirable due to the risk of significant constraint violations. To address this issue, we propose single-loop variance-reduced stochastic first-order methods, where the stochastic gradient of the stochastic component is computed using either a truncated recursive momentum scheme or a truncated Polyak momentum scheme for variance reduction, while the gradient of the deterministic component is computed exactly. Under the error bound condition with a parameter $\theta \geq 1$ and other suitable assumptions, we establish that the proposed methods achieve a sample complexity and first-order operation complexity of $\widetilde O(\epsilon^{-\max\{4, 2\theta\}})$ for finding a stronger $\epsilon$-stochastic stationary point, where the constraint violation is within $\epsilon$ with certainty, and the expected violation of first-order stationarity is within $\epsilon$. To the best of our knowledge, this is the first work to develop methods with provable complexity guarantees for finding an approximate stochastic stationary point of such problems that nearly satisfies all constraints with certainty.
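To make the variance-reduction step concrete, below is a minimal sketch of one update of the gradient estimator for the stochastic component, assuming a STORM-style recursive momentum recursion followed by truncation onto a Euclidean ball. The function name, the projection-based truncation rule, and the parameters `alpha` and `radius` are illustrative assumptions for exposition, not the paper's exact scheme; the gradient of the deterministic component would be computed exactly and combined with this estimator outside the sketch.

```python
import numpy as np

def truncated_recursive_momentum(grad_new, grad_old, g_prev, alpha, radius):
    """One update of a recursive-momentum (STORM-style) gradient estimator
    with truncation; a hedged sketch of the variance-reduction idea only.

    grad_new : stochastic gradient of the stochastic component at the
               current iterate, using the freshly drawn sample
    grad_old : stochastic gradient at the previous iterate, using the
               SAME sample (needed for the recursive-momentum correction)
    g_prev   : estimator from the previous iteration
    alpha    : momentum parameter in (0, 1]
    radius   : truncation radius (Euclidean-ball projection assumed)
    """
    # Recursive momentum: fresh gradient plus a correction of the old estimate.
    g = grad_new + (1.0 - alpha) * (g_prev - grad_old)
    # Truncation: project onto the ball of the given radius to keep the
    # estimator bounded.
    norm = np.linalg.norm(g)
    if norm > radius:
        g = (radius / norm) * g
    return g


# Illustrative usage on a toy component f(x; xi) = 0.5 * ||A x - b||^2,
# where (A, b) plays the role of one sampled datum.
rng = np.random.default_rng(0)
x_prev, x_curr = rng.standard_normal(5), rng.standard_normal(5)
A, b = rng.standard_normal((3, 5)), rng.standard_normal(3)
grad = lambda x: A.T @ (A @ x - b)
g_prev = grad(x_prev)  # e.g. a plain stochastic gradient as the initial estimate
g_curr = truncated_recursive_momentum(grad(x_curr), grad(x_prev), g_prev,
                                      alpha=0.1, radius=10.0)
```

The truncation keeps the estimator uniformly bounded, which is what allows the constraint violation to be controlled with certainty rather than only in expectation; the Polyak-momentum variant mentioned in the abstract would replace the recursive correction with an averaged update, under the same truncation.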