一阶方法的频域表示:一个简单而稳健的分析框架

Ioannis Anagnostides, Ioannis Panageas
{"title":"一阶方法的频域表示:一个简单而稳健的分析框架","authors":"Ioannis Anagnostides, Ioannis Panageas","doi":"10.1137/1.9781611977066.10","DOIUrl":null,"url":null,"abstract":"Motivated by recent applications in min-max optimization, we employ tools from nonlinear control theory in order to analyze a class of\"historical\"gradient-based methods, for which the next step lies in the span of the previously observed gradients within a time horizon. Specifically, we leverage techniques developed by Hu and Lessard (2017) to build a frequency-domain framework which reduces the analysis of such methods to numerically-solvable algebraic tasks, establishing linear convergence under a class of strongly monotone and co-coercive operators. On the applications' side, we focus on the Optimistic Gradient Descent (OGD) method, which augments the standard Gradient Descent with an additional past-gradient in the optimization step. The proposed framework leads to a simple and sharp analysis of OGD -- and generalizations thereof -- under a much broader regime of parameters. Notably, this characterization directly extends under adversarial noise in the observed value of the gradient. Moreover, our frequency-domain framework provides an exact quantitative comparison between simultaneous and alternating updates of OGD. An interesting byproduct is that OGD -- and variants thereof -- is an instance of PID control, arguably one of the most influential algorithms of the last century; this observation sheds more light to the stabilizing properties of\"optimism\".","PeriodicalId":93491,"journal":{"name":"Proceedings of the SIAM Symposium on Simplicity in Algorithms (SOSA)","volume":"38 1","pages":"131-160"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Frequency-Domain Representation of First-Order Methods: A Simple and Robust Framework of Analysis\",\"authors\":\"Ioannis Anagnostides, Ioannis Panageas\",\"doi\":\"10.1137/1.9781611977066.10\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Motivated by recent applications in min-max optimization, we employ tools from nonlinear control theory in order to analyze a class of\\\"historical\\\"gradient-based methods, for which the next step lies in the span of the previously observed gradients within a time horizon. Specifically, we leverage techniques developed by Hu and Lessard (2017) to build a frequency-domain framework which reduces the analysis of such methods to numerically-solvable algebraic tasks, establishing linear convergence under a class of strongly monotone and co-coercive operators. On the applications' side, we focus on the Optimistic Gradient Descent (OGD) method, which augments the standard Gradient Descent with an additional past-gradient in the optimization step. The proposed framework leads to a simple and sharp analysis of OGD -- and generalizations thereof -- under a much broader regime of parameters. Notably, this characterization directly extends under adversarial noise in the observed value of the gradient. Moreover, our frequency-domain framework provides an exact quantitative comparison between simultaneous and alternating updates of OGD. An interesting byproduct is that OGD -- and variants thereof -- is an instance of PID control, arguably one of the most influential algorithms of the last century; this observation sheds more light to the stabilizing properties of\\\"optimism\\\".\",\"PeriodicalId\":93491,\"journal\":{\"name\":\"Proceedings of the SIAM Symposium on Simplicity in Algorithms (SOSA)\",\"volume\":\"38 1\",\"pages\":\"131-160\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the SIAM Symposium on Simplicity in Algorithms (SOSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1137/1.9781611977066.10\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the SIAM Symposium on Simplicity in Algorithms (SOSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1137/1.9781611977066.10","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

受最近在最小-最大优化中的应用的启发,我们使用非线性控制理论的工具来分析一类基于“历史”梯度的方法,其下一步是在一个时间范围内先前观察到的梯度的跨度。具体来说,我们利用Hu和Lessard(2017)开发的技术构建了一个频域框架,该框架将这些方法的分析减少到数值可解的代数任务,在一类强单调和共强制算子下建立线性收敛。在应用方面,我们重点研究了乐观梯度下降(OGD)方法,该方法在优化步骤中增加了一个额外的过去梯度来增强标准梯度下降。所建议的框架导致在更广泛的参数体系下对OGD进行简单而清晰的分析——及其一般化。值得注意的是,这种特性在梯度观测值的对抗噪声下直接扩展。此外,我们的频域框架在OGD的同步更新和交替更新之间提供了精确的定量比较。一个有趣的副产品是,OGD及其变体是PID控制的一个实例,可以说是上个世纪最有影响力的算法之一;这一观察结果进一步揭示了“乐观主义”的稳定特性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Frequency-Domain Representation of First-Order Methods: A Simple and Robust Framework of Analysis
Motivated by recent applications in min-max optimization, we employ tools from nonlinear control theory in order to analyze a class of"historical"gradient-based methods, for which the next step lies in the span of the previously observed gradients within a time horizon. Specifically, we leverage techniques developed by Hu and Lessard (2017) to build a frequency-domain framework which reduces the analysis of such methods to numerically-solvable algebraic tasks, establishing linear convergence under a class of strongly monotone and co-coercive operators. On the applications' side, we focus on the Optimistic Gradient Descent (OGD) method, which augments the standard Gradient Descent with an additional past-gradient in the optimization step. The proposed framework leads to a simple and sharp analysis of OGD -- and generalizations thereof -- under a much broader regime of parameters. Notably, this characterization directly extends under adversarial noise in the observed value of the gradient. Moreover, our frequency-domain framework provides an exact quantitative comparison between simultaneous and alternating updates of OGD. An interesting byproduct is that OGD -- and variants thereof -- is an instance of PID control, arguably one of the most influential algorithms of the last century; this observation sheds more light to the stabilizing properties of"optimism".
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信