Another Approach to Build Lyapunov Functions for the First Order Methods in the Quadratic Case

IF 0.7 4区数学 Q3 MATHEMATICS, APPLIED

Computational Mathematics and Mathematical Physics Pub Date : 2024-06-07 DOI:10.1134/s0965542524700131

D. M. Merkulov, I. V. Oseledets

{"title":"Another Approach to Build Lyapunov Functions for the First Order Methods in the Quadratic Case","authors":"D. M. Merkulov, I. V. Oseledets","doi":"10.1134/s0965542524700131","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>Lyapunov functions play a fundamental role in analyzing the stability and convergence properties of optimization methods. In this paper, we propose a novel and straightforward approach for constructing Lyapunov functions for first-order methods applied to quadratic functions. Our approach involves bringing the iteration matrix to an upper triangular form using Schur decomposition, then examining the value of the last coordinate of the state vector. This value is multiplied by a magnitude smaller than one at each iteration. Consequently, this value should decrease at each iteration, provided that the method converges. We rigorously prove the suitability of this Lyapunov function for all first-order methods and derive the necessary conditions for the proposed function to decrease monotonically. Experiments conducted with general convex functions are also presented, alongside a study on the limitations of the proposed approach. Remarkably, the newly discovered L-yapunov function is straightforward and does not explicitly depend on the exact method formulation or function characteristics like strong convexity or smoothness constants. In essence, a single expression serves as a Lyapunov function for several methods, including Heavy Ball, Nesterov Accelerated Gradient, and Triple Momentum, among others. To the best of our knowledge, this approach has not been previously reported in the literature.</p>","PeriodicalId":55230,"journal":{"name":"Computational Mathematics and Mathematical Physics","volume":"49 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2024-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Mathematics and Mathematical Physics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1134/s0965542524700131","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}

引用次数: 0

Abstract

Lyapunov functions play a fundamental role in analyzing the stability and convergence properties of optimization methods. In this paper, we propose a novel and straightforward approach for constructing Lyapunov functions for first-order methods applied to quadratic functions. Our approach involves bringing the iteration matrix to an upper triangular form using Schur decomposition, then examining the value of the last coordinate of the state vector. This value is multiplied by a magnitude smaller than one at each iteration. Consequently, this value should decrease at each iteration, provided that the method converges. We rigorously prove the suitability of this Lyapunov function for all first-order methods and derive the necessary conditions for the proposed function to decrease monotonically. Experiments conducted with general convex functions are also presented, alongside a study on the limitations of the proposed approach. Remarkably, the newly discovered L-yapunov function is straightforward and does not explicitly depend on the exact method formulation or function characteristics like strong convexity or smoothness constants. In essence, a single expression serves as a Lyapunov function for several methods, including Heavy Ball, Nesterov Accelerated Gradient, and Triple Momentum, among others. To the best of our knowledge, this approach has not been previously reported in the literature.

Abstract Image

查看原文本刊更多论文

在二次情况下为一阶方法建立 Lyapunov 函数的另一种方法

摘要李雅普诺夫函数在分析优化方法的稳定性和收敛性方面起着重要作用。在本文中，我们提出了一种新颖而直接的方法，用于构建适用于二次函数的一阶方法的 Lyapunov 函数。我们的方法包括利用舒尔分解将迭代矩阵转化为上三角形式，然后检查状态向量最后一个坐标的值。每次迭代时，该值都会乘以一个小于 1 的量级。因此，只要方法收敛，该值在每次迭代时都会减小。我们严格证明了这个 Lyapunov 函数适用于所有一阶方法，并推导出了建议函数单调递减的必要条件。我们还介绍了用一般凸函数进行的实验，以及对所提方法局限性的研究。值得注意的是，新发现的 L-yapunov 函数简单明了，并不明确依赖于确切的方法表述或函数特征，如强凸性或平滑常数。从本质上讲，一个表达式就可以作为多种方法的 Lyapunov 函数，包括重球、涅斯捷罗夫加速梯度和三重动量等。据我们所知，这种方法以前从未在文献中报道过。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computational Mathematics and Mathematical Physics MATHEMATICS, APPLIED-PHYSICS, MATHEMATICAL

CiteScore

1.50

自引率

14.30%

发文量

125

审稿时长

4-8 weeks

期刊介绍： Computational Mathematics and Mathematical Physics is a monthly journal published in collaboration with the Russian Academy of Sciences. The journal includes reviews and original papers on computational mathematics, computational methods of mathematical physics, informatics, and other mathematical sciences. The journal welcomes reviews and original articles from all countries in the English or Russian language.