{"title":"Perseus: a simple and optimal high-order method for variational inequalities","authors":"Tianyi Lin, Michael I. Jordan","doi":"10.1007/s10107-024-02075-2","DOIUrl":null,"url":null,"abstract":"<p>This paper settles an open and challenging question pertaining to the design of simple and optimal high-order methods for solving smooth and monotone variational inequalities (VIs). A VI involves finding <span>\\(x^\\star \\in {\\mathcal {X}}\\)</span> such that <span>\\(\\langle F(x), x - x^\\star \\rangle \\ge 0\\)</span> for all <span>\\(x \\in {\\mathcal {X}}\\)</span>. We consider the setting in which <span>\\(F: {\\mathbb {R}}^d \\rightarrow {\\mathbb {R}}^d\\)</span> is smooth with up to <span>\\((p-1)^{\\text {th}}\\)</span>-order derivatives. For <span>\\(p = 2\\)</span>, the cubic regularization of Newton’s method has been extended to VIs with a global rate of <span>\\(O(\\epsilon ^{-1})\\)</span> (Nesterov in Cubic regularization of Newton’s method for convex problems with constraints, Tech. rep., Université catholique de Louvain, Center for Operations Research and Econometrics (CORE), 2006). An improved rate of <span>\\(O(\\epsilon ^{-2/3}\\log \\log (1/\\epsilon ))\\)</span> can be obtained via an alternative second-order method, but this method requires a nontrivial line-search procedure as an inner loop. Similarly, the existing high-order methods based on line-search procedures have been shown to achieve a rate of <span>\\(O(\\epsilon ^{-2/(p+1)}\\log \\log (1/\\epsilon ))\\)</span> (Bullins and Lai in SIAM J Optim 32(3):2208–2229, 2022; Jiang and Mokhtari in Generalized optimistic methods for convex–concave saddle point problems, 2022; Lin and Jordan in Math Oper Res 48(4):2353–2382, 2023). As emphasized by Nesterov (Lectures on convex optimization, vol 137, Springer, Berlin, 2018), however, such procedures do not necessarily imply the practical applicability in large-scale applications, and it is desirable to complement these results with a simple high-order VI method that retains the optimality of the more complex methods. We propose a <span>\\(p^{\\text {th}}\\)</span>-order method that does <i>not</i> require any line search procedure and provably converges to a weak solution at a rate of <span>\\(O(\\epsilon ^{-2/(p+1)})\\)</span>. We prove that our <span>\\(p^{\\text {th}}\\)</span>-order method is optimal in the monotone setting by establishing a lower bound of <span>\\(\\Omega (\\epsilon ^{-2/(p+1)})\\)</span> under a generalized linear span assumption. A restarted version of our <span>\\(p^{\\text {th}}\\)</span>-order method attains a linear rate for smooth and <span>\\(p^{\\text {th}}\\)</span>-order uniformly monotone VIs and another restarted version of our <span>\\(p^{\\text {th}}\\)</span>-order method attains a local superlinear rate for smooth and strongly monotone VIs. Further, the similar <span>\\(p^{\\text {th}}\\)</span>-order method achieves a global rate of <span>\\(O(\\epsilon ^{-2/p})\\)</span> for solving smooth and nonmonotone VIs satisfying the Minty condition. Two restarted versions attain a global linear rate under additional <span>\\(p^{\\text {th}}\\)</span>-order uniform Minty condition and a local superlinear rate under additional strong Minty condition.</p>","PeriodicalId":18297,"journal":{"name":"Mathematical Programming","volume":"16 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Programming","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10107-024-02075-2","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
This paper settles an open and challenging question pertaining to the design of simple and optimal high-order methods for solving smooth and monotone variational inequalities (VIs). A VI involves finding \(x^\star \in {\mathcal {X}}\) such that \(\langle F(x), x - x^\star \rangle \ge 0\) for all \(x \in {\mathcal {X}}\). We consider the setting in which \(F: {\mathbb {R}}^d \rightarrow {\mathbb {R}}^d\) is smooth with up to \((p-1)^{\text {th}}\)-order derivatives. For \(p = 2\), the cubic regularization of Newton’s method has been extended to VIs with a global rate of \(O(\epsilon ^{-1})\) (Nesterov in Cubic regularization of Newton’s method for convex problems with constraints, Tech. rep., Université catholique de Louvain, Center for Operations Research and Econometrics (CORE), 2006). An improved rate of \(O(\epsilon ^{-2/3}\log \log (1/\epsilon ))\) can be obtained via an alternative second-order method, but this method requires a nontrivial line-search procedure as an inner loop. Similarly, the existing high-order methods based on line-search procedures have been shown to achieve a rate of \(O(\epsilon ^{-2/(p+1)}\log \log (1/\epsilon ))\) (Bullins and Lai in SIAM J Optim 32(3):2208–2229, 2022; Jiang and Mokhtari in Generalized optimistic methods for convex–concave saddle point problems, 2022; Lin and Jordan in Math Oper Res 48(4):2353–2382, 2023). As emphasized by Nesterov (Lectures on convex optimization, vol 137, Springer, Berlin, 2018), however, such procedures do not necessarily imply the practical applicability in large-scale applications, and it is desirable to complement these results with a simple high-order VI method that retains the optimality of the more complex methods. We propose a \(p^{\text {th}}\)-order method that does not require any line search procedure and provably converges to a weak solution at a rate of \(O(\epsilon ^{-2/(p+1)})\). We prove that our \(p^{\text {th}}\)-order method is optimal in the monotone setting by establishing a lower bound of \(\Omega (\epsilon ^{-2/(p+1)})\) under a generalized linear span assumption. A restarted version of our \(p^{\text {th}}\)-order method attains a linear rate for smooth and \(p^{\text {th}}\)-order uniformly monotone VIs and another restarted version of our \(p^{\text {th}}\)-order method attains a local superlinear rate for smooth and strongly monotone VIs. Further, the similar \(p^{\text {th}}\)-order method achieves a global rate of \(O(\epsilon ^{-2/p})\) for solving smooth and nonmonotone VIs satisfying the Minty condition. Two restarted versions attain a global linear rate under additional \(p^{\text {th}}\)-order uniform Minty condition and a local superlinear rate under additional strong Minty condition.
期刊介绍:
Mathematical Programming publishes original articles dealing with every aspect of mathematical optimization; that is, everything of direct or indirect use concerning the problem of optimizing a function of many variables, often subject to a set of constraints. This involves theoretical and computational issues as well as application studies. Included, along with the standard topics of linear, nonlinear, integer, conic, stochastic and combinatorial optimization, are techniques for formulating and applying mathematical programming models, convex, nonsmooth and variational analysis, the theory of polyhedra, variational inequalities, and control and game theory viewed from the perspective of mathematical programming.