Foundations of Computational Mathematics最新文献

Simple matrix expressions for the curvatures of Grassmannian 格拉斯曼曲率的简单矩阵表达式

IF 3 1区数学

Foundations of Computational Mathematics Pub Date : 2025-08-27 DOI: 10.1007/s10208-025-09723-9

Zehua Lai, Lek-Heng Lim, Ke Ye

引用次数: 0

Improved global performance guarantees of second-order methods in convex minimization 改进的二阶凸最小化方法的全局性能保证

IF 3 1区数学

Foundations of Computational Mathematics Pub Date : 2025-08-13 DOI: 10.1007/s10208-025-09726-6

Pavel Dvurechensky, Yurii Nesterov

{"title":"Improved global performance guarantees of second-order methods in convex minimization","authors":"Pavel Dvurechensky, Yurii Nesterov","doi":"10.1007/s10208-025-09726-6","DOIUrl":"https://doi.org/10.1007/s10208-025-09726-6","url":null,"abstract":"<p>In this paper, we attempt to compare two distinct branches of research on second-order optimization methods. The first one studies self-concordant functions and barriers, the main assumption being that the third derivative of the objective is bounded by the second derivative. The second branch studies cubic regularized Newton methods (CRNMs) with the main assumption that the second derivative is Lipschitz continuous. We develop a new theoretical analysis for a path-following scheme (PFS) for general self-concordant functions, as opposed to the classical path-following scheme developed for self-concordant barriers. We show that the complexity bound for this scheme is better than that of the Damped Newton Method (DNM) and show that our method has global superlinear convergence. We propose also a new predictor-corrector path-following scheme (PCPFS) that leads to further improvement of constant factors in the complexity guarantees for minimizing general self-concordant functions. We also apply path-following schemes to different classes of constrained optimization problems and obtain the resulting complexity bounds. Finally, we analyze an important subclass of general self-concordant functions, namely a class of strongly convex functions with Lipschitz continuous second derivative, and show that for this subclass CRNMs give even better complexity bounds.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"14 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Doubly Regularized Entropic Wasserstein Barycenter 双重正则化熵Wasserstein质心

IF 3 1区数学

Foundations of Computational Mathematics Pub Date : 2025-08-12 DOI: 10.1007/s10208-025-09724-8

Lénaïc Chizat

{"title":"Doubly Regularized Entropic Wasserstein Barycenter","authors":"Lénaïc Chizat","doi":"10.1007/s10208-025-09724-8","DOIUrl":"https://doi.org/10.1007/s10208-025-09724-8","url":null,"abstract":"<p>We study a general formulation of regularized Wasserstein barycenters that enjoy favorable regularity, approximation, stability and (grid-free) optimization properties. This barycenter is defined as the unique probability measure that minimizes the sum of entropic optimal transport (EOT) costs with respect to a family of given probability measures, plus an entropy term. We denote it the <span><span style=\"\"></span><span style=\"font-size: 100%; display: inline-block;\" tabindex=\"0\"><svg focusable=\"false\" height=\"2.614ex\" role=\"img\" style=\"vertical-align: -0.706ex;\" viewbox=\"0 -821.4 2325.2 1125.3\" width=\"5.4ex\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g fill=\"currentColor\" stroke=\"currentColor\" stroke-width=\"0\" transform=\"matrix(1 0 0 -1 0 0)\"><use x=\"0\" xlink:href=\"#MJMAIN-28\" y=\"0\"></use><use x=\"389\" xlink:href=\"#MJMATHI-3BB\" y=\"0\"></use><use x=\"973\" xlink:href=\"#MJMAIN-2C\" y=\"0\"></use><use x=\"1418\" xlink:href=\"#MJMATHI-3C4\" y=\"0\"></use><use x=\"1935\" xlink:href=\"#MJMAIN-29\" y=\"0\"></use></g></svg></span><script type=\"math/tex\">(lambda ,tau )</script></span>-barycenter, where <span><span style=\"\"></span><span style=\"font-size: 100%; display: inline-block;\" tabindex=\"0\"><svg focusable=\"false\" height=\"2.013ex\" role=\"img\" style=\"vertical-align: -0.205ex;\" viewbox=\"0 -778.3 583.5 866.5\" width=\"1.355ex\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g fill=\"currentColor\" stroke=\"currentColor\" stroke-width=\"0\" transform=\"matrix(1 0 0 -1 0 0)\"><use x=\"0\" xlink:href=\"#MJMATHI-3BB\" y=\"0\"></use></g></svg></span><script type=\"math/tex\">lambda </script></span> is the inner regularization strength and <span><span style=\"\"></span><span style=\"font-size: 100%; display: inline-block;\" tabindex=\"0\"><svg focusable=\"false\" height=\"1.409ex\" role=\"img\" style=\"vertical-align: -0.205ex;\" viewbox=\"0 -518.7 517.5 606.8\" width=\"1.202ex\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g fill=\"currentColor\" stroke=\"currentColor\" stroke-width=\"0\" transform=\"matrix(1 0 0 -1 0 0)\"><use x=\"0\" xlink:href=\"#MJMATHI-3C4\" y=\"0\"></use></g></svg></span><script type=\"math/tex\">tau </script></span> the outer one. This formulation recovers several previously proposed EOT barycenters for various choices of <span><span style=\"\"></span><span style=\"font-size: 100%; display: inline-block;\" tabindex=\"0\"><svg focusable=\"false\" height=\"2.413ex\" role=\"img\" style=\"vertical-align: -0.606ex;\" viewbox=\"0 -778.3 3380.7 1039.1\" width=\"7.852ex\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g fill=\"currentColor\" stroke=\"currentColor\" stroke-width=\"0\" transform=\"matrix(1 0 0 -1 0 0)\"><use x=\"0\" xlink:href=\"#MJMATHI-3BB\" y=\"0\"></use><use x=\"583\" xlink:href=\"#MJMAIN-2C\" y=\"0\"></use><use x=\"1028\" xlink:href=\"#MJMATHI-3C4\" y=\"0\"></use><use x=\"1823\" xlink:href=\"#MJMAIN-2265\" y=\"0\"></use><use x=\"2880\" xlink:href=\"#MJMAIN-30\" y=\"0\"></use></g></svg></span><script type=\"math/tex\">lambda ,tau ge 0</script></span> and generalizes them. First, we show that, as <span><span style=\"\"></span><span style=\"font-size: 100%; di","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"52 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924523","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Algorithms for Mean-Field Variational Inference Via Polyhedral Optimization in the Wasserstein Space 基于Wasserstein空间多面体优化的平均场变分推理算法

IF 3 1区数学

Foundations of Computational Mathematics Pub Date : 2025-08-12 DOI: 10.1007/s10208-025-09721-x

Yiheng Jiang, Sinho Chewi, Aram-Alexandre Pooladian

{"title":"Algorithms for Mean-Field Variational Inference Via Polyhedral Optimization in the Wasserstein Space","authors":"Yiheng Jiang, Sinho Chewi, Aram-Alexandre Pooladian","doi":"10.1007/s10208-025-09721-x","DOIUrl":"https://doi.org/10.1007/s10208-025-09721-x","url":null,"abstract":"<p>We develop a theory of finite-dimensional polyhedral subsets over the Wasserstein space and optimization of functionals over them via first-order methods. Our main application is to the problem of mean-field variational inference (MFVI), which seeks to approximate a distribution <span><span>pi </span><script type=\"math/tex\">pi </script></span> over <span><span>mathbb {R}^d</span><script type=\"math/tex\">mathbb {R}^d</script></span> by a product measure <span><span>pi ^star </span><script type=\"math/tex\">pi ^star </script></span>. When <span><span>pi </span><script type=\"math/tex\">pi </script></span> is strongly log-concave and log-smooth, we provide (1) approximation rates certifying that <span><span>pi ^star </span><script type=\"math/tex\">pi ^star </script></span> is close to the minimizer <span><span>pi ^star _diamond </span><script type=\"math/tex\">pi ^star _diamond </script></span> of the KL divergence over a <i>polyhedral</i> set <span><span>mathcal {P}_diamond </span><script type=\"math/tex\">mathcal {P}_diamond </script></span>, and (2) an algorithm for minimizing <span><span>mathop {textrm{KL}}limits (cdot !;Vert ; !pi )</span><script type=\"math/tex\">mathop {textrm{KL}}limits (cdot !;Vert ; !pi )</script></span> over <span><span>mathcal {P}_diamond </span><script type=\"math/tex\">mathcal {P}_diamond </script></span> based on accelerated gradient descent over <span><span>mathbb {R}^d</span><script type=\"math/tex\">mathbb {R}^d</script></span>. As a byproduct of our analysis, we obtain the first end-to-end analysis for gradient-based algorithms for MFVI.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"13 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Time Splitting and Error Estimates for Nonlinear Schrödinger Equations with a Potential 带电位非线性Schrödinger方程的时间分裂和误差估计

IF 3 1区数学

Foundations of Computational Mathematics Pub Date : 2025-08-12 DOI: 10.1007/s10208-025-09727-5

Rémi Carles

引用次数: 0

A Fisher–Rao Gradient Flow for Entropy-Regularised Markov Decision Processes in Polish Spaces 波兰空间中熵正则马尔可夫决策过程的Fisher-Rao梯度流

IF 3 1区数学

Foundations of Computational Mathematics Pub Date : 2025-08-11 DOI: 10.1007/s10208-025-09729-3

Bekzhan Kerimkulov, James-Michael Leahy, David Siska, Lukasz Szpruch, Yufei Zhang

引用次数: 0

Irreducible Components of Sets of Points in the Plane that Satisfy Distance Conditions 平面上满足距离条件的点集的不可约分量

IF 3 1区数学

Foundations of Computational Mathematics Pub Date : 2025-08-07 DOI: 10.1007/s10208-025-09725-7

Niels Lubbes, Mehdi Makhul, Josef Schicho, Audie Warren

引用次数: 0

Multisymplecticity in Finite Element Exterior Calculus 有限元外演算中的多辛性

IF 3 1区数学

Foundations of Computational Mathematics Pub Date : 2025-07-14 DOI: 10.1007/s10208-025-09720-y

Ari Stern, Enrico Zampa

引用次数: 0

Interlacing Polynomial Method for Matrix Approximation via Generalized Column and Row Selection 基于广义列行选择的矩阵逼近的交错多项式方法

IF 3 1区数学

Foundations of Computational Mathematics Pub Date : 2025-07-14 DOI: 10.1007/s10208-025-09719-5

Jian-Feng Cai, Zhiqiang Xu, Zili Xu

{"title":"Interlacing Polynomial Method for Matrix Approximation via Generalized Column and Row Selection","authors":"Jian-Feng Cai, Zhiqiang Xu, Zili Xu","doi":"10.1007/s10208-025-09719-5","DOIUrl":"https://doi.org/10.1007/s10208-025-09719-5","url":null,"abstract":"<p>This paper delves into the spectral norm aspect of the Generalized Column and Row Subset Selection (GCRSS) problem. Given a target matrix <span><span>textbf{A}in mathbb {R}^{ntimes d}</span><script type=\"math/tex\">textbf{A}in mathbb {R}^{ntimes d}</script></span>, the objective of GCRSS is to select a column submatrix <span><span>textbf{B}_{:,S}in mathbb {R}^{ntimes k}</span><script type=\"math/tex\">textbf{B}_{:,S}in mathbb {R}^{ntimes k}</script></span> from the source matrix <span><span>textbf{B}in mathbb {R}^{ntimes d_B}</span><script type=\"math/tex\">textbf{B}in mathbb {R}^{ntimes d_B}</script></span> and a row submatrix <span><span>textbf{C}_{R,:}in mathbb {R}^{rtimes d}</span><script type=\"math/tex\">textbf{C}_{R,:}in mathbb {R}^{rtimes d}</script></span> from the source matrix <span><span>textbf{C}in mathbb {R}^{n_Ctimes d}</span><script type=\"math/tex\">textbf{C}in mathbb {R}^{n_Ctimes d}</script></span>, such that the residual matrix <span><span>(textbf{I}_n-textbf{B}_{:,S}textbf{B}_{:,S}^{dagger })textbf{A}(textbf{I}_d-textbf{C}_{R,:}^{dagger } textbf{C}_{R,:})</span><script type=\"math/tex\">(textbf{I}_n-textbf{B}_{:,S}textbf{B}_{:,S}^{dagger })textbf{A}(textbf{I}_d-textbf{C}_{R,:}^{dagger } textbf{C}_{R,:})</script></span> has a small spectral norm. By employing the method of interlacing polynomials, we show that the smallest possible spectral norm of a residual matrix can be bounded by the largest root of a related expected characteristic polynomial. A deterministic polynomial time algorithm is provided for the spectral norm case of the GCRSS problem. We next apply our results to two specific GCRSS scenarios, one where <span><span>r=0</span><script type=\"math/tex\">r=0</script></span>, simplifying the problem to the Generalized Column Subset Selection (GCSS) problem, and the other where <span><span>textbf{B}=textbf{C}=textbf{I}_d</span><script type=\"math/tex\">textbf{B}=textbf{C}=textbf{I}_d</script></span>, reducing the problem to the submatrix selection problem. In the GCSS scenario, we connect the expected characteristic polynomials to the convolution of multi-affine polynomials, leading to the derivation of the first provable reconstruction bound on the spectral norm of a residual matrix. In the submatrix selection scenario, we show that for any sufficiently small <span><span>varepsilon >0</span><script type=\"math/tex\">varepsilon >0</script></span> and any square matrix <span><span>textbf{A}in mathbb {R}^{dtimes d}</span><script type=\"math/tex\">textbf{A}in mathbb {R}^{dtimes d}</script></span>, there exist two subsets <span><span>Ssubset [d]</span><script type=\"math/tex\">Ssubset [d]</script></span> and <span><span>Rsubset [d]</span><script type=\"math/tex\">Rsubset [d]</script></span> of sizes <span><span>O(dcdot varepsilon ^2)</span><script type=\"math/tex\">O(dcdot varepsilon ^2)</script></span> such that <span><span>Vert textbf{A}_{S,R}Vert _2le varepsilon cdot Vert textbf{A}Vert _2</span><script type=\"math/tex\">Vert textbf{A}_{S,R}","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"18 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry Gromov-Wasserstein几何中的梯度流和黎曼结构

IF 3 1区数学

Foundations of Computational Mathematics Pub Date : 2025-07-08 DOI: 10.1007/s10208-025-09722-w

Zhengxin Zhang, Ziv Goldfeld, Kristjan Greenewald, Youssef Mroueh, Bharath K. Sriperumbudur

{"title":"Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry","authors":"Zhengxin Zhang, Ziv Goldfeld, Kristjan Greenewald, Youssef Mroueh, Bharath K. Sriperumbudur","doi":"10.1007/s10208-025-09722-w","DOIUrl":"https://doi.org/10.1007/s10208-025-09722-w","url":null,"abstract":"<p>The Wasserstein space of probability measures is known for its intricate Riemannian structure, which underpins the Wasserstein geometry and enables gradient flow algorithms. However, the Wasserstein geometry may not be suitable for certain tasks or data modalities. Motivated by scenarios where the global structure of the data needs to be preserved, this work initiates the study of gradient flows and Riemannian structure in the Gromov-Wasserstein (GW) geometry, which is particularly suited for such purposes. We focus on the inner product GW (IGW) distance between distributions on <span><span>mathbb {R}^d</span><script type=\"math/tex\">mathbb {R}^d</script></span>, which preserves the angles within the data and serves as a convenient initial setting due to its analytic tractability. Given a functional <span><span>textsf{F}:mathcal {P}_2(mathbb {R}^d)rightarrow mathbb {R}</span><script type=\"math/tex\">textsf{F}:mathcal {P}_2(mathbb {R}^d)rightarrow mathbb {R}</script></span> to optimize and an initial distribution <span><span>rho _0in mathcal {P}_2(mathbb {R}^d)</span><script type=\"math/tex\">rho _0in mathcal {P}_2(mathbb {R}^d)</script></span>, we present an implicit IGW minimizing movement scheme that generates a sequence of distributions <span><span>{rho _i}_{i=0}^n</span><script type=\"math/tex\">{rho _i}_{i=0}^n</script></span>, which are close in IGW and aligned in the 2-Wasserstein sense. Taking the time step to zero, we prove that the (piecewise constant interpolation of the) discrete solution converges to an IGW generalized minimizing movement (GMM) <span><span>(rho _t)_t</span><script type=\"math/tex\">(rho _t)_t</script></span> that follows the continuity equation with a velocity field <span><span>v_tin L^2(rho _t;mathbb {R}^d)</span><script type=\"math/tex\">v_tin L^2(rho _t;mathbb {R}^d)</script></span>, specified by a global transformation of the Wasserstein gradient of <span><span>textsf{F}</span><script type=\"math/tex\">textsf{F}</script></span> (viz., the gradient of its first variation). The transformation is given by a mobility operator that modifies the Wasserstein gradient to encode not only local information, but also global structure, as expected for the IGW gradient flow. Our gradient flow analysis leads us to identify the Riemannian structure that gives rise to the intrinsic IGW geometry, using which we establish a Benamou-Brenier-like formula for IGW. We conclude with a formal derivation, akin to the Otto calculus, of the IGW gradient as the inverse mobility acting on the Wasserstein gradient. Numerical experiments demonstrating the global nature of IGW interpolations are provided to complement the theory.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"13 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0