Algorithms for Mean-Field Variational Inference Via Polyhedral Optimization in the Wasserstein Space

IF 2.7 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS
Yiheng Jiang, Sinho Chewi, Aram-Alexandre Pooladian
{"title":"Algorithms for Mean-Field Variational Inference Via Polyhedral Optimization in the Wasserstein Space","authors":"Yiheng Jiang, Sinho Chewi, Aram-Alexandre Pooladian","doi":"10.1007/s10208-025-09721-x","DOIUrl":null,"url":null,"abstract":"<p>We develop a theory of finite-dimensional polyhedral subsets over the Wasserstein space and optimization of functionals over them via first-order methods. Our main application is to the problem of mean-field variational inference (MFVI), which seeks to approximate a distribution <span><span>\\pi </span><script type=\"math/tex\">\\pi </script></span> over <span><span>\\mathbb {R}^d</span><script type=\"math/tex\">\\mathbb {R}^d</script></span> by a product measure <span><span>\\pi ^\\star </span><script type=\"math/tex\">\\pi ^\\star </script></span>. When <span><span>\\pi </span><script type=\"math/tex\">\\pi </script></span> is strongly log-concave and log-smooth, we provide (1) approximation rates certifying that <span><span>\\pi ^\\star </span><script type=\"math/tex\">\\pi ^\\star </script></span> is close to the minimizer <span><span>\\pi ^\\star _\\diamond </span><script type=\"math/tex\">\\pi ^\\star _\\diamond </script></span> of the KL divergence over a <i>polyhedral</i> set <span><span>\\mathcal {P}_\\diamond </span><script type=\"math/tex\">\\mathcal {P}_\\diamond </script></span>, and (2) an algorithm for minimizing <span><span>\\mathop {\\textrm{KL}}\\limits (\\cdot \\!\\;\\Vert \\; \\!\\pi )</span><script type=\"math/tex\">\\mathop {\\textrm{KL}}\\limits (\\cdot \\!\\;\\Vert \\; \\!\\pi )</script></span> over <span><span>\\mathcal {P}_\\diamond </span><script type=\"math/tex\">\\mathcal {P}_\\diamond </script></span> based on accelerated gradient descent over <span><span>\\mathbb {R}^d</span><script type=\"math/tex\">\\mathbb {R}^d</script></span>. As a byproduct of our analysis, we obtain the first end-to-end analysis for gradient-based algorithms for MFVI.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"13 1","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Foundations of Computational Mathematics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10208-025-09721-x","RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

We develop a theory of finite-dimensional polyhedral subsets over the Wasserstein space and optimization of functionals over them via first-order methods. Our main application is to the problem of mean-field variational inference (MFVI), which seeks to approximate a distribution \pi over \mathbb {R}^d by a product measure \pi ^\star . When \pi is strongly log-concave and log-smooth, we provide (1) approximation rates certifying that \pi ^\star is close to the minimizer \pi ^\star _\diamond of the KL divergence over a polyhedral set \mathcal {P}_\diamond , and (2) an algorithm for minimizing \mathop {\textrm{KL}}\limits (\cdot \!\;\Vert \; \!\pi ) over \mathcal {P}_\diamond based on accelerated gradient descent over \mathbb {R}^d. As a byproduct of our analysis, we obtain the first end-to-end analysis for gradient-based algorithms for MFVI.

基于Wasserstein空间多面体优化的平均场变分推理算法
我们发展了Wasserstein空间上有限维多面体子集的理论,并通过一阶方法对其上的泛函进行了优化。我们的主要应用是平均场变分推理(MFVI)的问题,它寻求通过产品度量\pi ^ \star\pi ^ {}\star来近似分布{}\pi\pi / \mathbb R^d \mathbb R^d。当\pi\pi为强对数凹和对数平滑时,我们提供了(1)近似速率,证明\pi ^ \star\pi ^ \star接近于多面体集\mathcal P_ \diamond\mathcal P_\diamond的KL散度的最小值\pi ^ \star\diamond{}\pi ^ \star{}\diamond;(2)最小化\mathop{\textrm{KL}}\limits (\cdot \!\; \Vert \; \!\pi) \mathop{\textrm{KL}}\limits (\cdot \!\; \Vert \; \!\pi) over \mathcal P_{}\diamond\mathcal P_{}\diamond基于加速梯度下降over \mathbb R{^d }\mathbb R{^d。作为我们分析的副产品,我们获得了第一个基于梯度的MFVI算法的端到端分析。}
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Foundations of Computational Mathematics
Foundations of Computational Mathematics 数学-计算机:理论方法
CiteScore
6.90
自引率
3.30%
发文量
46
审稿时长
>12 weeks
期刊介绍: Foundations of Computational Mathematics (FoCM) will publish research and survey papers of the highest quality which further the understanding of the connections between mathematics and computation. The journal aims to promote the exploration of all fundamental issues underlying the creative tension among mathematics, computer science and application areas unencumbered by any external criteria such as the pressure for applications. The journal will thus serve an increasingly important and applicable area of mathematics. The journal hopes to further the understanding of the deep relationships between mathematical theory: analysis, topology, geometry and algebra, and the computational processes as they are evolving in tandem with the modern computer. With its distinguished editorial board selecting papers of the highest quality and interest from the international community, FoCM hopes to influence both mathematics and computation. Relevance to applications will not constitute a requirement for the publication of articles. The journal does not accept code for review however authors who have code/data related to the submission should include a weblink to the repository where the data/code is stored.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信