{"title":"Projective Integral Updates for High-Dimensional Variational Inference","authors":"Jed A. Duersch","doi":"10.1137/22m1529919","DOIUrl":null,"url":null,"abstract":"SIAM/ASA Journal on Uncertainty Quantification, Volume 12, Issue 1, Page 69-100, March 2024. <br/> Abstract. Variational inference is an approximation framework for Bayesian inference that seeks to improve quantified uncertainty in predictions by optimizing a simplified distribution over parameters to stand in for the full posterior. Capturing model variations that remain consistent with training data enables more robust predictions by reducing parameter sensitivity. This work introduces a fixed-point optimization for variational inference that is applicable when every feasible log density can be expressed as a linear combination of functions from a given basis. In such cases, the optimizer becomes a fixed-point of projective integral updates. When the basis spans univariate quadratics in each parameter, the feasible distributions are Gaussian mean-fields and the projective integral updates yield quasi-Newton variational Bayes (QNVB). Other bases and updates are also possible. Since these updates require high-dimensional integration, this work begins by proposing an efficient quasirandom sequence of quadratures for mean-field distributions. Each iterate of the sequence contains two evaluation points that combine to correctly integrate all univariate quadratic functions and, if the mean-field factors are symmetric, all univariate cubics. More importantly, averaging results over short subsequences achieves periodic exactness on a much larger space of multivariate polynomials of quadratic total degree. The corresponding variational updates require four loss evaluations with standard (not second-order) backpropagation to eliminate error terms from over half of all multivariate quadratic basis functions. This integration technique is motivated by first proposing stochastic blocked mean-field quadratures, which may be useful in other contexts. A PyTorch implementation of QNVB allows for better control over model uncertainty during training than competing methods. Experiments demonstrate superior generalizability for multiple learning problems and architectures.","PeriodicalId":56064,"journal":{"name":"Siam-Asa Journal on Uncertainty Quantification","volume":"15 1","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Siam-Asa Journal on Uncertainty Quantification","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1137/22m1529919","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Citations: 0
Abstract
Variational inference is an approximation framework for Bayesian inference that seeks to improve quantified uncertainty in predictions by optimizing a simplified distribution over parameters to stand in for the full posterior. Capturing model variations that remain consistent with training data enables more robust predictions by reducing parameter sensitivity. This work introduces a fixed-point optimization for variational inference that is applicable when every feasible log density can be expressed as a linear combination of functions from a given basis. In such cases, the optimizer becomes a fixed point of projective integral updates. When the basis spans univariate quadratics in each parameter, the feasible distributions are Gaussian mean-fields, and the projective integral updates yield quasi-Newton variational Bayes (QNVB). Other bases and updates are also possible. Since these updates require high-dimensional integration, this work begins by proposing an efficient quasirandom sequence of quadratures for mean-field distributions. Each iterate of the sequence contains two evaluation points that combine to correctly integrate all univariate quadratic functions and, if the mean-field factors are symmetric, all univariate cubics. More importantly, averaging results over short subsequences achieves periodic exactness on a much larger space of multivariate polynomials of quadratic total degree. The corresponding variational updates require four loss evaluations with standard (not second-order) backpropagation to eliminate error terms from over half of all multivariate quadratic basis functions. This integration technique is motivated by first proposing stochastic blocked mean-field quadratures, which may be useful in other contexts. A PyTorch implementation of QNVB allows for better control over model uncertainty during training than competing methods. Experiments demonstrate superior generalizability for multiple learning problems and architectures.
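The two-point exactness claim is easy to verify numerically. The sketch below is illustrative only, not the paper's quasirandom sequence or its PyTorch implementation: for a Gaussian mean-field with means mu and standard deviations sigma, the antithetic pair theta = mu ± sigma*z with each |z_i| = 1 averages to the exact expectation of any function that is quadratic in each parameter, since E[theta_i] = mu_i, E[theta_i^2] = mu_i^2 + sigma_i^2, and odd (cubic) terms cancel by the symmetry of the pair. All names here (f, mu, sigma, z) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Gaussian mean-field over d parameters: theta_i ~ N(mu_i, sigma_i^2).
d = 5
mu = rng.normal(size=d)
sigma = rng.uniform(0.5, 2.0, size=d)

# A random test function that is quadratic in each parameter separately:
# f(theta) = a + b . theta + sum_i c_i theta_i^2
a = rng.normal()
b = rng.normal(size=d)
c = rng.normal(size=d)

def f(theta):
    return a + b @ theta + c @ theta**2

# Closed-form mean-field expectation of f.
exact = a + b @ mu + c @ (mu**2 + sigma**2)

# Two-point antithetic quadrature: theta = mu +/- sigma * z with |z_i| = 1.
# The pair average reproduces every univariate quadratic exactly, because
# ((mu + s*z)^2 + (mu - s*z)^2) / 2 = mu^2 + s^2 when z^2 = 1; odd terms cancel.
z = rng.choice([-1.0, 1.0], size=d)
pair_avg = 0.5 * (f(mu + sigma * z) + f(mu - sigma * z))

print(np.allclose(pair_avg, exact))  # True
```

A single random sign pattern already cancels univariate error terms; per the abstract, the paper instead draws the sign patterns from a quasirandom sequence so that averaging over short subsequences also cancels cross terms theta_i * theta_j, achieving periodic exactness on multivariate polynomials of quadratic total degree.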
Journal Description
SIAM/ASA Journal on Uncertainty Quantification (JUQ) publishes research articles presenting significant mathematical, statistical, algorithmic, and application advances in uncertainty quantification, defined as the interface of complex modeling of processes and data, especially characterizations of the uncertainties inherent in the use of such models. The journal also focuses on related fields such as sensitivity analysis, model validation, model calibration, data assimilation, and code verification. It further solicits papers describing new ideas that could lead to significant progress in methodology for uncertainty quantification, as well as review articles on particular aspects. The journal is dedicated to nurturing synergistic interactions between the mathematical, statistical, computational, and applications communities involved in uncertainty quantification and related areas. JUQ is jointly offered by SIAM and the American Statistical Association.