Independent Finite Approximations for Bayesian Nonparametric Inference

IF 4.9 · Zone 2 (Mathematics) · Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Bayesian Analysis Pub Date : 2020-09-22 DOI:10.1214/23-ba1385

Tin D. Nguyen, Jonathan Huggins, L. Masoero, Lester W. Mackey, Tamara Broderick


Citations: 1

Abstract

Bayesian nonparametric priors based on completely random measures (CRMs) offer a flexible modeling approach when the number of latent components in a dataset is unknown. However, managing the infinite dimensionality of CRMs typically requires practitioners to derive ad-hoc algorithms, preventing the use of general-purpose inference methods and often leading to long compute times. We propose a general but explicit recipe to construct a simple finite-dimensional approximation that can replace the infinite-dimensional CRMs. Our independent finite approximation (IFA) is a generalization of important cases that are used in practice. The independence of atom weights in our approximation (i) makes the construction well-suited for parallel and distributed computation and (ii) facilitates more convenient inference schemes. We quantify the approximation error between IFAs and the target nonparametric prior. We compare IFAs with an alternative approximation scheme -- truncated finite approximations (TFAs), where the atom weights are constructed sequentially. We prove that, for worst-case choices of observation likelihoods, TFAs are a more efficient approximation than IFAs. However, in real-data experiments with image denoising and topic modeling, we find that IFAs perform very similarly to TFAs in terms of task-specific accuracy metrics.
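To make the contrast between independent and sequential atom weights concrete, here is a minimal sketch for one familiar special case, a beta process with mass `alpha` and concentration 1. This choice of prior, the function names `ifa_beta_weights` and `tfa_beta_weights`, and the parameter values are illustrative assumptions, not taken from the paper. The independent version uses the standard finite beta-Bernoulli approximation (i.i.d. Beta(alpha/K, 1) weights); the sequential version uses the well-known stick-breaking form in which each weight is a running product of Beta(alpha, 1) draws.

```python
import numpy as np

def ifa_beta_weights(alpha, K, rng):
    """Independent finite approximation (illustrative special case).

    All K atom weights are drawn i.i.d. from Beta(alpha / K, 1), so they
    can be sampled in any order, or in parallel.
    """
    return rng.beta(alpha / K, 1.0, size=K)

def tfa_beta_weights(alpha, K, rng):
    """Truncated finite approximation (illustrative special case).

    Weights are built sequentially: the k-th weight is the product of the
    first k stick variables v_j ~ Beta(alpha, 1), so each weight depends
    on all earlier ones and the sequence is non-increasing.
    """
    sticks = rng.beta(alpha, 1.0, size=K)
    return np.cumprod(sticks)

rng = np.random.default_rng(0)
alpha, K = 2.0, 50  # assumed values, for illustration only

w_ifa = ifa_beta_weights(alpha, K, rng)
w_tfa = tfa_beta_weights(alpha, K, rng)

# Under both schemes the expected total mass approaches alpha as K grows.
print("IFA total mass:", w_ifa.sum())
print("TFA total mass:", w_tfa.sum())
```

In the independent scheme no weight depends on any other, which is the property the abstract credits for easier parallel and distributed computation. The sequential scheme orders the weights by construction, which is consistent with the abstract's claim that truncation can be the more efficient approximation in the worst case.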


Source journal

Bayesian Analysis (Mathematics: Mathematics, Interdisciplinary Applications)

CiteScore: 6.50

Self-citation rate: 13.60%

Articles published:

Review time: >12 weeks

About the journal: Bayesian Analysis is an electronic journal of the International Society for Bayesian Analysis. It seeks to publish a wide range of articles that demonstrate or discuss Bayesian methods in some theoretical or applied context. The journal welcomes submissions involving presentation of new computational and statistical methods; critical reviews and discussions of existing approaches; historical perspectives; description of important scientific or policy application areas; case studies; and methods for experimental design, data collection, data sharing, or data mining. Evaluation of submissions is based on importance of content and effectiveness of communication. Discussion papers are typically chosen by the Editor in Chief, or suggested by an Editor, among the regular submissions. In addition, the Journal encourages individual authors to submit manuscripts for consideration as discussion papers.