Sliced Wasserstein Variational Inference

Asian Conference on Machine Learning Pub Date : 2022-07-26 DOI:10.48550/arXiv.2207.13177

Mingxuan Yi, Song Liu

引用次数: 10

Abstract

Variational Inference approximates an unnormalized distribution via the minimization of Kullback-Leibler (KL) divergence. Although this divergence is efficient for computation and has been widely used in applications, it suffers from some unreasonable properties. For example, it is not a proper metric, i.e., it is non-symmetric and does not preserve the triangle inequality. On the other hand, optimal transport distances recently have shown some advantages over KL divergence. With the help of these advantages, we propose a new variational inference method by minimizing sliced Wasserstein distance, a valid metric arising from optimal transport. This sliced Wasserstein distance can be approximated simply by running MCMC but without solving any optimization problem. Our approximation also does not require a tractable density function of variational distributions so that approximating families can be amortized by generators like neural networks. Furthermore, we provide an analysis of the theoretical properties of our method. Experiments on synthetic and real data are illustrated to show the performance of the proposed method.

查看原文本刊更多论文

切片Wasserstein变分推理

变分推理通过最小化Kullback-Leibler (KL)散度来近似非标准化分布。虽然这种散度计算效率高，在实际应用中得到了广泛的应用，但也存在一些不合理的性质。例如，它不是一个适当的度规，也就是说，它是非对称的，不保持三角形不等式。另一方面，最近最优运输距离比KL散度显示出一些优势。利用这些优点，我们提出了一种新的变分推理方法，通过最小化切片沃瑟斯坦距离，这是一个由最优传输产生的有效度量。这个Wasserstein距离可以简单地通过运行MCMC来近似，但不需要解决任何优化问题。我们的近似也不需要易处理的变分分布密度函数，因此近似族可以由神经网络之类的生成器平摊。此外，我们还对该方法的理论性质进行了分析。仿真实验和实际数据验证了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Asian Conference on Machine Learning

自引率

0.00%

发文量