Learning Optimal Transport Between two Empirical Distributions with Normalizing Flows

Florentin Coeurdoux, N. Dobigeon, P. Chainais
{"title":"Learning Optimal Transport Between two Empirical Distributions with Normalizing Flows","authors":"Florentin Coeurdoux, N. Dobigeon, P. Chainais","doi":"10.48550/arXiv.2207.01246","DOIUrl":null,"url":null,"abstract":"Optimal transport (OT) provides effective tools for comparing and mapping probability measures. We propose to leverage the flexibility of neural networks to learn an approximate optimal transport map. More precisely, we present a new and original method to address the problem of transporting a finite set of samples associated with a first underlying unknown distribution towards another finite set of samples drawn from another unknown distribution. We show that a particular instance of invertible neural networks, namely the normalizing flows, can be used to approximate the solution of this OT problem between a pair of empirical distributions. To this aim, we propose to relax the Monge formulation of OT by replacing the equality constraint on the push-forward measure by the minimization of the corresponding Wasserstein distance. The push-forward operator to be retrieved is then restricted to be a normalizing flow which is trained by optimizing the resulting cost function. This approach allows the transport map to be discretized as a composition of functions. Each of these functions is associated to one sub-flow of the network, whose output provides intermediate steps of the transport between the original and target measures. This discretization yields also a set of intermediate barycenters between the two measures of interest. Experiments conducted on toy examples as well as a challenging task of unsupervised translation demonstrate the interest of the proposed method. Finally, some experiments show that the proposed approach leads to a good approximation of the true OT.","PeriodicalId":74091,"journal":{"name":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning and knowledge discovery in databases : European Conference, ECML PKDD ... : proceedings. ECML PKDD (Conference)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2207.01246","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

Optimal transport (OT) provides effective tools for comparing and mapping probability measures. We propose to leverage the flexibility of neural networks to learn an approximate optimal transport map. More precisely, we present a new method for transporting a finite set of samples drawn from a first unknown distribution towards another finite set of samples drawn from a second unknown distribution. We show that a particular class of invertible neural networks, namely normalizing flows, can be used to approximate the solution of this OT problem between a pair of empirical distributions. To this end, we propose to relax the Monge formulation of OT by replacing the equality constraint on the push-forward measure with the minimization of the corresponding Wasserstein distance. The push-forward operator to be retrieved is then restricted to be a normalizing flow, which is trained by optimizing the resulting cost function. This approach allows the transport map to be discretized as a composition of functions. Each of these functions is associated with one sub-flow of the network, whose output provides an intermediate step of the transport between the original and target measures. This discretization also yields a set of intermediate barycenters between the two measures of interest. Experiments conducted on toy examples, as well as on a challenging task of unsupervised translation, demonstrate the relevance of the proposed method. Finally, some experiments show that the proposed approach leads to a good approximation of the true OT.
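Written out, the relaxation described above replaces the hard push-forward constraint of the Monge problem with a Wasserstein penalty. The following is one plausible reading of that construction; the cost c, the flow class \mathcal{F}, and the trade-off weight \lambda are notation introduced here for illustration and are not fixed by the abstract:

    % Monge formulation: the push-forward constraint is imposed exactly
    \min_{T \,:\, T_{\#}\mu = \nu} \int c\bigl(x, T(x)\bigr) \,\mathrm{d}\mu(x)

    % Relaxed formulation: the constraint is traded for a Wasserstein penalty,
    % and T is restricted to the class \mathcal{F} of normalizing flows
    \min_{T \in \mathcal{F}} \int c\bigl(x, T(x)\bigr) \,\mathrm{d}\mu(x) \;+\; \lambda \, W\bigl(T_{\#}\mu, \nu\bigr)

A minimal PyTorch sketch of training such a flow on two empirical point clouds follows. This is not the authors' implementation: the coupling architecture (AffineCoupling), the log-domain Sinkhorn surrogate standing in for the Wasserstein term (sinkhorn_cost), and the weight lam are assumptions chosen to keep the example self-contained and runnable.

    import math
    import torch
    import torch.nn as nn

    class AffineCoupling(nn.Module):
        """One sub-flow: a RealNVP-style affine coupling layer (assumed architecture)."""
        def __init__(self, dim=2, hidden=64, flip=False):
            super().__init__()
            self.flip = flip
            # Conditioner mapping one half of the input to (log-scale, shift).
            self.net = nn.Sequential(
                nn.Linear(dim // 2, hidden), nn.ReLU(),
                nn.Linear(hidden, dim))

        def forward(self, x):
            x1, x2 = x.chunk(2, dim=1)
            if self.flip:
                x1, x2 = x2, x1
            log_s, t = self.net(x1).chunk(2, dim=1)
            y1, y2 = x1, x2 * torch.exp(log_s) + t   # invertible given x1
            if self.flip:
                y1, y2 = y2, y1
            return torch.cat([y1, y2], dim=1)

    def sinkhorn_cost(a, b, eps=0.1, iters=100):
        """Entropic (log-domain Sinkhorn) surrogate for W_2^2 between point clouds."""
        C = torch.cdist(a, b) ** 2                   # pairwise squared distances
        n, m = C.shape
        log_mu = torch.full((n,), -math.log(n))      # uniform source weights
        log_nu = torch.full((m,), -math.log(m))      # uniform target weights
        f, g = torch.zeros(n), torch.zeros(m)
        for _ in range(iters):                       # dual potential updates
            f = -eps * torch.logsumexp(log_nu[None, :] + (g[None, :] - C) / eps, dim=1)
            g = -eps * torch.logsumexp(log_mu[:, None] + (f[:, None] - C) / eps, dim=0)
        P = torch.exp(log_mu[:, None] + log_nu[None, :] + (f[:, None] + g[None, :] - C) / eps)
        return (P * C).sum()                         # transport cost of the entropic plan

    # Two empirical distributions: toy stand-ins for the unknown mu and nu.
    x = torch.randn(256, 2)                                    # samples of mu
    y = 0.5 * torch.randn(256, 2) + torch.tensor([4.0, 0.0])   # samples of nu

    # The flow is a composition of sub-flows, as described in the abstract.
    flow = nn.Sequential(*[AffineCoupling(flip=(i % 2 == 1)) for i in range(4)])
    opt = torch.optim.Adam(flow.parameters(), lr=1e-3)
    lam = 10.0   # assumed trade-off weight between transport cost and fit

    for step in range(1000):
        Tx = flow(x)
        transport = ((x - Tx) ** 2).sum(dim=1).mean()   # Monge cost for c(x,y)=||x-y||^2
        fit = sinkhorn_cost(Tx, y)                      # relaxed push-forward constraint
        loss = transport + lam * fit
        opt.zero_grad()
        loss.backward()
        opt.step()

    # Per the abstract, the outputs of the successive sub-flows provide
    # intermediate steps of the transport (approximate barycenters between mu and nu).
    z, intermediates = x, []
    for sub in flow:
        z = sub(z)
        intermediates.append(z.detach())

After training, composing only a prefix of the sub-flows maps the source samples part of the way towards the target, which is how the discretization into intermediate barycenters mentioned in the abstract can be read off the network.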