瓦瑟斯坦原型分析

IF 1.6 2区数学 Q2 MATHEMATICS, APPLIED

Applied Mathematics and Optimization Pub Date : 2024-09-02 DOI:10.1007/s00245-024-10175-w

Katy Craig, Braxton Osting, Dong Wang, Yiming Xu

{"title":"瓦瑟斯坦原型分析","authors":"Katy Craig, Braxton Osting, Dong Wang, Yiming Xu","doi":"10.1007/s00245-024-10175-w","DOIUrl":null,"url":null,"abstract":"<div><p>Archetypal analysis is an unsupervised machine learning method that summarizes data using a convex polytope. In its original formulation, for fixed <i>k</i>, the method finds a convex polytope with <i>k</i> vertices, called archetype points, such that the polytope is contained in the convex hull of the data and the mean squared Euclidean distance between the data and the polytope is minimal. In the present work, we consider an alternative formulation of archetypal analysis based on the Wasserstein metric, which we call Wasserstein archetypal analysis (WAA). In one dimension, there exists a unique solution of WAA and, in two dimensions, we prove the existence of a solution, as long as the data distribution is absolutely continuous with respect to the Lebesgue measure. We discuss obstacles to extending our result to higher dimensions and general data distributions. We then introduce an appropriate regularization of the problem, via a Rényi entropy, which allows us to obtain the existence of solutions of the regularized problem for general data distributions, in arbitrary dimensions. We prove a consistency result for the regularized problem, ensuring that if the data are iid samples from a probability measure, then as the number of samples is increased, a subsequence of the archetype points converges to the archetype points for the limiting data distribution, almost surely. Finally, we develop and implement a gradient-based computational approach for the two-dimensional problem, based on the semi-discrete formulation of the Wasserstein metric. Detailed numerical experiments are provided to support our theoretical findings.</p></div>","PeriodicalId":55566,"journal":{"name":"Applied Mathematics and Optimization","volume":"90 2","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Wasserstein Archetypal Analysis\",\"authors\":\"Katy Craig, Braxton Osting, Dong Wang, Yiming Xu\",\"doi\":\"10.1007/s00245-024-10175-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Archetypal analysis is an unsupervised machine learning method that summarizes data using a convex polytope. In its original formulation, for fixed <i>k</i>, the method finds a convex polytope with <i>k</i> vertices, called archetype points, such that the polytope is contained in the convex hull of the data and the mean squared Euclidean distance between the data and the polytope is minimal. In the present work, we consider an alternative formulation of archetypal analysis based on the Wasserstein metric, which we call Wasserstein archetypal analysis (WAA). In one dimension, there exists a unique solution of WAA and, in two dimensions, we prove the existence of a solution, as long as the data distribution is absolutely continuous with respect to the Lebesgue measure. We discuss obstacles to extending our result to higher dimensions and general data distributions. We then introduce an appropriate regularization of the problem, via a Rényi entropy, which allows us to obtain the existence of solutions of the regularized problem for general data distributions, in arbitrary dimensions. We prove a consistency result for the regularized problem, ensuring that if the data are iid samples from a probability measure, then as the number of samples is increased, a subsequence of the archetype points converges to the archetype points for the limiting data distribution, almost surely. Finally, we develop and implement a gradient-based computational approach for the two-dimensional problem, based on the semi-discrete formulation of the Wasserstein metric. Detailed numerical experiments are provided to support our theoretical findings.</p></div>\",\"PeriodicalId\":55566,\"journal\":{\"name\":\"Applied Mathematics and Optimization\",\"volume\":\"90 2\",\"pages\":\"\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2024-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Mathematics and Optimization\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s00245-024-10175-w\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATHEMATICS, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Mathematics and Optimization","FirstCategoryId":"100","ListUrlMain":"https://link.springer.com/article/10.1007/s00245-024-10175-w","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}

引用次数: 0

摘要

原型分析是一种无监督的机器学习方法，它利用凸多边形来总结数据。在其最初的表述中，对于固定的 k，该方法会找到一个具有 k 个顶点（称为原型点）的凸多面体，使得该多面体包含在数据的凸壳中，并且数据与该多面体之间的平均欧氏距离平方最小。在本研究中，我们考虑了基于 Wasserstein 度量的原型分析的另一种表述，我们称之为 Wasserstein 原型分析（WAA）。在一维中，WAA 存在唯一解；在二维中，只要数据分布相对于 Lebesgue 度量是绝对连续的，我们就证明了解的存在。我们讨论了将我们的结果扩展到更高维度和一般数据分布的障碍。然后，我们通过雷尼熵对问题进行适当的正则化，从而获得任意维度下一般数据分布的正则化问题解的存在性。我们证明了正则化问题的一致性结果，确保如果数据是从概率度量的 iid 样本，那么随着样本数量的增加，原型点的子序列几乎肯定会收敛到极限数据分布的原型点。最后，我们针对二维问题开发并实施了一种基于梯度的计算方法，该方法基于瓦瑟斯坦度量的半离散表述。我们提供了详细的数值实验来支持我们的理论发现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Wasserstein Archetypal Analysis

查看原文本刊更多论文

Wasserstein Archetypal Analysis

Archetypal analysis is an unsupervised machine learning method that summarizes data using a convex polytope. In its original formulation, for fixed k, the method finds a convex polytope with k vertices, called archetype points, such that the polytope is contained in the convex hull of the data and the mean squared Euclidean distance between the data and the polytope is minimal. In the present work, we consider an alternative formulation of archetypal analysis based on the Wasserstein metric, which we call Wasserstein archetypal analysis (WAA). In one dimension, there exists a unique solution of WAA and, in two dimensions, we prove the existence of a solution, as long as the data distribution is absolutely continuous with respect to the Lebesgue measure. We discuss obstacles to extending our result to higher dimensions and general data distributions. We then introduce an appropriate regularization of the problem, via a Rényi entropy, which allows us to obtain the existence of solutions of the regularized problem for general data distributions, in arbitrary dimensions. We prove a consistency result for the regularized problem, ensuring that if the data are iid samples from a probability measure, then as the number of samples is increased, a subsequence of the archetype points converges to the archetype points for the limiting data distribution, almost surely. Finally, we develop and implement a gradient-based computational approach for the two-dimensional problem, based on the semi-discrete formulation of the Wasserstein metric. Detailed numerical experiments are provided to support our theoretical findings.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied Mathematics and Optimization 数学-应用数学

CiteScore

3.30

自引率

5.60%

发文量

103

审稿时长

>12 weeks

期刊介绍： The Applied Mathematics and Optimization Journal covers a broad range of mathematical methods in particular those that bridge with optimization and have some connection with applications. Core topics include calculus of variations, partial differential equations, stochastic control, optimization of deterministic or stochastic systems in discrete or continuous time, homogenization, control theory, mean field games, dynamic games and optimal transport. Algorithmic, data analytic, machine learning and numerical methods which support the modeling and analysis of optimization problems are encouraged. Of great interest are papers which show some novel idea in either the theory or model which include some connection with potential applications in science and engineering.