{"title":"机器学习和优化运输:一些统计和算法工具","authors":"Elsa Cazelles","doi":"10.1051/proc/202374158","DOIUrl":null,"url":null,"abstract":"In this paper, we focus on the analysis of data that can be described by probability measures supported on a Euclidean space, by way of optimal transport. Our main objective is to present a first and second order statistical analyses in the space of distributions in a concise manner, as a first approach to understand the general modes of variation of a set of observations. In the context of optimal transport, these studies correspond to the barycenter and the decomposition into geodesic principal components in theWasserstein space. In particular, we aim attention at a regularised estimator of the barycenter, in order to handle the noise coming from the observations. Additionally, we leverage these tools for time series analysis, whose spectral informations are compared using optimal transport.","PeriodicalId":505884,"journal":{"name":"ESAIM: Proceedings and Surveys","volume":"9 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning and optimal transport: some statistical and algorithmic tools\",\"authors\":\"Elsa Cazelles\",\"doi\":\"10.1051/proc/202374158\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we focus on the analysis of data that can be described by probability measures supported on a Euclidean space, by way of optimal transport. Our main objective is to present a first and second order statistical analyses in the space of distributions in a concise manner, as a first approach to understand the general modes of variation of a set of observations. In the context of optimal transport, these studies correspond to the barycenter and the decomposition into geodesic principal components in theWasserstein space. In particular, we aim attention at a regularised estimator of the barycenter, in order to handle the noise coming from the observations. Additionally, we leverage these tools for time series analysis, whose spectral informations are compared using optimal transport.\",\"PeriodicalId\":505884,\"journal\":{\"name\":\"ESAIM: Proceedings and Surveys\",\"volume\":\"9 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ESAIM: Proceedings and Surveys\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1051/proc/202374158\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ESAIM: Proceedings and Surveys","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1051/proc/202374158","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Machine learning and optimal transport: some statistical and algorithmic tools
In this paper, we focus on the analysis of data that can be described by probability measures supported on a Euclidean space, by way of optimal transport. Our main objective is to present a first and second order statistical analyses in the space of distributions in a concise manner, as a first approach to understand the general modes of variation of a set of observations. In the context of optimal transport, these studies correspond to the barycenter and the decomposition into geodesic principal components in theWasserstein space. In particular, we aim attention at a regularised estimator of the barycenter, in order to handle the noise coming from the observations. Additionally, we leverage these tools for time series analysis, whose spectral informations are compared using optimal transport.