{"title":"The conditional barycenter problem, its data-driven formulation and its solution through normalizing flows","authors":"Esteban G. Tabak, Giulio Trigila, Wenjun Zhao","doi":"10.4310/cms.2024.v22.n6.a8","DOIUrl":null,"url":null,"abstract":"A family of normalizing flows is introduced for selectively removing from a data set the variability attributable to a specific set of cofactors, while preserving the dependence on others. This is achieved by extending the barycenter problem of optimal transport theory to the newly introduced conditional barycenter problem. Rather than summarizing the data with a single probability distribution, as in the classical barycenter problem, the conditional barycenter is represented by a family of distributions labeled by the cofactors kept. The use of the conditional barycenter and its differences with the classical barycenter are illustrated on synthetic and real data addressing treatment effect estimation, super-resolution, anomaly detection and lightness transfer in image analysis.","PeriodicalId":50659,"journal":{"name":"Communications in Mathematical Sciences","volume":"11 1","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Mathematical Sciences","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.4310/cms.2024.v22.n6.a8","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
A family of normalizing flows is introduced for selectively removing from a data set the variability attributable to a specific set of cofactors, while preserving the dependence on others. This is achieved by extending the barycenter problem of optimal transport theory to the newly introduced conditional barycenter problem. Rather than summarizing the data with a single probability distribution, as in the classical barycenter problem, the conditional barycenter is represented by a family of distributions labeled by the cofactors kept. The use of the conditional barycenter and its differences with the classical barycenter are illustrated on synthetic and real data addressing treatment effect estimation, super-resolution, anomaly detection and lightness transfer in image analysis.
期刊介绍:
Covers modern applied mathematics in the fields of modeling, applied and stochastic analyses and numerical computations—on problems that arise in physical, biological, engineering, and financial applications. The journal publishes high-quality, original research articles, reviews, and expository papers.