{"title":"使用运输地图学习的累积分布函数的近似。","authors":"Dawen Wu, Ludovic Chamoin","doi":"10.1063/5.0276348","DOIUrl":null,"url":null,"abstract":"<p><p>This paper considers approximating the cumulative distribution function (CDF). For many important probability distributions, such as the normal distribution, their CDFs lack closed-form expressions representable by elementary functions. Although approximation methods exist, common techniques such as the empirical CDF typically rely on large amounts of sample data to construct sufficiently accurate approximations. The aim of this paper is to provide accurate and data-efficient closed-form approximations for CDFs. Our methodology is inspired by the theory of transport maps. We leverage the fundamental property that in the specific one-dimensional case, the transport map transforming a target random variable to the standard uniform distribution U(0,1) is identical to the target variable's CDF. Building upon this key insight, we propose Transport Map Learning (TML). We utilize TML to train a neural network whose output is subsequently processed by a sigmoid function. This composite architecture serves as our closed-form CDF approximation, inherently constraining the output to the [0,1] range appropriate for a CDF. The effectiveness of the proposed method is validated on three benchmark probability distributions: the standard normal distribution, the beta distribution, and the gamma distribution. The results demonstrate that, given the same amount of training data, the proposed TML method generates highly accurate closed-form approximations for the CDFs. These approximations achieve superior accuracy compared to established methods based on the empirical CDF combined with various interpolation strategies.</p>","PeriodicalId":9974,"journal":{"name":"Chaos","volume":"35 9","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Approximations of the cumulative distribution function using transport maps learning.\",\"authors\":\"Dawen Wu, Ludovic Chamoin\",\"doi\":\"10.1063/5.0276348\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>This paper considers approximating the cumulative distribution function (CDF). For many important probability distributions, such as the normal distribution, their CDFs lack closed-form expressions representable by elementary functions. Although approximation methods exist, common techniques such as the empirical CDF typically rely on large amounts of sample data to construct sufficiently accurate approximations. The aim of this paper is to provide accurate and data-efficient closed-form approximations for CDFs. Our methodology is inspired by the theory of transport maps. We leverage the fundamental property that in the specific one-dimensional case, the transport map transforming a target random variable to the standard uniform distribution U(0,1) is identical to the target variable's CDF. Building upon this key insight, we propose Transport Map Learning (TML). We utilize TML to train a neural network whose output is subsequently processed by a sigmoid function. This composite architecture serves as our closed-form CDF approximation, inherently constraining the output to the [0,1] range appropriate for a CDF. The effectiveness of the proposed method is validated on three benchmark probability distributions: the standard normal distribution, the beta distribution, and the gamma distribution. The results demonstrate that, given the same amount of training data, the proposed TML method generates highly accurate closed-form approximations for the CDFs. These approximations achieve superior accuracy compared to established methods based on the empirical CDF combined with various interpolation strategies.</p>\",\"PeriodicalId\":9974,\"journal\":{\"name\":\"Chaos\",\"volume\":\"35 9\",\"pages\":\"\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chaos\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1063/5.0276348\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATHEMATICS, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chaos","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1063/5.0276348","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
Approximations of the cumulative distribution function using transport maps learning.
This paper considers approximating the cumulative distribution function (CDF). For many important probability distributions, such as the normal distribution, their CDFs lack closed-form expressions representable by elementary functions. Although approximation methods exist, common techniques such as the empirical CDF typically rely on large amounts of sample data to construct sufficiently accurate approximations. The aim of this paper is to provide accurate and data-efficient closed-form approximations for CDFs. Our methodology is inspired by the theory of transport maps. We leverage the fundamental property that in the specific one-dimensional case, the transport map transforming a target random variable to the standard uniform distribution U(0,1) is identical to the target variable's CDF. Building upon this key insight, we propose Transport Map Learning (TML). We utilize TML to train a neural network whose output is subsequently processed by a sigmoid function. This composite architecture serves as our closed-form CDF approximation, inherently constraining the output to the [0,1] range appropriate for a CDF. The effectiveness of the proposed method is validated on three benchmark probability distributions: the standard normal distribution, the beta distribution, and the gamma distribution. The results demonstrate that, given the same amount of training data, the proposed TML method generates highly accurate closed-form approximations for the CDFs. These approximations achieve superior accuracy compared to established methods based on the empirical CDF combined with various interpolation strategies.
期刊介绍:
Chaos: An Interdisciplinary Journal of Nonlinear Science is a peer-reviewed journal devoted to increasing the understanding of nonlinear phenomena and describing the manifestations in a manner comprehensible to researchers from a broad spectrum of disciplines.