Unadjusted Langevin Algorithm for Non-convex Weakly Smooth Potentials

IF 1.1 · CAS Tier 4 (Mathematics) · JCR Q1 (MATHEMATICS)
Dao Nguyen, Xin Dang, Yixin Chen
{"title":"非凸弱光滑势的未调整朗格文算法","authors":"Dao Nguyen, Xin Dang, Yixin Chen","doi":"10.1007/s40304-023-00350-w","DOIUrl":null,"url":null,"abstract":"<p>Discretization of continuous-time diffusion processes is a widely recognized method for sampling. However, the canonical Euler Maruyama discretization of the Langevin diffusion process, referred as unadjusted Langevin algorithm (ULA), studied mostly in the context of smooth (gradient Lipschitz) and strongly log-concave densities, is a considerable hindrance for its deployment in many sciences, including statistics and machine learning. In this paper, we establish several theoretical contributions to the literature on such sampling methods for non-convex distributions. Particularly, we introduce a new mixture weakly smooth condition, under which we prove that ULA will converge with additional log-Sobolev inequality. We also show that ULA for smoothing potential will converge in <span>\\(L_{2}\\)</span>-Wasserstein distance. Moreover, using convexification of nonconvex domain (Ma et al. in Proc Natl Acad Sci 116(42):20881–20885, 2019) in combination with regularization, we establish the convergence in Kullback–Leibler divergence with the number of iterations to reach <span>\\(\\epsilon \\)</span>-neighborhood of a target distribution in only polynomial dependence on the dimension. We relax the conditions of Vempala and Wibisono (Advances in Neural Information Processing Systems, 2019) and prove convergence guarantees under isoperimetry, and non-strongly convex at infinity.</p>","PeriodicalId":10575,"journal":{"name":"Communications in Mathematics and Statistics","volume":null,"pages":null},"PeriodicalIF":1.1000,"publicationDate":"2023-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Unadjusted Langevin Algorithm for Non-convex Weakly Smooth Potentials\",\"authors\":\"Dao Nguyen, Xin Dang, Yixin Chen\",\"doi\":\"10.1007/s40304-023-00350-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Discretization of continuous-time diffusion processes is a widely recognized method for sampling. However, the canonical Euler Maruyama discretization of the Langevin diffusion process, referred as unadjusted Langevin algorithm (ULA), studied mostly in the context of smooth (gradient Lipschitz) and strongly log-concave densities, is a considerable hindrance for its deployment in many sciences, including statistics and machine learning. In this paper, we establish several theoretical contributions to the literature on such sampling methods for non-convex distributions. Particularly, we introduce a new mixture weakly smooth condition, under which we prove that ULA will converge with additional log-Sobolev inequality. We also show that ULA for smoothing potential will converge in <span>\\\\(L_{2}\\\\)</span>-Wasserstein distance. Moreover, using convexification of nonconvex domain (Ma et al. in Proc Natl Acad Sci 116(42):20881–20885, 2019) in combination with regularization, we establish the convergence in Kullback–Leibler divergence with the number of iterations to reach <span>\\\\(\\\\epsilon \\\\)</span>-neighborhood of a target distribution in only polynomial dependence on the dimension. 
We relax the conditions of Vempala and Wibisono (Advances in Neural Information Processing Systems, 2019) and prove convergence guarantees under isoperimetry, and non-strongly convex at infinity.</p>\",\"PeriodicalId\":10575,\"journal\":{\"name\":\"Communications in Mathematics and Statistics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2023-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications in Mathematics and Statistics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1007/s40304-023-00350-w\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATHEMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Mathematics and Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s40304-023-00350-w","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS","Score":null,"Total":0}
Citations: 0

Abstract

Discretization of continuous-time diffusion processes is a widely recognized method for sampling. However, the canonical Euler–Maruyama discretization of the Langevin diffusion process, referred to as the unadjusted Langevin algorithm (ULA), has been studied mostly in the context of smooth (gradient-Lipschitz) and strongly log-concave densities, which considerably hinders its deployment in many sciences, including statistics and machine learning. In this paper, we make several theoretical contributions to the literature on such sampling methods for non-convex distributions. In particular, we introduce a new mixture weakly smooth condition, under which we prove that ULA converges given an additional log-Sobolev inequality. We also show that ULA applied to a smoothed potential converges in \(L_{2}\)-Wasserstein distance. Moreover, using convexification of the non-convex domain (Ma et al. in Proc Natl Acad Sci 116(42):20881–20885, 2019) in combination with regularization, we establish convergence in Kullback–Leibler divergence, with the number of iterations needed to reach an \(\epsilon\)-neighborhood of the target distribution depending only polynomially on the dimension. Finally, we relax the conditions of Vempala and Wibisono (Advances in Neural Information Processing Systems, 2019) and prove convergence guarantees under isoperimetry and non-strong convexity at infinity.
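For context, ULA is the Euler–Maruyama discretization of the Langevin diffusion \(dX_t = -\nabla U(X_t)\,dt + \sqrt{2}\,dB_t\), iterating \(x_{k+1} = x_k - \eta \nabla U(x_k) + \sqrt{2\eta}\,\xi_k\) with \(\xi_k \sim \mathcal{N}(0, I)\). The sketch below illustrates only this generic update, not the paper's analysis; the double-well potential, step size, and iteration count are illustrative assumptions.

```python
import numpy as np

def ula_sample(grad_U, x0, step_size, n_iters, rng=None):
    """Unadjusted Langevin algorithm: Euler-Maruyama discretization of
    dX_t = -grad U(X_t) dt + sqrt(2) dB_t, i.e. the iteration
    x_{k+1} = x_k - eta * grad U(x_k) + sqrt(2 * eta) * xi_k, xi_k ~ N(0, I)."""
    rng = np.random.default_rng() if rng is None else rng
    x = np.asarray(x0, dtype=float)
    samples = np.empty((n_iters, x.size))
    for k in range(n_iters):
        noise = rng.standard_normal(x.size)
        x = x - step_size * grad_U(x) + np.sqrt(2.0 * step_size) * noise
        samples[k] = x
    return samples

# Illustrative non-convex (double-well) potential U(x) = ||x||^4/4 - ||x||^2/2,
# with gradient (||x||^2 - 1) * x; the target density is proportional to exp(-U).
grad_U = lambda x: (x @ x - 1.0) * x
draws = ula_sample(grad_U, x0=np.zeros(2), step_size=1e-2, n_iters=10_000)
```

Note that no Metropolis–Hastings correction is applied ("unadjusted"), so the chain samples from a biased approximation of \(\exp(-U)\) whose error is controlled by the step size.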

Source Journal

Communications in Mathematics and Statistics (Mathematics – Statistics and Probability)

CiteScore: 1.80
Self-citation rate: 0.00%
Articles per year: 36

Journal description: Communications in Mathematics and Statistics is an international journal published by Springer-Verlag in collaboration with the School of Mathematical Sciences, University of Science and Technology of China (USTC). The journal is committed to publishing high-level, original, peer-reviewed research papers in various areas of the mathematical sciences, including pure mathematics, applied mathematics, computational mathematics, and probability and statistics. Typically one volume is published each year, and each volume consists of four issues.