{"title":"受约束目标分布的球形哈密顿蒙特卡洛。","authors":"Shiwei Lan, Bo Zhou, Babak Shahbaba","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Statistical models with constrained probability distributions are abundant in machine learning. Some examples include regression models with norm constraints (e.g., Lasso), probit models, many copula models, and Latent Dirichlet Allocation (LDA) models. Bayesian inference involving probability distributions confined to constrained domains could be quite challenging for commonly used sampling algorithms. For such problems, we propose a novel Markov Chain Monte Carlo (MCMC) method that provides a general and computationally efficient framework for handling boundary conditions. Our method first maps the <i>D</i>-dimensional constrained domain of parameters to the unit ball [Formula: see text], then augments it to a <i>D</i>-dimensional sphere <b>S</b><sup><i>D</i></sup> such that the original boundary corresponds to the equator of <b>S</b><sup><i>D</i></sup> . This way, our method handles the constraints implicitly by moving freely on the sphere generating proposals that remain within boundaries when mapped back to the original space. To improve the computational efficiency of our algorithm, we divide the dynamics into several parts such that the resulting split dynamics has a partial analytical solution as a geodesic flow on the sphere. We apply our method to several examples including truncated Gaussian, Bayesian Lasso, Bayesian bridge regression, and a copula model for identifying synchrony among multiple neurons. Our results show that the proposed method can provide a natural and efficient framework for handling several types of constraints on target distributions.</p>","PeriodicalId":89793,"journal":{"name":"JMLR workshop and conference proceedings","volume":"32 ","pages":"629-637"},"PeriodicalIF":0.0000,"publicationDate":"2014-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4407381/pdf/nihms672830.pdf","citationCount":"0","resultStr":"{\"title\":\"Spherical Hamiltonian Monte Carlo for Constrained Target Distributions.\",\"authors\":\"Shiwei Lan, Bo Zhou, Babak Shahbaba\",\"doi\":\"\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Statistical models with constrained probability distributions are abundant in machine learning. Some examples include regression models with norm constraints (e.g., Lasso), probit models, many copula models, and Latent Dirichlet Allocation (LDA) models. Bayesian inference involving probability distributions confined to constrained domains could be quite challenging for commonly used sampling algorithms. For such problems, we propose a novel Markov Chain Monte Carlo (MCMC) method that provides a general and computationally efficient framework for handling boundary conditions. Our method first maps the <i>D</i>-dimensional constrained domain of parameters to the unit ball [Formula: see text], then augments it to a <i>D</i>-dimensional sphere <b>S</b><sup><i>D</i></sup> such that the original boundary corresponds to the equator of <b>S</b><sup><i>D</i></sup> . This way, our method handles the constraints implicitly by moving freely on the sphere generating proposals that remain within boundaries when mapped back to the original space. To improve the computational efficiency of our algorithm, we divide the dynamics into several parts such that the resulting split dynamics has a partial analytical solution as a geodesic flow on the sphere. We apply our method to several examples including truncated Gaussian, Bayesian Lasso, Bayesian bridge regression, and a copula model for identifying synchrony among multiple neurons. Our results show that the proposed method can provide a natural and efficient framework for handling several types of constraints on target distributions.</p>\",\"PeriodicalId\":89793,\"journal\":{\"name\":\"JMLR workshop and conference proceedings\",\"volume\":\"32 \",\"pages\":\"629-637\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4407381/pdf/nihms672830.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMLR workshop and conference proceedings\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMLR workshop and conference proceedings","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在机器学习中,具有约束概率分布的统计模型比比皆是。一些例子包括带规范约束的回归模型(如 Lasso)、probit 模型、许多 copula 模型和 Latent Dirichlet Allocation (LDA) 模型。对于常用的抽样算法来说,涉及受限域的概率分布的贝叶斯推断具有相当大的挑战性。针对此类问题,我们提出了一种新颖的马尔可夫链蒙特卡罗(MCMC)方法,为处理边界条件提供了一个通用且计算高效的框架。我们的方法首先将 D 维参数约束域映射到单位球[公式:见正文],然后将其扩展到一个 D 维球体 SD,这样原始边界就对应于 SD 的赤道。这样,我们的方法通过在球面上自由移动来隐式处理约束,生成的提案在映射回原始空间时仍保持在边界内。为了提高算法的计算效率,我们将动力学分为几个部分,这样产生的分裂动力学就有了球面上大地流的部分解析解。我们将我们的方法应用于几个例子,包括截断高斯模型、贝叶斯拉索模型、贝叶斯桥回归模型,以及用于识别多个神经元之间同步性的 copula 模型。我们的研究结果表明,所提出的方法可以为处理目标分布的多种类型约束提供一个自然而高效的框架。
Spherical Hamiltonian Monte Carlo for Constrained Target Distributions.
Statistical models with constrained probability distributions are abundant in machine learning. Some examples include regression models with norm constraints (e.g., Lasso), probit models, many copula models, and Latent Dirichlet Allocation (LDA) models. Bayesian inference involving probability distributions confined to constrained domains could be quite challenging for commonly used sampling algorithms. For such problems, we propose a novel Markov Chain Monte Carlo (MCMC) method that provides a general and computationally efficient framework for handling boundary conditions. Our method first maps the D-dimensional constrained domain of parameters to the unit ball [Formula: see text], then augments it to a D-dimensional sphere SD such that the original boundary corresponds to the equator of SD . This way, our method handles the constraints implicitly by moving freely on the sphere generating proposals that remain within boundaries when mapped back to the original space. To improve the computational efficiency of our algorithm, we divide the dynamics into several parts such that the resulting split dynamics has a partial analytical solution as a geodesic flow on the sphere. We apply our method to several examples including truncated Gaussian, Bayesian Lasso, Bayesian bridge regression, and a copula model for identifying synchrony among multiple neurons. Our results show that the proposed method can provide a natural and efficient framework for handling several types of constraints on target distributions.