Assessment of Gradient-Based Samplers in Standard Cosmological Likelihoods

IF 4.7 3区 物理与天体物理 Q1 ASTRONOMY & ASTROPHYSICS
Arrykrishna Mootoovaloo, Jaime Ruiz-Zapatero, Carlos García-García, David Alonso
{"title":"Assessment of Gradient-Based Samplers in Standard Cosmological Likelihoods","authors":"Arrykrishna Mootoovaloo, Jaime Ruiz-Zapatero, Carlos García-García, David Alonso","doi":"10.1093/mnras/stae2138","DOIUrl":null,"url":null,"abstract":"We assess the usefulness of gradient-based samplers, such as the No-U-Turn Sampler (${\\tt NUTS}$), by comparison with traditional Metropolis-Hastings algorithms, in tomographic 3 × 2 point analyses. Specifically, we use the DES Year 1 data and a simulated future LSST-like survey as representative examples of these studies, containing a significant number of nuisance parameters (20 and 32, respectively) that affect the performance of rejection-based samplers. To do so, we implement a differentiable forward model using JAX-COSMO, and we use it to derive parameter constraints from both datasets using the NUTS algorithm implemented in ${\\tt numpyro}$, and the Metropolis-Hastings algorithm as implemented in Cobaya. When quantified in terms of the number of effective number of samples taken per likelihood evaluation, we find a relative efficiency gain of ${\\mathcal {O}}(10)$ in favour of NUTS. However, this efficiency is reduced to a factor ∼2 when quantified in terms of computational time, since we find the cost of the gradient computation (needed by NUTS) relative to the likelihood to be ∼4.5 times larger for both experiments. We validate these results making use of analytical multi-variate distributions (a multivariate Gaussian and a Rosenbrock distribution) with increasing dimensionality. Based on these results, we conclude that gradient-based samplers such as ${\\tt NUTS}$ can be leveraged to sample high dimensional parameter spaces in Cosmology, although the efficiency improvement is relatively mild for moderate (${\\mathcal {O}}(50)$) dimension numbers, typical of tomographic large-scale structure analyses.","PeriodicalId":18930,"journal":{"name":"Monthly Notices of the Royal Astronomical Society","volume":"74 1","pages":""},"PeriodicalIF":4.7000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Monthly Notices of the Royal Astronomical Society","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1093/mnras/stae2138","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

We assess the usefulness of gradient-based samplers, such as the No-U-Turn Sampler (${\tt NUTS}$), by comparison with traditional Metropolis-Hastings algorithms, in tomographic 3 × 2 point analyses. Specifically, we use the DES Year 1 data and a simulated future LSST-like survey as representative examples of these studies, containing a significant number of nuisance parameters (20 and 32, respectively) that affect the performance of rejection-based samplers. To do so, we implement a differentiable forward model using JAX-COSMO, and we use it to derive parameter constraints from both datasets using the NUTS algorithm implemented in ${\tt numpyro}$, and the Metropolis-Hastings algorithm as implemented in Cobaya. When quantified in terms of the number of effective number of samples taken per likelihood evaluation, we find a relative efficiency gain of ${\mathcal {O}}(10)$ in favour of NUTS. However, this efficiency is reduced to a factor ∼2 when quantified in terms of computational time, since we find the cost of the gradient computation (needed by NUTS) relative to the likelihood to be ∼4.5 times larger for both experiments. We validate these results making use of analytical multi-variate distributions (a multivariate Gaussian and a Rosenbrock distribution) with increasing dimensionality. Based on these results, we conclude that gradient-based samplers such as ${\tt NUTS}$ can be leveraged to sample high dimensional parameter spaces in Cosmology, although the efficiency improvement is relatively mild for moderate (${\mathcal {O}}(50)$) dimension numbers, typical of tomographic large-scale structure analyses.
基于梯度的采样器在标准宇宙学可能性中的评估
通过与传统的 Metropolis-Hastorings 算法比较,我们评估了基于梯度的采样器(如 No-U-Turn Sampler (${tt NUTS}$))在层析 3 × 2 点分析中的实用性。具体地说,我们使用DES第1年数据和模拟的未来LSST类巡天作为这些研究的代表实例,其中包含大量影响基于拒绝的采样器性能的干扰参数(分别为20和32)。为此,我们使用 JAX-COSMO 实现了一个可微分的前向模型,并使用该模型从这两个数据集中推导出参数约束,使用的算法包括在 ${\tt numpyro}$ 中实现的 NUTS 算法和在 Cobaya 中实现的 Metropolis-Hastings 算法。如果以每次似然评估所需的有效样本数量来量化,我们发现 NUTS 算法的相对效率收益为 ${\mathcal {O}}(10)$ 。然而,如果以计算时间来量化,这种效率则降低了 ∼ 2 倍,因为我们发现在这两个实验中,梯度计算(NUTS 所需的)相对于似然的成本要大∼ 4.5 倍。我们利用维度不断增加的分析多变量分布(多变量高斯分布和罗森布洛克分布)验证了这些结果。基于这些结果,我们得出结论,基于梯度的采样器,如 ${tt NUTS}$,可以用于宇宙学中高维参数空间的采样,尽管对于中等维数(${mathcal {O}}(50)$) 维数,即典型的层析大尺度结构分析,效率的提高相对温和。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
9.10
自引率
37.50%
发文量
3198
审稿时长
3 months
期刊介绍: Monthly Notices of the Royal Astronomical Society is one of the world''s leading primary research journals in astronomy and astrophysics, as well as one of the longest established. It publishes the results of original research in positional and dynamical astronomy, astrophysics, radio astronomy, cosmology, space research and the design of astronomical instruments.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信