Arrykrishna Mootoovaloo, Jaime Ruiz-Zapatero, Carlos García-García, David Alonso
{"title":"基于梯度的采样器在标准宇宙学可能性中的评估","authors":"Arrykrishna Mootoovaloo, Jaime Ruiz-Zapatero, Carlos García-García, David Alonso","doi":"10.1093/mnras/stae2138","DOIUrl":null,"url":null,"abstract":"We assess the usefulness of gradient-based samplers, such as the No-U-Turn Sampler (${\\tt NUTS}$), by comparison with traditional Metropolis-Hastings algorithms, in tomographic 3 × 2 point analyses. Specifically, we use the DES Year 1 data and a simulated future LSST-like survey as representative examples of these studies, containing a significant number of nuisance parameters (20 and 32, respectively) that affect the performance of rejection-based samplers. To do so, we implement a differentiable forward model using JAX-COSMO, and we use it to derive parameter constraints from both datasets using the NUTS algorithm implemented in ${\\tt numpyro}$, and the Metropolis-Hastings algorithm as implemented in Cobaya. When quantified in terms of the number of effective number of samples taken per likelihood evaluation, we find a relative efficiency gain of ${\\mathcal {O}}(10)$ in favour of NUTS. However, this efficiency is reduced to a factor ∼2 when quantified in terms of computational time, since we find the cost of the gradient computation (needed by NUTS) relative to the likelihood to be ∼4.5 times larger for both experiments. We validate these results making use of analytical multi-variate distributions (a multivariate Gaussian and a Rosenbrock distribution) with increasing dimensionality. Based on these results, we conclude that gradient-based samplers such as ${\\tt NUTS}$ can be leveraged to sample high dimensional parameter spaces in Cosmology, although the efficiency improvement is relatively mild for moderate (${\\mathcal {O}}(50)$) dimension numbers, typical of tomographic large-scale structure analyses.","PeriodicalId":18930,"journal":{"name":"Monthly Notices of the Royal Astronomical Society","volume":"74 1","pages":""},"PeriodicalIF":4.7000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessment of Gradient-Based Samplers in Standard Cosmological Likelihoods\",\"authors\":\"Arrykrishna Mootoovaloo, Jaime Ruiz-Zapatero, Carlos García-García, David Alonso\",\"doi\":\"10.1093/mnras/stae2138\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We assess the usefulness of gradient-based samplers, such as the No-U-Turn Sampler (${\\\\tt NUTS}$), by comparison with traditional Metropolis-Hastings algorithms, in tomographic 3 × 2 point analyses. Specifically, we use the DES Year 1 data and a simulated future LSST-like survey as representative examples of these studies, containing a significant number of nuisance parameters (20 and 32, respectively) that affect the performance of rejection-based samplers. To do so, we implement a differentiable forward model using JAX-COSMO, and we use it to derive parameter constraints from both datasets using the NUTS algorithm implemented in ${\\\\tt numpyro}$, and the Metropolis-Hastings algorithm as implemented in Cobaya. When quantified in terms of the number of effective number of samples taken per likelihood evaluation, we find a relative efficiency gain of ${\\\\mathcal {O}}(10)$ in favour of NUTS. However, this efficiency is reduced to a factor ∼2 when quantified in terms of computational time, since we find the cost of the gradient computation (needed by NUTS) relative to the likelihood to be ∼4.5 times larger for both experiments. We validate these results making use of analytical multi-variate distributions (a multivariate Gaussian and a Rosenbrock distribution) with increasing dimensionality. Based on these results, we conclude that gradient-based samplers such as ${\\\\tt NUTS}$ can be leveraged to sample high dimensional parameter spaces in Cosmology, although the efficiency improvement is relatively mild for moderate (${\\\\mathcal {O}}(50)$) dimension numbers, typical of tomographic large-scale structure analyses.\",\"PeriodicalId\":18930,\"journal\":{\"name\":\"Monthly Notices of the Royal Astronomical Society\",\"volume\":\"74 1\",\"pages\":\"\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2024-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Monthly Notices of the Royal Astronomical Society\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.1093/mnras/stae2138\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ASTRONOMY & ASTROPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Monthly Notices of the Royal Astronomical Society","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1093/mnras/stae2138","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
Assessment of Gradient-Based Samplers in Standard Cosmological Likelihoods
We assess the usefulness of gradient-based samplers, such as the No-U-Turn Sampler (${\tt NUTS}$), by comparison with traditional Metropolis-Hastings algorithms, in tomographic 3 × 2 point analyses. Specifically, we use the DES Year 1 data and a simulated future LSST-like survey as representative examples of these studies, containing a significant number of nuisance parameters (20 and 32, respectively) that affect the performance of rejection-based samplers. To do so, we implement a differentiable forward model using JAX-COSMO, and we use it to derive parameter constraints from both datasets using the NUTS algorithm implemented in ${\tt numpyro}$, and the Metropolis-Hastings algorithm as implemented in Cobaya. When quantified in terms of the number of effective number of samples taken per likelihood evaluation, we find a relative efficiency gain of ${\mathcal {O}}(10)$ in favour of NUTS. However, this efficiency is reduced to a factor ∼2 when quantified in terms of computational time, since we find the cost of the gradient computation (needed by NUTS) relative to the likelihood to be ∼4.5 times larger for both experiments. We validate these results making use of analytical multi-variate distributions (a multivariate Gaussian and a Rosenbrock distribution) with increasing dimensionality. Based on these results, we conclude that gradient-based samplers such as ${\tt NUTS}$ can be leveraged to sample high dimensional parameter spaces in Cosmology, although the efficiency improvement is relatively mild for moderate (${\mathcal {O}}(50)$) dimension numbers, typical of tomographic large-scale structure analyses.
期刊介绍:
Monthly Notices of the Royal Astronomical Society is one of the world''s leading primary research journals in astronomy and astrophysics, as well as one of the longest established. It publishes the results of original research in positional and dynamical astronomy, astrophysics, radio astronomy, cosmology, space research and the design of astronomical instruments.