宇宙学n体模拟:对可扩展生成模型的挑战

IF 16.281

Computational Astrophysics and Cosmology Pub Date : 2019-12-19 DOI:10.1186/s40668-019-0032-1

Nathanaël Perraudin, Ankit Srivastava, Aurelien Lucchi, Tomasz Kacprzak, Thomas Hofmann, Alexandre Réfrégier

{"title":"宇宙学n体模拟:对可扩展生成模型的挑战","authors":"Nathanaël Perraudin, Ankit Srivastava, Aurelien Lucchi, Tomasz Kacprzak, Thomas Hofmann, Alexandre Réfrégier","doi":"10.1186/s40668-019-0032-1","DOIUrl":null,"url":null,"abstract":"Deep generative models, such as Generative Adversarial Networks (GANs) or Variational Autoencoders (VAs) have been demonstrated to produce images of high visual quality. However, the existing hardware on which these models are trained severely limits the size of the images that can be generated. The rapid growth of high dimensional data in many fields of science therefore poses a significant challenge for generative models. In cosmology, the large-scale, three-dimensional matter distribution, modeled with N-body simulations, plays a crucial role in understanding the evolution of structures in the universe. As these simulations are computationally very expensive, GANs have recently generated interest as a possible method to emulate these datasets, but they have been, so far, mostly limited to two dimensional data. In this work, we introduce a new benchmark for the generation of three dimensional N-body simulations, in order to stimulate new ideas in the machine learning community and move closer to the practical use of generative models in cosmology. As a first benchmark result, we propose a scalable GAN approach for training a generator of N-body three-dimensional cubes. Our technique relies on two key building blocks, (i) splitting the generation of the high-dimensional data into smaller parts, and (ii) using a multi-scale approach that efficiently captures global image features that might otherwise be lost in the splitting process. We evaluate the performance of our model for the generation of N-body samples using various statistical measures commonly used in cosmology. Our results show that the proposed model produces samples of high visual quality, although the statistical analysis reveals that capturing rare features in the data poses significant problems for the generative models. We make the data, quality evaluation routines, and the proposed GAN architecture publicly available at https://github.com/nperraud/3DcosmoGAN.","PeriodicalId":523,"journal":{"name":"Computational Astrophysics and Cosmology","volume":"6 1","pages":""},"PeriodicalIF":16.2810,"publicationDate":"2019-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s40668-019-0032-1","citationCount":"26","resultStr":"{\"title\":\"Cosmological N-body simulations: a challenge for scalable generative models\",\"authors\":\"Nathanaël Perraudin, Ankit Srivastava, Aurelien Lucchi, Tomasz Kacprzak, Thomas Hofmann, Alexandre Réfrégier\",\"doi\":\"10.1186/s40668-019-0032-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep generative models, such as Generative Adversarial Networks (GANs) or Variational Autoencoders (VAs) have been demonstrated to produce images of high visual quality. However, the existing hardware on which these models are trained severely limits the size of the images that can be generated. The rapid growth of high dimensional data in many fields of science therefore poses a significant challenge for generative models. In cosmology, the large-scale, three-dimensional matter distribution, modeled with N-body simulations, plays a crucial role in understanding the evolution of structures in the universe. As these simulations are computationally very expensive, GANs have recently generated interest as a possible method to emulate these datasets, but they have been, so far, mostly limited to two dimensional data. In this work, we introduce a new benchmark for the generation of three dimensional N-body simulations, in order to stimulate new ideas in the machine learning community and move closer to the practical use of generative models in cosmology. As a first benchmark result, we propose a scalable GAN approach for training a generator of N-body three-dimensional cubes. Our technique relies on two key building blocks, (i) splitting the generation of the high-dimensional data into smaller parts, and (ii) using a multi-scale approach that efficiently captures global image features that might otherwise be lost in the splitting process. We evaluate the performance of our model for the generation of N-body samples using various statistical measures commonly used in cosmology. Our results show that the proposed model produces samples of high visual quality, although the statistical analysis reveals that capturing rare features in the data poses significant problems for the generative models. We make the data, quality evaluation routines, and the proposed GAN architecture publicly available at https://github.com/nperraud/3DcosmoGAN.\",\"PeriodicalId\":523,\"journal\":{\"name\":\"Computational Astrophysics and Cosmology\",\"volume\":\"6 1\",\"pages\":\"\"},\"PeriodicalIF\":16.2810,\"publicationDate\":\"2019-12-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1186/s40668-019-0032-1\",\"citationCount\":\"26\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Astrophysics and Cosmology\",\"FirstCategoryId\":\"4\",\"ListUrlMain\":\"https://link.springer.com/article/10.1186/s40668-019-0032-1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Astrophysics and Cosmology","FirstCategoryId":"4","ListUrlMain":"https://link.springer.com/article/10.1186/s40668-019-0032-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 26

摘要

深度生成模型，如生成对抗网络(gan)或变分自编码器(VAs)已被证明可以产生高视觉质量的图像。然而，用于训练这些模型的现有硬件严重限制了可以生成的图像的大小。因此，在许多科学领域中，高维数据的快速增长对生成模型提出了重大挑战。在宇宙学中，以n体模拟为模型的大尺度三维物质分布对于理解宇宙结构的演化起着至关重要的作用。由于这些模拟在计算上非常昂贵，gan最近作为一种模拟这些数据集的可能方法产生了兴趣，但到目前为止，它们主要局限于二维数据。在这项工作中，我们为生成三维n体模拟引入了一个新的基准，以激发机器学习社区的新想法，并更接近生成模型在宇宙学中的实际应用。作为第一个基准测试结果，我们提出了一种可扩展的GAN方法来训练n体三维立方体的生成器。我们的技术依赖于两个关键的构建块，(i)将高维数据的生成分割成更小的部分，以及(ii)使用多尺度方法有效捕获全局图像特征，否则这些特征可能会在分割过程中丢失。我们使用宇宙学中常用的各种统计措施来评估我们的模型在生成n体样本方面的性能。我们的研究结果表明，所提出的模型产生了高视觉质量的样本，尽管统计分析表明，捕获数据中的罕见特征对生成模型提出了重大问题。我们在https://github.com/nperraud/3DcosmoGAN上公开了数据、质量评估例程和提议的GAN架构。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Cosmological N-body simulations: a challenge for scalable generative models

查看原文本刊更多论文

Cosmological N-body simulations: a challenge for scalable generative models

Deep generative models, such as Generative Adversarial Networks (GANs) or Variational Autoencoders (VAs) have been demonstrated to produce images of high visual quality. However, the existing hardware on which these models are trained severely limits the size of the images that can be generated. The rapid growth of high dimensional data in many fields of science therefore poses a significant challenge for generative models. In cosmology, the large-scale, three-dimensional matter distribution, modeled with N-body simulations, plays a crucial role in understanding the evolution of structures in the universe. As these simulations are computationally very expensive, GANs have recently generated interest as a possible method to emulate these datasets, but they have been, so far, mostly limited to two dimensional data. In this work, we introduce a new benchmark for the generation of three dimensional N-body simulations, in order to stimulate new ideas in the machine learning community and move closer to the practical use of generative models in cosmology. As a first benchmark result, we propose a scalable GAN approach for training a generator of N-body three-dimensional cubes. Our technique relies on two key building blocks, (i) splitting the generation of the high-dimensional data into smaller parts, and (ii) using a multi-scale approach that efficiently captures global image features that might otherwise be lost in the splitting process. We evaluate the performance of our model for the generation of N-body samples using various statistical measures commonly used in cosmology. Our results show that the proposed model produces samples of high visual quality, although the statistical analysis reveals that capturing rare features in the data poses significant problems for the generative models. We make the data, quality evaluation routines, and the proposed GAN architecture publicly available at https://github.com/nperraud/3DcosmoGAN.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computational Astrophysics and Cosmology

自引率

0.00%

发文量

期刊介绍： Computational Astrophysics and Cosmology (CompAC) is now closed and no longer accepting submissions. However, we would like to assure you that Springer will maintain an archive of all articles published in CompAC, ensuring their accessibility through SpringerLink's comprehensive search functionality.