Data-centric GPU-based adaptive mesh refinement

Proceedings of the 5th Workshop on Irregular Applications: Architectures and Algorithms Pub Date : 2015-11-15 DOI:10.1145/2833179.2833181

M. Wahib, N. Maruyama

{"title":"Data-centric GPU-based adaptive mesh refinement","authors":"M. Wahib, N. Maruyama","doi":"10.1145/2833179.2833181","DOIUrl":null,"url":null,"abstract":"It has been demonstrated that explicit stencil computations of high-resolution scheme can highly benefit from GPUs. This includes Adaptive Mesh Refinement (AMR), which is a model for adapting the resolution of a stencil grid locally. Unlike uniform grid stencils, however, adapting the grid is typically done on the CPU side. This requires transferring the stencil data arrays to/from CPU every time the grid is adapted. We propose a data-centric approach to GPU-based AMR. That is, porting all the mesh adaptation operations touching the data arrays to the GPU. This would allow the stencil data arrays to reside on the GPU memory for the entirety of the simulation. Thus, the GPU code would specialize on the data residing on its memory while the CPU specializes on the AMR metadata residing on CPU memory. We compare the performance of the proposed method to a basic GPU implementation and an optimized GPU implementation that overlaps communication and computation. The performance of two GPU-based AMR applications is enhanced by 2.21x, and 2.83x compared to the basic implementation.","PeriodicalId":215872,"journal":{"name":"Proceedings of the 5th Workshop on Irregular Applications: Architectures and Algorithms","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th Workshop on Irregular Applications: Architectures and Algorithms","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2833179.2833181","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

Abstract

It has been demonstrated that explicit stencil computations of high-resolution scheme can highly benefit from GPUs. This includes Adaptive Mesh Refinement (AMR), which is a model for adapting the resolution of a stencil grid locally. Unlike uniform grid stencils, however, adapting the grid is typically done on the CPU side. This requires transferring the stencil data arrays to/from CPU every time the grid is adapted. We propose a data-centric approach to GPU-based AMR. That is, porting all the mesh adaptation operations touching the data arrays to the GPU. This would allow the stencil data arrays to reside on the GPU memory for the entirety of the simulation. Thus, the GPU code would specialize on the data residing on its memory while the CPU specializes on the AMR metadata residing on CPU memory. We compare the performance of the proposed method to a basic GPU implementation and an optimized GPU implementation that overlaps communication and computation. The performance of two GPU-based AMR applications is enhanced by 2.21x, and 2.83x compared to the basic implementation.

查看原文本刊更多论文

以数据为中心的基于gpu的自适应网格细化

研究表明，高分辨率方案的显式模板计算可以充分利用gpu。这包括自适应网格细化(AMR)，这是一种局部适应模板网格分辨率的模型。然而，与统一网格模板不同的是，网格的调整通常是在CPU端完成的。这需要在每次调整网格时向CPU传输模板数据数组。我们提出了一种以数据为中心的基于gpu的AMR方法。也就是说，将所有涉及数据数组的网格适配操作移植到GPU。这将允许模板数据数组在整个模拟过程中驻留在GPU内存上。因此，GPU代码将专门处理驻留在其内存中的数据，而CPU专门处理驻留在CPU内存中的AMR元数据。我们将提出的方法的性能与基本GPU实现和优化的GPU实现进行了比较，这些GPU实现重叠了通信和计算。与基本实现相比，两个基于gpu的AMR应用程序的性能分别提高了2.21倍和2.83倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 5th Workshop on Irregular Applications: Architectures and Algorithms

自引率

0.00%

发文量