GPU Acceleration of Smoothed Particle Hydrodynamics for the Navier-Stokes Equations

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) Pub Date : 2016-04-04 DOI:10.1109/PDP.2016.28

Yingrui Wang, Leisheng Li, Jingtao Wang, R. Tian

{"title":"GPU Acceleration of Smoothed Particle Hydrodynamics for the Navier-Stokes Equations","authors":"Yingrui Wang, Leisheng Li, Jingtao Wang, R. Tian","doi":"10.1109/PDP.2016.28","DOIUrl":null,"url":null,"abstract":"Although there exist much work on GPU acceleration on the SPH method, the focus so far has been on the Euler equations in fluid mechanics. This paper presents GPU acceleration on the SPH method for the Navier-Stokes equations for both solid and fluid mechanics. We investigate and compare three CPU-GPU coupling models in terms of one large-scale parallel application code: (1) CPU?GPU (to only run hotspots on GPU), (2) GPU-alone (to run the whole of simulation on GPU), and (3) CPU||GPU (to treat CPU and GPU as equivalent processors). A common issue to the three models, \"easy code transplant onto GPU\", is emphasized. Optimizations on particle indexing and particle interaction on GPU, which are of unique importance to a SPH code, are addressed. Numerical experiments are finally performed and 4x, 10x, 16x speedups are observed for the three coupling models, respectively, with reference to single CPU core. Among the three, the fastest model -- Xthe \"CPU||GPU\" model -- Xfurther undergoes scalability tests on a cluster of 6 heterogeneous nodes and shows 90+% parallel efficiency.","PeriodicalId":192273,"journal":{"name":"2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PDP.2016.28","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

Although there exist much work on GPU acceleration on the SPH method, the focus so far has been on the Euler equations in fluid mechanics. This paper presents GPU acceleration on the SPH method for the Navier-Stokes equations for both solid and fluid mechanics. We investigate and compare three CPU-GPU coupling models in terms of one large-scale parallel application code: (1) CPU?GPU (to only run hotspots on GPU), (2) GPU-alone (to run the whole of simulation on GPU), and (3) CPU||GPU (to treat CPU and GPU as equivalent processors). A common issue to the three models, "easy code transplant onto GPU", is emphasized. Optimizations on particle indexing and particle interaction on GPU, which are of unique importance to a SPH code, are addressed. Numerical experiments are finally performed and 4x, 10x, 16x speedups are observed for the three coupling models, respectively, with reference to single CPU core. Among the three, the fastest model -- Xthe "CPU||GPU" model -- Xfurther undergoes scalability tests on a cluster of 6 heterogeneous nodes and shows 90+% parallel efficiency.

查看原文本刊更多论文

Navier-Stokes方程光滑粒子流体力学的GPU加速

虽然在SPH方法的GPU加速方面已经有了很多工作，但迄今为止的重点是流体力学中的欧拉方程。本文介绍了固体力学和流体力学Navier-Stokes方程的SPH方法上的GPU加速。我们根据一个大规模并行应用程序代码调查和比较了三种CPU- gpu耦合模型:(1)CPU?GPU(仅在GPU上运行热点)，(2)GPU单独(在GPU上运行整个仿真)，(3)CPU||GPU(将CPU和GPU视为等效处理器)。强调了三种模型的共同问题，即“易于代码移植到GPU上”。对粒子索引和粒子相互作用在GPU上的优化问题进行了研究，这两个问题对SPH代码具有独特的重要性。最后进行了数值实验，在参考单个CPU内核的情况下，三种耦合模型的速度分别提高了4倍、10倍和16倍。在这三种模型中，最快的模型——“CPU||GPU”模型——x进一步在6个异构节点的集群上进行了可扩展性测试，并显示出90%以上的并行效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)

自引率

0.00%

发文量