{"title":"GPU Acceleration of Smoothed Particle Hydrodynamics for the Navier-Stokes Equations","authors":"Yingrui Wang, Leisheng Li, Jingtao Wang, R. Tian","doi":"10.1109/PDP.2016.28","DOIUrl":null,"url":null,"abstract":"Although there exist much work on GPU acceleration on the SPH method, the focus so far has been on the Euler equations in fluid mechanics. This paper presents GPU acceleration on the SPH method for the Navier-Stokes equations for both solid and fluid mechanics. We investigate and compare three CPU-GPU coupling models in terms of one large-scale parallel application code: (1) CPU?GPU (to only run hotspots on GPU), (2) GPU-alone (to run the whole of simulation on GPU), and (3) CPU||GPU (to treat CPU and GPU as equivalent processors). A common issue to the three models, \"easy code transplant onto GPU\", is emphasized. Optimizations on particle indexing and particle interaction on GPU, which are of unique importance to a SPH code, are addressed. Numerical experiments are finally performed and 4x, 10x, 16x speedups are observed for the three coupling models, respectively, with reference to single CPU core. Among the three, the fastest model -- Xthe \"CPU||GPU\" model -- Xfurther undergoes scalability tests on a cluster of 6 heterogeneous nodes and shows 90+% parallel efficiency.","PeriodicalId":192273,"journal":{"name":"2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PDP.2016.28","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Although there exist much work on GPU acceleration on the SPH method, the focus so far has been on the Euler equations in fluid mechanics. This paper presents GPU acceleration on the SPH method for the Navier-Stokes equations for both solid and fluid mechanics. We investigate and compare three CPU-GPU coupling models in terms of one large-scale parallel application code: (1) CPU?GPU (to only run hotspots on GPU), (2) GPU-alone (to run the whole of simulation on GPU), and (3) CPU||GPU (to treat CPU and GPU as equivalent processors). A common issue to the three models, "easy code transplant onto GPU", is emphasized. Optimizations on particle indexing and particle interaction on GPU, which are of unique importance to a SPH code, are addressed. Numerical experiments are finally performed and 4x, 10x, 16x speedups are observed for the three coupling models, respectively, with reference to single CPU core. Among the three, the fastest model -- Xthe "CPU||GPU" model -- Xfurther undergoes scalability tests on a cluster of 6 heterogeneous nodes and shows 90+% parallel efficiency.