Acceleration of a 3D Immersed Boundary Solver Using OpenACC

Apurva Raj, Somnath C. Roy, N. Vydyanathan, Bharatkumar Sharma
{"title":"Acceleration of a 3D Immersed Boundary Solver Using OpenACC","authors":"Apurva Raj, Somnath C. Roy, N. Vydyanathan, Bharatkumar Sharma","doi":"10.1109/HIPCW.2018.8634138","DOIUrl":null,"url":null,"abstract":"Immersed-boundary methods (IBM) have been constantly gaining popularity and are increasingly expanding to new areas of applications in computational mechanics since last three decades due to the potentials of their application in modeling complex multiphysics phenomena which involves flow over complex and moving boundaries. The specific advantages of an immersed boundary method are due to its accuracy and simplicity. As this method uses a fixed structured Cartesian mesh, the complex grid generation processes can be fully avoided whereas the complex/moving boundary is described using another surface mesh. The computational overheads in an immersed boundary implementation can be very high due to expensive search and interpolation steps through which the effects of the boundary conditions on the surface mesh are translated to the fixed Cartesian volume mesh. Therefore, computationally efficient numerical implementation of an IBM solver is of extreme importance to researchers. This paper presents an accelerated discrete finite difference based immersed boundary (IB) solver that is used to study the external flow behavior around complex geometries. The flow is assumed to be incompressible. The immersed boundary solver is parallelized using OpenACC for quick acceleration with minimal code changes and to ensure performance portability across both GPUs and multicore CPUs. Our experimental results indicate that the OpenACC-based IB solver run on a NVIDIA Tesla P100 GPU is 21x faster than the sequential legacy solver and is 3.3x faster than the OpenACC-based IB solver run on a dual socket Intel Xeon Gold 6148, 20 core CPU. The recirculation lengths obtained for Reynolds numbers of 20 and 40 and the Strouhal number for Reynolds number 100, for a standard flow visualization problem over a fixed cylinder, are in accordance with the reported data in available literature, thereby validating the accuracy of the parallel solver. We also analyze the performance of the accelerated solver on different GPU architectures: Kepler, Pascal and Volta.","PeriodicalId":401060,"journal":{"name":"2018 IEEE 25th International Conference on High Performance Computing Workshops (HiPCW)","volume":"194 5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 25th International Conference on High Performance Computing Workshops (HiPCW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HIPCW.2018.8634138","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Immersed-boundary methods (IBM) have been constantly gaining popularity and are increasingly expanding to new areas of applications in computational mechanics since last three decades due to the potentials of their application in modeling complex multiphysics phenomena which involves flow over complex and moving boundaries. The specific advantages of an immersed boundary method are due to its accuracy and simplicity. As this method uses a fixed structured Cartesian mesh, the complex grid generation processes can be fully avoided whereas the complex/moving boundary is described using another surface mesh. The computational overheads in an immersed boundary implementation can be very high due to expensive search and interpolation steps through which the effects of the boundary conditions on the surface mesh are translated to the fixed Cartesian volume mesh. Therefore, computationally efficient numerical implementation of an IBM solver is of extreme importance to researchers. This paper presents an accelerated discrete finite difference based immersed boundary (IB) solver that is used to study the external flow behavior around complex geometries. The flow is assumed to be incompressible. The immersed boundary solver is parallelized using OpenACC for quick acceleration with minimal code changes and to ensure performance portability across both GPUs and multicore CPUs. Our experimental results indicate that the OpenACC-based IB solver run on a NVIDIA Tesla P100 GPU is 21x faster than the sequential legacy solver and is 3.3x faster than the OpenACC-based IB solver run on a dual socket Intel Xeon Gold 6148, 20 core CPU. The recirculation lengths obtained for Reynolds numbers of 20 and 40 and the Strouhal number for Reynolds number 100, for a standard flow visualization problem over a fixed cylinder, are in accordance with the reported data in available literature, thereby validating the accuracy of the parallel solver. We also analyze the performance of the accelerated solver on different GPU architectures: Kepler, Pascal and Volta.
基于OpenACC的三维浸入式边界求解器的加速研究
近三十年来,浸入式边界方法(IBM)在模拟复杂的多物理场现象(涉及复杂和移动边界的流动)方面具有潜在的应用潜力,因此在计算力学中不断得到普及,并日益扩展到新的应用领域。浸没边界法的独特优势在于它的准确性和简单性。由于该方法使用固定结构的笛卡尔网格,可以完全避免复杂的网格生成过程,而使用另一种表面网格来描述复杂/移动的边界。浸入式边界实现的计算开销可能非常高,因为搜索和插值步骤非常昂贵,通过这些步骤将边界条件对表面网格的影响转换为固定的笛卡尔体网格。因此,IBM求解器的高效计算数值实现对研究人员来说至关重要。本文提出了一种加速离散有限差分浸入边界(IB)求解器,用于研究复杂几何形状周围的外部流动行为。假定流不可压缩。浸入式边界求解器使用OpenACC进行并行化,以最小的代码更改实现快速加速,并确保跨gpu和多核cpu的性能可移植性。我们的实验结果表明,在NVIDIA Tesla P100 GPU上运行的基于openacc的IB求解器比顺序遗留求解器快21倍,比在双插槽Intel Xeon Gold 61448,20核CPU上运行的基于openacc的IB求解器快3.3倍。对于固定圆柱体上的标准流动可视化问题,雷诺数为20和40时得到的再循环长度和雷诺数为100时得到的Strouhal数与现有文献报道的数据一致,从而验证了并行求解器的准确性。我们还分析了加速求解器在不同GPU架构(Kepler, Pascal和Volta)上的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信