Distributed Parallel Hybrid CPU-GPGPU Implementation of the Phase-Field Method for Accelerated High-Accuracy Simulations of Pore-Scale Two-Phase Flow

C. Thiele, M. Araya-Polo, F. Alpak, B. Rivière
{"title":"Distributed Parallel Hybrid CPU-GPGPU Implementation of the Phase-Field Method for Accelerated High-Accuracy Simulations of Pore-Scale Two-Phase Flow","authors":"C. Thiele, M. Araya-Polo, F. Alpak, B. Rivière","doi":"10.2118/193922-MS","DOIUrl":null,"url":null,"abstract":"\n Direct numerical simulation of multi-phase pore-scale flow is a computationally demanding task with strong requirements on time-to-solution for the prediction of relative permeabilities. In this paper, we describe the hybrid-parallel implementation of a two-phase two-component incompressible flow simulator using MPI, OpenMP, and general-purpose graphics processing units (GPUs), and we analyze its computational performance. In particular, we evaluate the parallel performance of GPU-based iterative linear solvers for this application, and we compare them to CPU-based implementations of the same solver algorithms. Simulations on real-life Berea sandstone micro-CT images are used to assess the strong scalability and computational performance of the different solver implementations and their effect on time-to-solution. Additionally, we use a Poisson problem to further characterize achievable strong and weak scalability of the GPU-based solvers in reproducible experiments. Our experiments show that GPU-based iterative solvers can greatly reduce time-to-solution in complex pore-scale simulations. On the other hand, strong scalability is currently limited by the unbalanced computing capacities of the host and the GPUs. The experiments with the Poisson problem indicate that GPU-based iterative solvers are efficient when weak scalability is desired. Our findings show that proper utilization of GPUs can help to make our two-phase pore-scale flow simulation computationally feasible in existing workflows.","PeriodicalId":246878,"journal":{"name":"Day 2 Thu, April 11, 2019","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 2 Thu, April 11, 2019","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/193922-MS","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Direct numerical simulation of multi-phase pore-scale flow is a computationally demanding task with strong requirements on time-to-solution for the prediction of relative permeabilities. In this paper, we describe the hybrid-parallel implementation of a two-phase two-component incompressible flow simulator using MPI, OpenMP, and general-purpose graphics processing units (GPUs), and we analyze its computational performance. In particular, we evaluate the parallel performance of GPU-based iterative linear solvers for this application, and we compare them to CPU-based implementations of the same solver algorithms. Simulations on real-life Berea sandstone micro-CT images are used to assess the strong scalability and computational performance of the different solver implementations and their effect on time-to-solution. Additionally, we use a Poisson problem to further characterize achievable strong and weak scalability of the GPU-based solvers in reproducible experiments. Our experiments show that GPU-based iterative solvers can greatly reduce time-to-solution in complex pore-scale simulations. On the other hand, strong scalability is currently limited by the unbalanced computing capacities of the host and the GPUs. The experiments with the Poisson problem indicate that GPU-based iterative solvers are efficient when weak scalability is desired. Our findings show that proper utilization of GPUs can help to make our two-phase pore-scale flow simulation computationally feasible in existing workflows.
分布式并行混合CPU-GPGPU相场法实现孔隙尺度两相流加速高精度模拟
多相孔隙尺度流动的直接数值模拟是一项计算量很大的任务,对相对渗透率预测的求解时间要求很高。本文描述了一种采用MPI、OpenMP和通用图形处理单元(gpu)的两相双组分不可压缩流动模拟器的混合并行实现,并对其计算性能进行了分析。特别地,我们评估了基于gpu的迭代线性求解器的并行性能,并将它们与基于cpu的相同求解器算法的实现进行了比较。通过对真实Berea砂岩微ct图像的模拟,评估了不同求解器实现的强大可扩展性和计算性能,以及它们对求解时间的影响。此外,我们使用泊松问题进一步表征了基于gpu的求解器在可重复性实验中的可实现的强可扩展性和弱可扩展性。我们的实验表明,基于gpu的迭代求解器可以大大缩短复杂孔隙尺度模拟的求解时间。另一方面,强大的可扩展性目前受到主机和gpu计算能力不平衡的限制。泊松问题的实验表明,当需要弱可扩展性时,基于gpu的迭代求解方法是有效的。我们的研究结果表明,适当利用gpu有助于使我们的两相孔隙尺度流模拟在现有工作流中计算可行。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信