Accelerating frequency-domain simulations using small shared-memory CPU/GPU cluster

T. Topa, A. Noga, A. Karwowski
{"title":"Accelerating frequency-domain simulations using small shared-memory CPU/GPU cluster","authors":"T. Topa, A. Noga, A. Karwowski","doi":"10.1109/MIKON.2016.7492098","DOIUrl":null,"url":null,"abstract":"Numerical approach to frequency response problems usually requires that the system governing equation is solved repeatedly at many frequencies. The computational efficiency of the overall process can be increased by departing from traditional sequential computing model in favor of utilizing the parallel processing capability commonly offered by modern hardware. In this paper, we consider a hybrid programming pattern, OpenMP + CUDA, from the perspective of a user of a rather typical low-cost multi-core CPU-based workstation that can accommodate up to four GPUs. Such the small-scale heterogeneous platforms have recently gained wide popularity in scientific computing as an inexpensive massively parallel architecture. The relevant programming model issues and performance questions are addressed. Experimental results for the example physics problem, that is, the electromagnetic scattering from perfectly electrically conducting body, show that significant performance improvement can be attained with the OpenMP + CUDA programming model.","PeriodicalId":354299,"journal":{"name":"2016 21st International Conference on Microwave, Radar and Wireless Communications (MIKON)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 21st International Conference on Microwave, Radar and Wireless Communications (MIKON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MIKON.2016.7492098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Numerical approach to frequency response problems usually requires that the system governing equation is solved repeatedly at many frequencies. The computational efficiency of the overall process can be increased by departing from traditional sequential computing model in favor of utilizing the parallel processing capability commonly offered by modern hardware. In this paper, we consider a hybrid programming pattern, OpenMP + CUDA, from the perspective of a user of a rather typical low-cost multi-core CPU-based workstation that can accommodate up to four GPUs. Such the small-scale heterogeneous platforms have recently gained wide popularity in scientific computing as an inexpensive massively parallel architecture. The relevant programming model issues and performance questions are addressed. Experimental results for the example physics problem, that is, the electromagnetic scattering from perfectly electrically conducting body, show that significant performance improvement can be attained with the OpenMP + CUDA programming model.
使用小型共享内存CPU/GPU集群加速频域模拟
频率响应问题的数值求解通常需要在多个频率上重复求解系统控制方程。抛弃传统的顺序计算模型,利用现代硬件普遍提供的并行处理能力,可以提高整个过程的计算效率。在本文中,我们考虑了一种混合编程模式,OpenMP + CUDA,从一个相当典型的低成本多核cpu工作站的用户的角度来看,它可以容纳多达四个gpu。这种小规模的异构平台最近作为一种廉价的大规模并行架构在科学计算中得到了广泛的普及。讨论了相关的编程模型问题和性能问题。实验结果表明,采用OpenMP + CUDA编程模型可以显著提高算法的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信