Accelerating frequency-domain simulations using small shared-memory CPU/GPU cluster

2016 21st International Conference on Microwave, Radar and Wireless Communications (MIKON) Pub Date : 2016-05-09 DOI:10.1109/MIKON.2016.7492098

T. Topa, A. Noga, A. Karwowski

{"title":"Accelerating frequency-domain simulations using small shared-memory CPU/GPU cluster","authors":"T. Topa, A. Noga, A. Karwowski","doi":"10.1109/MIKON.2016.7492098","DOIUrl":null,"url":null,"abstract":"Numerical approach to frequency response problems usually requires that the system governing equation is solved repeatedly at many frequencies. The computational efficiency of the overall process can be increased by departing from traditional sequential computing model in favor of utilizing the parallel processing capability commonly offered by modern hardware. In this paper, we consider a hybrid programming pattern, OpenMP + CUDA, from the perspective of a user of a rather typical low-cost multi-core CPU-based workstation that can accommodate up to four GPUs. Such the small-scale heterogeneous platforms have recently gained wide popularity in scientific computing as an inexpensive massively parallel architecture. The relevant programming model issues and performance questions are addressed. Experimental results for the example physics problem, that is, the electromagnetic scattering from perfectly electrically conducting body, show that significant performance improvement can be attained with the OpenMP + CUDA programming model.","PeriodicalId":354299,"journal":{"name":"2016 21st International Conference on Microwave, Radar and Wireless Communications (MIKON)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 21st International Conference on Microwave, Radar and Wireless Communications (MIKON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MIKON.2016.7492098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Numerical approach to frequency response problems usually requires that the system governing equation is solved repeatedly at many frequencies. The computational efficiency of the overall process can be increased by departing from traditional sequential computing model in favor of utilizing the parallel processing capability commonly offered by modern hardware. In this paper, we consider a hybrid programming pattern, OpenMP + CUDA, from the perspective of a user of a rather typical low-cost multi-core CPU-based workstation that can accommodate up to four GPUs. Such the small-scale heterogeneous platforms have recently gained wide popularity in scientific computing as an inexpensive massively parallel architecture. The relevant programming model issues and performance questions are addressed. Experimental results for the example physics problem, that is, the electromagnetic scattering from perfectly electrically conducting body, show that significant performance improvement can be attained with the OpenMP + CUDA programming model.

查看原文本刊更多论文

使用小型共享内存CPU/GPU集群加速频域模拟

频率响应问题的数值求解通常需要在多个频率上重复求解系统控制方程。抛弃传统的顺序计算模型，利用现代硬件普遍提供的并行处理能力，可以提高整个过程的计算效率。在本文中，我们考虑了一种混合编程模式，OpenMP + CUDA，从一个相当典型的低成本多核cpu工作站的用户的角度来看，它可以容纳多达四个gpu。这种小规模的异构平台最近作为一种廉价的大规模并行架构在科学计算中得到了广泛的普及。讨论了相关的编程模型问题和性能问题。实验结果表明，采用OpenMP + CUDA编程模型可以显著提高算法的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 21st International Conference on Microwave, Radar and Wireless Communications (MIKON)

自引率

0.00%

发文量