基于GPU集群的单级快速多极方法求解电磁问题

2013 US National Committee of URSI National Radio Science Meeting (USNC-URSI NRSM) Pub Date : 2013-06-06 DOI:10.1109/USNC-URSI-NRSM.2013.6525071

V. Dang, Q. Nguyen, O. Kilic, E. El-Araby

{"title":"基于GPU集群的单级快速多极方法求解电磁问题","authors":"V. Dang, Q. Nguyen, O. Kilic, E. El-Araby","doi":"10.1109/USNC-URSI-NRSM.2013.6525071","DOIUrl":null,"url":null,"abstract":"Fast Multipole Method (FMM) is a mathematical technique which was developed to seek rapid solutions to integral equations of scattering for Helmholtz problems. For scattering problems, the integral equation is discretized into a matrix equation by the method of moments (MoM). The resultant equation is then typically solved by the direct LU, or an iterative method which requires O(N3) or O(N2) floating point operations respectively. However, if FMM is implemented, the complexity is reduced to O(N3/2). Moreover, the multilevel fast multipole algorithm (MLFMA) which is a multistage FMM can further reduce the complexity to O(NlogN). These methods are promising for providing a path to large scale computing in electromagnetics.In this paper, the exploitation of an HPC GPU cluster to implement single level fast multipole method for large scale scattering problems is investigated. The NVIDIA's Compute Unified Device Architecture (CUDA) and MVAPICH2 programming environments are utilized on a 13-node cluster equipped with GPUs and interconnected through a high bandwidth, low latency Infiniband network.","PeriodicalId":123571,"journal":{"name":"2013 US National Committee of URSI National Radio Science Meeting (USNC-URSI NRSM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Single level fast multipole method on GPU cluster for electromagnetic problems\",\"authors\":\"V. Dang, Q. Nguyen, O. Kilic, E. El-Araby\",\"doi\":\"10.1109/USNC-URSI-NRSM.2013.6525071\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fast Multipole Method (FMM) is a mathematical technique which was developed to seek rapid solutions to integral equations of scattering for Helmholtz problems. For scattering problems, the integral equation is discretized into a matrix equation by the method of moments (MoM). The resultant equation is then typically solved by the direct LU, or an iterative method which requires O(N3) or O(N2) floating point operations respectively. However, if FMM is implemented, the complexity is reduced to O(N3/2). Moreover, the multilevel fast multipole algorithm (MLFMA) which is a multistage FMM can further reduce the complexity to O(NlogN). These methods are promising for providing a path to large scale computing in electromagnetics.In this paper, the exploitation of an HPC GPU cluster to implement single level fast multipole method for large scale scattering problems is investigated. The NVIDIA's Compute Unified Device Architecture (CUDA) and MVAPICH2 programming environments are utilized on a 13-node cluster equipped with GPUs and interconnected through a high bandwidth, low latency Infiniband network.\",\"PeriodicalId\":123571,\"journal\":{\"name\":\"2013 US National Committee of URSI National Radio Science Meeting (USNC-URSI NRSM)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 US National Committee of URSI National Radio Science Meeting (USNC-URSI NRSM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/USNC-URSI-NRSM.2013.6525071\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 US National Committee of URSI National Radio Science Meeting (USNC-URSI NRSM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/USNC-URSI-NRSM.2013.6525071","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

快速多极法(FMM)是为了快速求解亥姆霍兹问题散射积分方程而发展起来的一种数学方法。对于散射问题，采用矩量法将积分方程离散为矩阵方程。结果方程通常通过直接LU或迭代方法求解，迭代方法分别需要O(N3)或O(N2)个浮点运算。然而，如果实现FMM，复杂度降低到0 (N3/2)。此外，多级快速多极算法(MLFMA)作为一种多级FMM，可以进一步将复杂度降低到0 (NlogN)。这些方法有望为电磁学的大规模计算提供一条途径。本文研究了利用高性能计算GPU集群实现大规模散射问题的单级快速多极子方法。NVIDIA的计算统一设备架构(CUDA)和MVAPICH2编程环境在配备gpu的13节点集群上使用，并通过高带宽，低延迟的Infiniband网络相互连接。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Single level fast multipole method on GPU cluster for electromagnetic problems

Fast Multipole Method (FMM) is a mathematical technique which was developed to seek rapid solutions to integral equations of scattering for Helmholtz problems. For scattering problems, the integral equation is discretized into a matrix equation by the method of moments (MoM). The resultant equation is then typically solved by the direct LU, or an iterative method which requires O(N3) or O(N2) floating point operations respectively. However, if FMM is implemented, the complexity is reduced to O(N3/2). Moreover, the multilevel fast multipole algorithm (MLFMA) which is a multistage FMM can further reduce the complexity to O(NlogN). These methods are promising for providing a path to large scale computing in electromagnetics.In this paper, the exploitation of an HPC GPU cluster to implement single level fast multipole method for large scale scattering problems is investigated. The NVIDIA's Compute Unified Device Architecture (CUDA) and MVAPICH2 programming environments are utilized on a 13-node cluster equipped with GPUs and interconnected through a high bandwidth, low latency Infiniband network.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2013 US National Committee of URSI National Radio Science Meeting (USNC-URSI NRSM)

自引率

0.00%

发文量