{"title":"基于GPU集群的单级快速多极方法求解电磁问题","authors":"V. Dang, Q. Nguyen, O. Kilic, E. El-Araby","doi":"10.1109/USNC-URSI-NRSM.2013.6525071","DOIUrl":null,"url":null,"abstract":"Fast Multipole Method (FMM) is a mathematical technique which was developed to seek rapid solutions to integral equations of scattering for Helmholtz problems. For scattering problems, the integral equation is discretized into a matrix equation by the method of moments (MoM). The resultant equation is then typically solved by the direct LU, or an iterative method which requires O(N3) or O(N2) floating point operations respectively. However, if FMM is implemented, the complexity is reduced to O(N3/2). Moreover, the multilevel fast multipole algorithm (MLFMA) which is a multistage FMM can further reduce the complexity to O(NlogN). These methods are promising for providing a path to large scale computing in electromagnetics.In this paper, the exploitation of an HPC GPU cluster to implement single level fast multipole method for large scale scattering problems is investigated. The NVIDIA's Compute Unified Device Architecture (CUDA) and MVAPICH2 programming environments are utilized on a 13-node cluster equipped with GPUs and interconnected through a high bandwidth, low latency Infiniband network.","PeriodicalId":123571,"journal":{"name":"2013 US National Committee of URSI National Radio Science Meeting (USNC-URSI NRSM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Single level fast multipole method on GPU cluster for electromagnetic problems\",\"authors\":\"V. Dang, Q. Nguyen, O. Kilic, E. El-Araby\",\"doi\":\"10.1109/USNC-URSI-NRSM.2013.6525071\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fast Multipole Method (FMM) is a mathematical technique which was developed to seek rapid solutions to integral equations of scattering for Helmholtz problems. For scattering problems, the integral equation is discretized into a matrix equation by the method of moments (MoM). The resultant equation is then typically solved by the direct LU, or an iterative method which requires O(N3) or O(N2) floating point operations respectively. However, if FMM is implemented, the complexity is reduced to O(N3/2). Moreover, the multilevel fast multipole algorithm (MLFMA) which is a multistage FMM can further reduce the complexity to O(NlogN). These methods are promising for providing a path to large scale computing in electromagnetics.In this paper, the exploitation of an HPC GPU cluster to implement single level fast multipole method for large scale scattering problems is investigated. The NVIDIA's Compute Unified Device Architecture (CUDA) and MVAPICH2 programming environments are utilized on a 13-node cluster equipped with GPUs and interconnected through a high bandwidth, low latency Infiniband network.\",\"PeriodicalId\":123571,\"journal\":{\"name\":\"2013 US National Committee of URSI National Radio Science Meeting (USNC-URSI NRSM)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 US National Committee of URSI National Radio Science Meeting (USNC-URSI NRSM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/USNC-URSI-NRSM.2013.6525071\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 US National Committee of URSI National Radio Science Meeting (USNC-URSI NRSM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/USNC-URSI-NRSM.2013.6525071","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Single level fast multipole method on GPU cluster for electromagnetic problems
Fast Multipole Method (FMM) is a mathematical technique which was developed to seek rapid solutions to integral equations of scattering for Helmholtz problems. For scattering problems, the integral equation is discretized into a matrix equation by the method of moments (MoM). The resultant equation is then typically solved by the direct LU, or an iterative method which requires O(N3) or O(N2) floating point operations respectively. However, if FMM is implemented, the complexity is reduced to O(N3/2). Moreover, the multilevel fast multipole algorithm (MLFMA) which is a multistage FMM can further reduce the complexity to O(NlogN). These methods are promising for providing a path to large scale computing in electromagnetics.In this paper, the exploitation of an HPC GPU cluster to implement single level fast multipole method for large scale scattering problems is investigated. The NVIDIA's Compute Unified Device Architecture (CUDA) and MVAPICH2 programming environments are utilized on a 13-node cluster equipped with GPUs and interconnected through a high bandwidth, low latency Infiniband network.