{"title":"GPU Parallelization of Wave Equation Based Discontinuous Galerkin Time Domain Method","authors":"Z. Ban, Yan Shi","doi":"10.23919/ACES48530.2019.9060656","DOIUrl":null,"url":null,"abstract":"In this paper, a GPU-accelerated improved discontinuous Galerkin time domain scheme based on Helmholtz vector wave equation (GPU DGTD-WE) has been proposed. By mapping FEM meshes to CUDA multi-layer parallel architectures and implementing some optimization operations, good acceleration performance can be obtained. By using matrix projection between the matrices in the global coordinates to universal matrices in local ones, high memory compression rate can be achieved for highly efficient solution of large-scale EM problems. Numerical examples including a substrate integrated waveguide filter and a missile-loaded microstrip patch antenna array are given to demonstrate the good performance of the proposed method.","PeriodicalId":247909,"journal":{"name":"2019 International Applied Computational Electromagnetics Society Symposium - China (ACES)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Applied Computational Electromagnetics Society Symposium - China (ACES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ACES48530.2019.9060656","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, a GPU-accelerated improved discontinuous Galerkin time domain scheme based on Helmholtz vector wave equation (GPU DGTD-WE) has been proposed. By mapping FEM meshes to CUDA multi-layer parallel architectures and implementing some optimization operations, good acceleration performance can be obtained. By using matrix projection between the matrices in the global coordinates to universal matrices in local ones, high memory compression rate can be achieved for highly efficient solution of large-scale EM problems. Numerical examples including a substrate integrated waveguide filter and a missile-loaded microstrip patch antenna array are given to demonstrate the good performance of the proposed method.