{"title":"A Novel Acceleration Method for DGTD Algorithm on Sunway TaihuLight","authors":"Geng Chen, Lei Zhao, Wenhua Yu, Hu Ren, H. Fu","doi":"10.1109/APCAP.2018.8538209","DOIUrl":null,"url":null,"abstract":"In this paper, a novel acceleration method for dis-continuous Galerkin time domain (DGTD) algorithm on Sunway platform is proposed with a multi-level speedup technique. In the proposed method, the message passing interface (MPI) is used to connect each core-group (CG) and the registor level massaging passing interface (RL-MPI) is used to connect the slave core on each computing processing element (CPE) cluster, which efficiently overcomes the bottleneck caused by discrete memory accessing. Numerical results show that the performance of DGTD on SW26010 CPU can be dramatically improved by using RL-MPI.","PeriodicalId":198124,"journal":{"name":"2018 IEEE Asia-Pacific Conference on Antennas and Propagation (APCAP)","volume":"461 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Asia-Pacific Conference on Antennas and Propagation (APCAP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APCAP.2018.8538209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In this paper, a novel acceleration method for dis-continuous Galerkin time domain (DGTD) algorithm on Sunway platform is proposed with a multi-level speedup technique. In the proposed method, the message passing interface (MPI) is used to connect each core-group (CG) and the registor level massaging passing interface (RL-MPI) is used to connect the slave core on each computing processing element (CPE) cluster, which efficiently overcomes the bottleneck caused by discrete memory accessing. Numerical results show that the performance of DGTD on SW26010 CPU can be dramatically improved by using RL-MPI.