J. M. Taboada, L. Landesa, F. Obelleiro, J. L. Rodríguez, J. Bértolo, J. C. Mouriño, A. Gómez
{"title":"Parallel FMM-FFT solver for the analysis of hundreds of millions of unknowns","authors":"J. M. Taboada, L. Landesa, F. Obelleiro, J. L. Rodríguez, J. Bértolo, J. C. Mouriño, A. Gómez","doi":"10.1109/CEM.2009.5228111","DOIUrl":null,"url":null,"abstract":"An efficient parallel implementation of the fast multipole method (FMM) combined with the fast fourier transform (FFT) is presented. The good scaling properties of the FMM-FFT, combined with a careful parallelization strategy, has shown to be very effective when using large parallel high performance supercomputers. For the case of very large problems, with hundreds of millions of unknowns, a nested scheme has been derived that further reduces the memory consumption. A challenging problem with more than 0.5 billion unknowns has been solved using this implementation, which demonstrates the ability of the algorithm to take advantage of the availability of supercomputers for the analysis of large, leading-edge electromagnetic problems.","PeriodicalId":416029,"journal":{"name":"2009 Computational Electromagnetics International Workshop","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Computational Electromagnetics International Workshop","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CEM.2009.5228111","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
An efficient parallel implementation of the fast multipole method (FMM) combined with the fast fourier transform (FFT) is presented. The good scaling properties of the FMM-FFT, combined with a careful parallelization strategy, has shown to be very effective when using large parallel high performance supercomputers. For the case of very large problems, with hundreds of millions of unknowns, a nested scheme has been derived that further reduces the memory consumption. A challenging problem with more than 0.5 billion unknowns has been solved using this implementation, which demonstrates the ability of the algorithm to take advantage of the availability of supercomputers for the analysis of large, leading-edge electromagnetic problems.