{"title":"Terascale Spectral Element Algorithms and Implementations","authors":"H. Tufo, P. Fischer","doi":"10.1145/331532.331599","DOIUrl":null,"url":null,"abstract":"We describe the development and implementation of an efficient spectral element code for multimillion gridpoint simulations of incompressible flows in general two- and three-dimensional domains. Key to this effort has been the development of scalable solvers for elliptic problems and a stabilization scheme that admits full use of the method’s high-order accuracy. We review these and other recently developed algorithmic underpinnings that have resulted in good parallel and vector performance on a broad range of architectures and that, with sustained performance of 319 GFLOPS on 2048 nodes of the Intel ASCI-Red machine at Sandia, readies us for the multithousand node terascale computing systems now coming on line at the DOE labs.","PeriodicalId":354898,"journal":{"name":"ACM/IEEE SC 1999 Conference (SC'99)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"137","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM/IEEE SC 1999 Conference (SC'99)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/331532.331599","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 137
Abstract
We describe the development and implementation of an efficient spectral element code for multimillion gridpoint simulations of incompressible flows in general two- and three-dimensional domains. Key to this effort has been the development of scalable solvers for elliptic problems and a stabilization scheme that admits full use of the method’s high-order accuracy. We review these and other recently developed algorithmic underpinnings that have resulted in good parallel and vector performance on a broad range of architectures and that, with sustained performance of 319 GFLOPS on 2048 nodes of the Intel ASCI-Red machine at Sandia, readies us for the multithousand node terascale computing systems now coming on line at the DOE labs.