{"title":"Cph CT Toolbox: A performance evaluation","authors":"J. Bardino, Martin Rehr, B. Vinter","doi":"10.1109/HPCSim.2015.7237019","DOIUrl":null,"url":null,"abstract":"With the first version of the Cph CT Toolbox released and introduced, we turn to intensively evaluating the performance of the FDK and Katsevich reconstruction implementations in the second major release. The evaluation focuses on comparisons between different hardware platforms from the two major GPU compute vendors, AMD and NVIDIA, using our updated CUDA and new OpenCL implementations. Such a performance comparison is in itself interesting in a narrow CT scanning and reconstruction perspective, but it also sheds some light on the performance of those AMD and NVIDIA platforms and GPU technologies: something of general interest to anyone building or considering GPU solutions for their scientific calculations. Results from the best system reveals the chosen streaming strategy to scale linearly up to problem sizes one order of magnitude larger than the available GPU memory, and with only a minor scaling decrease when increasing the problem size further to the next order of magnitude.","PeriodicalId":134009,"journal":{"name":"2015 International Conference on High Performance Computing & Simulation (HPCS)","volume":"605 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on High Performance Computing & Simulation (HPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCSim.2015.7237019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
With the first version of the Cph CT Toolbox released and introduced, we turn to intensively evaluating the performance of the FDK and Katsevich reconstruction implementations in the second major release. The evaluation focuses on comparisons between different hardware platforms from the two major GPU compute vendors, AMD and NVIDIA, using our updated CUDA and new OpenCL implementations. Such a performance comparison is in itself interesting in a narrow CT scanning and reconstruction perspective, but it also sheds some light on the performance of those AMD and NVIDIA platforms and GPU technologies: something of general interest to anyone building or considering GPU solutions for their scientific calculations. Results from the best system reveals the chosen streaming strategy to scale linearly up to problem sizes one order of magnitude larger than the available GPU memory, and with only a minor scaling decrease when increasing the problem size further to the next order of magnitude.