V. G. Pinto, Vinicius Alves Herbstrith, L. Schnorr
{"title":"Replicating the Performance Evaluation of an N-Body Application on a Manycore Accelerator","authors":"V. G. Pinto, Vinicius Alves Herbstrith, L. Schnorr","doi":"10.1109/SBAC-PADW.2015.17","DOIUrl":null,"url":null,"abstract":"Reproducibility for High Performance Computing (HPC) systems has been discussed for some time already, but more work should be carried out to cover the latest accelerators that equip the fastest supercomputers such as the ones listed in Top500. In this paper, we perform a replication of a performance evaluation carried out using an N-Body Open MP parallel application on a XeonPhi accelerator. We also compare the obtained performance with a similar N-Body CUDA application. Besides encountering intriguing results about the Xeon Phi on the number of hardware threads, our comparison against Nvidia boards using the same load shows that the execution Xeon Phi is slower than on Nvidia K20 and GTX760 accelerators.","PeriodicalId":161685,"journal":{"name":"2015 International Symposium on Computer Architecture and High Performance Computing Workshop (SBAC-PADW)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Symposium on Computer Architecture and High Performance Computing Workshop (SBAC-PADW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SBAC-PADW.2015.17","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Reproducibility for High Performance Computing (HPC) systems has been discussed for some time already, but more work should be carried out to cover the latest accelerators that equip the fastest supercomputers such as the ones listed in Top500. In this paper, we perform a replication of a performance evaluation carried out using an N-Body Open MP parallel application on a XeonPhi accelerator. We also compare the obtained performance with a similar N-Body CUDA application. Besides encountering intriguing results about the Xeon Phi on the number of hardware threads, our comparison against Nvidia boards using the same load shows that the execution Xeon Phi is slower than on Nvidia K20 and GTX760 accelerators.