Pierre Schweitzer, C. Mazel, D. Hill, C. Cârloganu
{"title":"Performance analysis with a memory-bound Monte Carlo simulation on Xeon Phi","authors":"Pierre Schweitzer, C. Mazel, D. Hill, C. Cârloganu","doi":"10.1109/HPCSim.2015.7237074","DOIUrl":null,"url":null,"abstract":"Physics simulations are known to be great resources exhausters (CPU, memory). Hardware acceleration can help reduce the need for CPU time and increase the available memory bandwidth. In this paper, we present the performance gain when running a memory-bound muon Monte Carlo simulation on an Intel Xeon Phi and an Intel Xeon CPU. We show how to increase performance on the Xeon Phi without modifying the Physics software frameworks we are using for our application. We investigate distributed simulations on multicore and manycore systems and also the impact of hyper-threading on performance. We extend this to a hybrid computing model, balancing the computing burden between both the manycore and multicore processors of a computing node. Finally, we improved memory usage on the Xeon Phi by sharing Kernel Memory pages using KSM, and we show that, using this approach, we can run 16% more simulation instances.","PeriodicalId":134009,"journal":{"name":"2015 International Conference on High Performance Computing & Simulation (HPCS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on High Performance Computing & Simulation (HPCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPCSim.2015.7237074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Physics simulations are known to be great resources exhausters (CPU, memory). Hardware acceleration can help reduce the need for CPU time and increase the available memory bandwidth. In this paper, we present the performance gain when running a memory-bound muon Monte Carlo simulation on an Intel Xeon Phi and an Intel Xeon CPU. We show how to increase performance on the Xeon Phi without modifying the Physics software frameworks we are using for our application. We investigate distributed simulations on multicore and manycore systems and also the impact of hyper-threading on performance. We extend this to a hybrid computing model, balancing the computing burden between both the manycore and multicore processors of a computing node. Finally, we improved memory usage on the Xeon Phi by sharing Kernel Memory pages using KSM, and we show that, using this approach, we can run 16% more simulation instances.