Performance analysis with a memory-bound Monte Carlo simulation on Xeon Phi

2015 International Conference on High Performance Computing & Simulation (HPCS) Pub Date : 2015-07-20 DOI:10.1109/HPCSim.2015.7237074

Pierre Schweitzer, C. Mazel, D. Hill, C. Cârloganu

引用次数: 1

Abstract

Physics simulations are known to be great resources exhausters (CPU, memory). Hardware acceleration can help reduce the need for CPU time and increase the available memory bandwidth. In this paper, we present the performance gain when running a memory-bound muon Monte Carlo simulation on an Intel Xeon Phi and an Intel Xeon CPU. We show how to increase performance on the Xeon Phi without modifying the Physics software frameworks we are using for our application. We investigate distributed simulations on multicore and manycore systems and also the impact of hyper-threading on performance. We extend this to a hybrid computing model, balancing the computing burden between both the manycore and multicore processors of a computing node. Finally, we improved memory usage on the Xeon Phi by sharing Kernel Memory pages using KSM, and we show that, using this approach, we can run 16% more simulation instances.

查看原文本刊更多论文

在Xeon Phi处理器上使用内存绑定蒙特卡罗模拟进行性能分析

物理模拟被认为是巨大的资源消耗者(CPU，内存)。硬件加速可以帮助减少对CPU时间的需求，并增加可用内存带宽。在本文中，我们展示了在Intel Xeon Phi和Intel Xeon CPU上运行内存绑定μ子蒙特卡罗模拟时的性能增益。我们展示了如何在不修改我们应用程序使用的物理软件框架的情况下提高Xeon Phi的性能。我们研究了多核和多核系统上的分布式模拟，以及超线程对性能的影响。我们将其扩展到混合计算模型，在计算节点的多核和多核处理器之间平衡计算负担。最后，我们通过使用KSM共享内核内存页面来改善Xeon Phi处理器的内存使用情况，并且我们表明，使用这种方法，我们可以多运行16%的模拟实例。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 International Conference on High Performance Computing & Simulation (HPCS)

自引率

0.00%

发文量