Adapting a message-driven parallel application to GPU-accelerated clusters

2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis Pub Date : 2008-11-15 DOI:10.1109/SC.2008.5214716

James C. Phillips, J. Stone, K. Schulten

引用次数: 185

Abstract

Graphics processing units (GPUs) have become an attractive option for accelerating scientific computations as a result of advances in the performance and flexibility of GPU hardware, and due to the availability of GPU software development tools targeting general purpose and scientific computation. However, effective use of GPUs in clusters presents a number of application development and system integration challenges. We describe strategies for the decomposition and scheduling of computation among CPU cores and GPUs, and techniques for overlapping communication and CPU computation with GPU kernel execution. We report the adaptation of these techniques to NAMD, a widely-used parallel molecular dynamics simulation package, and present performance results for a 64-core 64-GPU cluster.

查看原文本刊更多论文

使消息驱动的并行应用程序适应gpu加速的集群

由于GPU硬件的性能和灵活性的进步，以及针对通用和科学计算的GPU软件开发工具的可用性，图形处理单元(GPU)已经成为加速科学计算的一个有吸引力的选择。然而，在集群中有效使用gpu提出了许多应用程序开发和系统集成方面的挑战。我们描述了CPU内核和GPU之间计算的分解和调度策略，以及与GPU内核执行重叠通信和CPU计算的技术。我们报告了这些技术对NAMD(一个广泛使用的并行分子动力学模拟包)的适应，并给出了64核64 gpu集群的性能结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis

自引率

0.00%

发文量