Accelerated high-performance computing through efficient multi-process GPU resource sharing

Teng Li, Vikram K. Narayana, T. El-Ghazawi
{"title":"Accelerated high-performance computing through efficient multi-process GPU resource sharing","authors":"Teng Li, Vikram K. Narayana, T. El-Ghazawi","doi":"10.1145/2212908.2212950","DOIUrl":null,"url":null,"abstract":"The HPC field is witnessing a widespread adoption of GPUs as accelerators for traditional homogeneous HPC systems. One of the prevalent parallel programming models is the SPMD paradigm, which has been adapted for GPU-based parallel processing. Since each process executes the same program under SPMD, every process mapped to a CPU core also needs the GPU availability. Therefore SPMD demands a symmetric CPU/GPU distribution. However, since modern HPC systems feature a large number of CPU cores that outnumber the number of GPUs, computing resources are generally underutilized with SPMD. Our previous efforts have focused on GPU virtualization that enables efficient sharing of GPU among multiple CPU processes. Nevertheless, a formal method to evaluate and choose the appropriate GPU sharing approach is still lacking. In this paper, based on SPMD GPU kernel profiles, we propose different multi-process GPU sharing scenarios under virtualization. We introduce an analytical model that captures these sharing scenarios and provides a theoretical performance gain estimation. Benchmarks validate our analyses and achievable performance gains. While our analytical study provides a suitable theoretical foundation for GPU sharing, the experimental results demonstrate that GPU virtualization affords significant performance improvements over the non-virtualized solutions for all proposed sharing scenarios.","PeriodicalId":430420,"journal":{"name":"ACM International Conference on Computing Frontiers","volume":"81 16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM International Conference on Computing Frontiers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2212908.2212950","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 6

Abstract

The HPC field is witnessing widespread adoption of GPUs as accelerators for traditional homogeneous HPC systems. One of the prevalent parallel programming models is the SPMD paradigm, which has been adapted for GPU-based parallel processing. Since each process executes the same program under SPMD, every process mapped to a CPU core also requires access to a GPU; SPMD therefore demands a symmetric distribution of CPUs and GPUs. However, because modern HPC systems feature CPU cores that far outnumber the available GPUs, computing resources are generally underutilized under SPMD. Our previous efforts have focused on GPU virtualization, which enables efficient sharing of a GPU among multiple CPU processes. Nevertheless, a formal method to evaluate and choose the appropriate GPU sharing approach is still lacking. In this paper, based on SPMD GPU kernel profiles, we propose different multi-process GPU sharing scenarios under virtualization. We introduce an analytical model that captures these sharing scenarios and provides a theoretical estimate of the performance gain. Benchmarks validate our analyses and the achievable performance gains. While our analytical study provides a suitable theoretical foundation for GPU sharing, the experimental results demonstrate that GPU virtualization affords significant performance improvements over non-virtualized solutions for all proposed sharing scenarios.
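To make the resource mismatch concrete, below is a minimal sketch (not the authors' virtualization framework) of the plain SPMD setup the paper starts from: every MPI rank needs a GPU, so when ranks outnumber devices the usual fallback is a round-robin mapping in which several CPU processes contend for the same GPU. The MPI + CUDA combination, the `saxpy` kernel, and the rank-modulo device assignment are illustrative assumptions, not taken from the paper.

```cuda
// Hypothetical SPMD example (illustrative only): each MPI rank offloads the
// same kernel to a GPU. When ranks outnumber GPUs, rank % ngpus forces
// several ranks to time-share one device -- the underutilization scenario
// that motivates virtualization-based GPU sharing.
#include <mpi.h>
#include <cuda_runtime.h>
#include <cstdio>

__global__ void saxpy(int n, float a, const float *x, float *y) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) y[i] = a * x[i] + y[i];
}

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank = 0, nranks = 1;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nranks);

    int ngpus = 0;
    cudaGetDeviceCount(&ngpus);
    if (ngpus == 0) { MPI_Finalize(); return 1; }

    // Symmetric SPMD assumption: every rank gets "a GPU". With more ranks
    // than GPUs this degenerates into uncoordinated sharing of each device.
    int dev = rank % ngpus;
    cudaSetDevice(dev);

    const int n = 1 << 20;
    float *x = nullptr, *y = nullptr;
    cudaMalloc(&x, n * sizeof(float));
    cudaMalloc(&y, n * sizeof(float));
    cudaMemset(x, 0, n * sizeof(float));
    cudaMemset(y, 0, n * sizeof(float));

    saxpy<<<(n + 255) / 256, 256>>>(n, 2.0f, x, y);
    cudaDeviceSynchronize();
    printf("rank %d of %d ran on GPU %d of %d\n", rank, nranks, dev, ngpus);

    cudaFree(x);
    cudaFree(y);
    MPI_Finalize();
    return 0;
}
```

Launching this with, say, `mpirun -np 16` on a node with 2 GPUs puts 8 ranks on each device with no coordination; the paper's virtualization layer and analytical model address how such sharing should be organized and what performance gain it can be expected to yield.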