Supporting GPU sharing in cloud environments with a transparent runtime consolidation framework

IEEE International Symposium on High-Performance Parallel Distributed Computing Pub Date : 2011-06-08 DOI:10.1145/1996130.1996160

Vignesh T. Ravi, M. Becchi, G. Agrawal, S. Chakradhar

{"title":"Supporting GPU sharing in cloud environments with a transparent runtime consolidation framework","authors":"Vignesh T. Ravi, M. Becchi, G. Agrawal, S. Chakradhar","doi":"10.1145/1996130.1996160","DOIUrl":null,"url":null,"abstract":"Driven by the emergence of GPUs as a major player in high performance computing and the rapidly growing popularity of cloud environments, GPU instances are now being offered by cloud providers. The use of GPUs in a cloud environment, however, is still at initial stages, and the challenge of making GPU a true shared resource in the cloud has not yet been addressed.\n This paper presents a framework to enable applications executing within virtual machines to transparently share one or more GPUs. Our contributions are twofold: we extend an open source GPU virtualization software to include efficient GPU sharing, and we propose solutions to the conceptual problem of GPU kernel consolidation. In particular, we introduce a method for computing the affinity score between two or more kernels, which provides an indication of potential performance improvements upon kernel consolidation. In addition, we explore molding as a means to achieve efficient GPU sharing also in the case of kernels with high or conflicting resource requirements. We use these concepts to develop an algorithm to efficiently map a set of kernels on a pair of GPUs. We extensively evaluate our framework using eight popular GPU kernels and two Fermi GPUs. We find that even when contention is high our consolidation algorithm is effective in improving the throughput, and that the runtime overhead of our framework is low.","PeriodicalId":330072,"journal":{"name":"IEEE International Symposium on High-Performance Parallel Distributed Computing","volume":"78 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"127","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE International Symposium on High-Performance Parallel Distributed Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1996130.1996160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 127

Abstract

Driven by the emergence of GPUs as a major player in high performance computing and the rapidly growing popularity of cloud environments, GPU instances are now being offered by cloud providers. The use of GPUs in a cloud environment, however, is still at initial stages, and the challenge of making GPU a true shared resource in the cloud has not yet been addressed. This paper presents a framework to enable applications executing within virtual machines to transparently share one or more GPUs. Our contributions are twofold: we extend an open source GPU virtualization software to include efficient GPU sharing, and we propose solutions to the conceptual problem of GPU kernel consolidation. In particular, we introduce a method for computing the affinity score between two or more kernels, which provides an indication of potential performance improvements upon kernel consolidation. In addition, we explore molding as a means to achieve efficient GPU sharing also in the case of kernels with high or conflicting resource requirements. We use these concepts to develop an algorithm to efficiently map a set of kernels on a pair of GPUs. We extensively evaluate our framework using eight popular GPU kernels and two Fermi GPUs. We find that even when contention is high our consolidation algorithm is effective in improving the throughput, and that the runtime overhead of our framework is low.

查看原文本刊更多论文

通过透明的运行时整合框架支持云环境中的GPU共享

GPU作为高性能计算的主要参与者的出现，以及云环境的迅速普及，推动了GPU实例现在由云提供商提供。然而，GPU在云环境中的使用仍处于初始阶段，使GPU成为云中的真正共享资源的挑战尚未得到解决。本文提出了一个框架，使在虚拟机内执行的应用程序能够透明地共享一个或多个gpu。我们的贡献是双重的:我们扩展了一个开源的GPU虚拟化软件，包括高效的GPU共享，我们提出了GPU内核整合的概念问题的解决方案。特别是，我们引入了一种计算两个或多个内核之间亲和度评分的方法，它提供了内核合并后潜在性能改进的指示。此外，我们还探讨了在具有高资源需求或资源冲突的内核的情况下，模塑作为实现高效GPU共享的一种手段。我们使用这些概念来开发一种算法，以有效地将一组内核映射到一对gpu上。我们使用八个流行的GPU内核和两个费米GPU广泛评估我们的框架。我们发现，即使在竞争激烈的情况下，我们的整合算法也能有效地提高吞吐量，并且我们框架的运行时开销也很低。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE International Symposium on High-Performance Parallel Distributed Computing

自引率

0.00%

发文量