基于gpu的并行计算编程模型，具有可扩展性和抽象性

Spring conference on Computer graphics Pub Date : 2009-04-23 DOI:10.1145/1980462.1980484

B. Domonkos, G. Jakab

{"title":"基于gpu的并行计算编程模型，具有可扩展性和抽象性","authors":"B. Domonkos, G. Jakab","doi":"10.1145/1980462.1980484","DOIUrl":null,"url":null,"abstract":"In this paper, we present a multi-level programming model for recent GPU-based high performance computing systems. Involving cooperative stream threads and symmetric multiprocessing threads our model gives a computational framework that scales through multi-GPU environments to GPU-cluster systems. Instead of hiding the execution environment from the programmer using compiler extensions or metaprogramming techniques we aim a solution that both enables optimizations and provides abstract problem space mapping with code reusability and virtualization of hardware resources in order to decrease the programming effort. We evaluate an implementation of our model based on CUDA, OpenMP, and MPI2 technologies on a complex practical application scenario and discuss its performance scaling behavior.","PeriodicalId":235681,"journal":{"name":"Spring conference on Computer graphics","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A programming model for GPU-based parallel computing with scalability and abstraction\",\"authors\":\"B. Domonkos, G. Jakab\",\"doi\":\"10.1145/1980462.1980484\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present a multi-level programming model for recent GPU-based high performance computing systems. Involving cooperative stream threads and symmetric multiprocessing threads our model gives a computational framework that scales through multi-GPU environments to GPU-cluster systems. Instead of hiding the execution environment from the programmer using compiler extensions or metaprogramming techniques we aim a solution that both enables optimizations and provides abstract problem space mapping with code reusability and virtualization of hardware resources in order to decrease the programming effort. We evaluate an implementation of our model based on CUDA, OpenMP, and MPI2 technologies on a complex practical application scenario and discuss its performance scaling behavior.\",\"PeriodicalId\":235681,\"journal\":{\"name\":\"Spring conference on Computer graphics\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-04-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Spring conference on Computer graphics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1980462.1980484\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spring conference on Computer graphics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1980462.1980484","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

在本文中，我们提出了一个基于gpu的高性能计算系统的多级编程模型。通过协作流线程和对称多处理线程，我们的模型给出了一个可在多gpu环境下扩展到gpu集群系统的计算框架。我们没有使用编译器扩展或元编程技术向程序员隐藏执行环境，我们的目标是一个解决方案，它既支持优化，又提供抽象的问题空间映射，具有代码可重用性和硬件资源虚拟化，以减少编程工作。我们在一个复杂的实际应用场景中评估了基于CUDA、OpenMP和MPI2技术的模型的实现，并讨论了其性能扩展行为。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A programming model for GPU-based parallel computing with scalability and abstraction

In this paper, we present a multi-level programming model for recent GPU-based high performance computing systems. Involving cooperative stream threads and symmetric multiprocessing threads our model gives a computational framework that scales through multi-GPU environments to GPU-cluster systems. Instead of hiding the execution environment from the programmer using compiler extensions or metaprogramming techniques we aim a solution that both enables optimizations and provides abstract problem space mapping with code reusability and virtualization of hardware resources in order to decrease the programming effort. We evaluate an implementation of our model based on CUDA, OpenMP, and MPI2 technologies on a complex practical application scenario and discuss its performance scaling behavior.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Spring conference on Computer graphics

自引率

0.00%

发文量