Compiling Python to a hybrid execution environment

GPGPU-3 Pub Date : 2010-03-14 DOI:10.1145/1735688.1735695

R. Garg, J. N. Amaral

引用次数: 36

Abstract

A new compilation framework enables the execution of numerical-intensive applications, written in Python, on a hybrid execution environment formed by a CPU and a GPU. This compiler automatically computes the set of memory locations that need to be transferred to the GPU, and produces the correct mapping between the CPU and the GPU address spaces. Thus, the programming model implements a virtual shared address space. This framework is implemented as a combination of unPython, an ahead-of-time compiler from Python/NumPy to the C programming language, and jit4GPU, a just-in-time compiler from C to the AMD CAL interface. Experimental evaluation demonstrates that for some benchmarks the generated GPU code is 50 times faster than generated OpenMP code. The GPU performance also compares favorably with optimized CPU BLAS code for single-precision computations in most cases.

查看原文本刊更多论文

将Python编译到混合执行环境

一个新的编译框架允许在由CPU和GPU组成的混合执行环境上执行用Python编写的数字密集型应用程序。该编译器自动计算需要传输到GPU的内存位置集，并生成CPU和GPU地址空间之间的正确映射。因此，编程模型实现了一个虚拟的共享地址空间。这个框架是由unPython(一个从Python/NumPy到C编程语言的提前编译器)和jit4GPU(一个从C到AMD CAL接口的即时编译器)的组合实现的。实验评估表明，在一些基准测试中，生成的GPU代码比生成的OpenMP代码快50倍。在大多数情况下，GPU性能也优于优化的CPU BLAS代码进行单精度计算。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

GPGPU-3

自引率

0.00%

发文量