A parallel Huffman coder on the CUDA architecture

2014 IEEE Visual Communications and Image Processing Conference Pub Date : 2014-12-01 DOI:10.1109/VCIP.2014.7051566

Habibelahi Rahmani, C. Topal, C. Akinlar

引用次数: 11

Abstract

We present a parallel implementation of the widely-used entropy encoding algorithm, the Huffman coder, on the NVIDIA CUDA architecture. After constructing the Huffman codeword tree serially, we proceed in parallel by generating a byte stream where each byte represents a single bit of the compressed output stream. The final step is then to combine each consecutive 8 bytes into a single byte in parallel to generate the final compressed output bit stream. Experimental results show that we can achieve up to 22× speedups compared to the serial CPU implementation without any constraint on the maximum codeword length or data entropy.

查看原文本刊更多论文

CUDA架构上的并行霍夫曼编码器

我们提出了一个在NVIDIA CUDA架构上并行实现广泛使用的熵编码算法，霍夫曼编码器。在连续构造霍夫曼码字树之后，我们通过生成字节流并行进行，其中每个字节代表压缩输出流的单个位。最后一步是将每个连续的8个字节并行地组合成一个字节，以生成最终的压缩输出位流。实验结果表明，在不受最大码字长度或数据熵限制的情况下，与串行CPU实现相比，我们可以实现高达22倍的速度提升。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 IEEE Visual Communications and Image Processing Conference

自引率

0.00%

发文量