CULZSS: LZSS Lossless Data Compression on CUDA

2011 IEEE International Conference on Cluster Computing Pub Date : 2011-09-26 DOI:10.1109/CLUSTER.2011.52

Adnan Ozsoy, D. M. Swany

{"title":"CULZSS: LZSS Lossless Data Compression on CUDA","authors":"Adnan Ozsoy, D. M. Swany","doi":"10.1109/CLUSTER.2011.52","DOIUrl":null,"url":null,"abstract":"Increasing needs in efficient storage management and better utilization of network bandwidth with less data transfer have led the computing community to consider data compression as a solution. However, compression introduces extra overhead and performance can suffer. The key elements in making the decision to use compression are execution time and compression ratio. Due to negative performance impact, compression is often neglected. General purpose computing on graphic processing units (GPUs) introduces new opportunities where parallelism is available. Our work targets the use of opportunities in GPU based systems by exploiting parallelism in compression algorithms. In this paper we present an implementation of the Lempel-Ziv-Storer-Szymanski (LZSS) loss less data compression algorithm by using NVIDIA GPUs Compute Unified Device Architecture (CUDA) Framework. Our implementation of the LZSS algorithm on GPUs significantly improves the performance of the compression process compared to CPU based implementation without any loss in compression ratio. This can support GPU based clusters in solving application bandwidth problems. Our system outperforms the serial CPU LZSS implementation by up to 18x, the parallel threaded version up to 3x and the BZIP2 program by up to 6x in terms of compression time, showing the promise of CUDA systems in loss less data compression. To give the programmers an easy to use tool, our work also provides an API for in memory compression without the need for reading from and writing to files, in addition to the version involving I/O.","PeriodicalId":200830,"journal":{"name":"2011 IEEE International Conference on Cluster Computing","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"60","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Conference on Cluster Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLUSTER.2011.52","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 60

Abstract

Increasing needs in efficient storage management and better utilization of network bandwidth with less data transfer have led the computing community to consider data compression as a solution. However, compression introduces extra overhead and performance can suffer. The key elements in making the decision to use compression are execution time and compression ratio. Due to negative performance impact, compression is often neglected. General purpose computing on graphic processing units (GPUs) introduces new opportunities where parallelism is available. Our work targets the use of opportunities in GPU based systems by exploiting parallelism in compression algorithms. In this paper we present an implementation of the Lempel-Ziv-Storer-Szymanski (LZSS) loss less data compression algorithm by using NVIDIA GPUs Compute Unified Device Architecture (CUDA) Framework. Our implementation of the LZSS algorithm on GPUs significantly improves the performance of the compression process compared to CPU based implementation without any loss in compression ratio. This can support GPU based clusters in solving application bandwidth problems. Our system outperforms the serial CPU LZSS implementation by up to 18x, the parallel threaded version up to 3x and the BZIP2 program by up to 6x in terms of compression time, showing the promise of CUDA systems in loss less data compression. To give the programmers an easy to use tool, our work also provides an API for in memory compression without the need for reading from and writing to files, in addition to the version involving I/O.

查看原文本刊更多论文

CULZSS:基于CUDA的LZSS无损数据压缩

对高效存储管理和更好地利用网络带宽以及更少的数据传输的日益增长的需求使得计算界考虑将数据压缩作为一种解决方案。但是，压缩会带来额外的开销，性能也会受到影响。决定使用压缩的关键因素是执行时间和压缩比。由于对性能的负面影响，压缩常常被忽略。图形处理单元(gpu)上的通用计算引入了并行性可用的新机会。我们的工作目标是通过利用压缩算法中的并行性来利用基于GPU的系统中的机会。本文提出了一种基于NVIDIA gpu计算统一设备架构(CUDA)框架的leppel - ziv - store - szymanski (LZSS)无丢失数据压缩算法。与基于CPU的实现相比，我们在gpu上实现的LZSS算法显著提高了压缩过程的性能，而压缩比没有任何损失。这可以支持基于GPU的集群解决应用程序带宽问题。我们的系统在压缩时间方面比串行CPU LZSS实现高出18倍，并行线程版本高达3倍，BZIP2程序高达6倍，显示了CUDA系统在减少数据压缩方面的承诺。为了给程序员提供一个易于使用的工具，我们的工作还提供了一个无需读写文件的内存压缩API，以及涉及I/O的版本。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 IEEE International Conference on Cluster Computing

自引率

0.00%

发文量