On Lempel-Ziv Decompression in Small Space

2019 Data Compression Conference (DCC) Pub Date : 2019-03-01 DOI:10.1109/DCC.2019.00030

S. Puglisi, Massimiliano Rossi

引用次数: 3

Abstract

Lempel-Ziv (LZ77) parsing is a powerful tool for data compression that has been the subject of intense research in the past 40 years and is now used in popular and widely-used compression software and as part of larger software systems. In this paper we study algorithms to efficiently decompress strings from the LZ parsing that use working memory proportional to the size, z, of the parsing itself, not that of the output string, n, as is the usual case. The only work we are aware of on this problem is recent and due to Bille~et~al. who describe an algorithm using O(n log^δσ time and O(z log^1 - δσ) space for any 0 ≤ δ ≤ 1. We provide the first implementation and experimental analysis of Bille~et~al.'s approach. Our results show that this approach, when implemented as described, is extremely slow in practice compared to the naive decompression algorithm, and uses lots of space. To remedy this we introduce several novel optimizations that drastically improve performance and lead to relevant space-time tradeoffs in practice on all datasets we tested.

查看原文本刊更多论文

小空间的Lempel-Ziv解压缩

Lempel-Ziv (LZ77)解析是一种功能强大的数据压缩工具，在过去40年中一直是研究的重点，现在被广泛应用于流行的压缩软件中，并作为大型软件系统的一部分。在本文中，我们研究了从LZ解析中有效解压缩字符串的算法，这些算法使用的工作内存与解析本身的大小z成正比，而不是与通常情况下的输出字符串n成正比。我们所知道的关于这个问题的唯一工作是最近的，是由比尔等人完成的。对任意0≤δ≤1，用O(n log^δσ)时间和O(z log^1 - δσ)空间描述了一个算法。我们提供了Bille等人的首次实现和实验分析。的方法。我们的结果表明，当按照描述的方法实现时，与朴素的解压缩算法相比，这种方法在实践中非常慢，并且使用了大量的空间。为了解决这个问题，我们引入了几个新的优化，这些优化大大提高了性能，并在我们测试的所有数据集上实现了相关的时空权衡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 Data Compression Conference (DCC)

自引率

0.00%

发文量