Multiple-dictionary compression using partial matching

Proceedings DCC '95 Data Compression Conference. Pub Date: 1995-03-28. DOI: 10.1109/DCC.1995.515517

Dzung T. Hoang, Philip M. Long, J. Vitter

Citations: 6

Abstract

Motivated by the desire to find text compressors that compress better than existing dictionary methods, but run faster than PPM implementations, we describe methods for text compression using multiple dictionaries, one for each context of preceding characters, where the contexts have varying lengths. The context to be used is determined using an escape mechanism similar to that of PPM methods. We describe modifications of three popular dictionary coders along these lines and experiments evaluating their efficacy using the text files in the Calgary corpus. Our results suggest that modifying LZ77 along these lines yields an improvement in compression of about 4%, that modifying LZFG yields a compression improvement of about 8%, and that modifying LZW in this manner yields an average improvement on the order of 12%.
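To make the idea of per-context dictionaries with an escape concrete, here is a toy sketch in Python. It is not the authors' coder, and every name in it (`ESC`, `encode`, `decode`, `_learn`, `_longest_match`) is hypothetical. It assumes only two context lengths: the single preceding byte (order 1), and the empty context (order 0), whose dictionary starts with all 256 single bytes. It also assumes an LZW-style rule in which the previous phrase, extended by the first byte of the current phrase, is added to the dictionary of the context where the previous phrase began. Output is a list of integer tokens rather than a bit stream, so it illustrates the control flow only, not the compression ratios reported above.

```python
from collections import defaultdict

ESC = -1  # hypothetical escape token: "no match in this context, fall back to order 0"


def _new_dicts():
    # One phrase list per context key. The order-0 key is b"" and is
    # seeded with every single byte, so a match there always exists.
    dicts = defaultdict(list)
    dicts[b""] = [bytes([b]) for b in range(256)]
    return dicts


def _longest_match(phrases, data, i):
    # Index of the longest phrase that is a prefix of data[i:], or -1.
    best = -1
    for idx, p in enumerate(phrases):
        if data.startswith(p, i) and (best < 0 or len(p) > len(phrases[best])):
            best = idx
    return best


def _learn(dicts, prev, phrase):
    # Assumed update rule: previous phrase + first byte of the current
    # phrase goes into the dictionary of the previous phrase's context.
    if prev is not None:
        ctx, q = prev
        new = q + phrase[:1]
        if new not in dicts[ctx]:
            dicts[ctx].append(new)


def encode(data: bytes):
    dicts, tokens, prev, i = _new_dicts(), [], None, 0
    while i < len(data):
        ctx = data[i - 1:i] if i else b""  # preceding byte, or b"" at the start
        idx = _longest_match(dicts[ctx], data, i)
        if idx < 0:
            tokens.append(ESC)  # escape to the shorter (empty) context
            idx = _longest_match(dicts[b""], data, i)
            phrase = dicts[b""][idx]
        else:
            phrase = dicts[ctx][idx]
        tokens.append(idx)
        _learn(dicts, prev, phrase)  # update only after the match is chosen
        prev = (ctx, phrase)
        i += len(phrase)
    return tokens


def decode(tokens):
    dicts, out, prev = _new_dicts(), bytearray(), None
    it = iter(tokens)
    for tok in it:
        ctx = bytes(out[-1:])  # same context the encoder saw
        if tok == ESC:
            phrase = dicts[b""][next(it)]
        else:
            phrase = dicts[ctx][tok]
        _learn(dicts, prev, phrase)
        prev = (ctx, phrase)
        out += phrase
    return bytes(out)
```

Because the encoder updates its dictionaries only after it has chosen the current phrase, the decoder can replay the same updates in the same order and never sees an index it has not yet defined. A round trip such as `decode(encode(b"abracadabra abracadabra"))` should return the original bytes.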

