Complexity of preprocessor in MPM data compression system

Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225) Pub Date: 1998-03-30 DOI: 10.1109/DCC.1998.672292

J. Kieffer, E. Yang, T. Park, S. Yakowitz



Abstract

Summary form only given. The multilevel pattern matching (MPM) data compression system is one of a class of compression algorithms introduced by Kieffer and Yang (see ERA Amer. Math. Soc., vol. 3, pp. 11-16, 1997). The MPM system is currently of interest because of its good redundancy performance in losslessly compressing data strings of arbitrary length over a finite alphabet. An MPM system consists of a preprocessor, an encoder/decoder, and a reconstruction engine. The preprocessor detects matching patterns in the input data string (substrings of the data appearing in two or more nonoverlapping positions). The preprocessor operates at several levels sequentially, with the number of levels selected by the user. The matching patterns detected at each level are of a fixed length which decreases by a constant factor from level to level, until this fixed length becomes one at the final level. The preprocessor represents information about matching patterns at each level as a string of tokens which is passed to the encoder of the MPM system. The decoder of the MPM system recovers these token strings, from which the reconstruction engine rebuilds the input data string. The preprocessor is the most complex component of the MPM system. We exhibit an implementation of the preprocessor of linear complexity in terms of execution time and space requirements; the number of levels satisfies $O(\log_2 \log_2 n)$ for input data strings of length $n$.
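The level-by-level behaviour described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the token format, how the input is partitioned, or how the linear-complexity implementation works, so everything below is an assumption made for illustration: the input length is taken to be an exact power of the hypothetical factor `r`, each level splits the surviving blocks into `r` aligned pieces, a first occurrence is marked with the hypothetical token `NEW`, and a repeat is marked with the index of its first occurrence. The names `mpm_preprocess` and `mpm_reconstruct` are likewise hypothetical. The sketch makes no attempt to match the complexity reported by the authors.

```python
NEW = "S"  # hypothetical token: "first occurrence, described at the next level"


def mpm_preprocess(data, r=2):
    """Return one token list per level (illustrative sketch, not the paper's method)."""
    if r < 2:
        raise ValueError("r must be at least 2")
    n = len(data)
    length = 1
    while length < n:
        length *= r
    # Assumption: n is an exact power of r, so every block splits evenly.
    if n < r or length != n:
        raise ValueError("len(data) must be a power of r, at least r")

    blocks, levels = [data], []
    while length > 1:
        length //= r  # the fixed pattern length shrinks by the factor r
        # Split every block that still needs describing into r aligned pieces.
        pieces = [b[i:i + length] for b in blocks for i in range(0, len(b), length)]
        if length == 1:
            levels.append(pieces)  # final level: emit the raw symbols
            break
        first_seen, tokens, blocks = {}, [], []
        for p in pieces:
            if p in first_seen:
                tokens.append(first_seen[p])  # repeat: index of first occurrence
            else:
                first_seen[p] = len(blocks)
                blocks.append(p)  # only first occurrences go to the next level
                tokens.append(NEW)
        levels.append(tokens)
    return levels


def mpm_reconstruct(levels, r=2):
    """Rebuild the input string from the per-level token lists."""
    pieces = list(levels[-1])
    for tokens in reversed(levels[:-1]):
        # Every r consecutive lower-level pieces form one distinct block here.
        distinct = ["".join(pieces[i:i + r]) for i in range(0, len(pieces), r)]
        fresh = iter(distinct)
        pieces = [next(fresh) if t == NEW else distinct[t] for t in tokens]
    return "".join(pieces)


if __name__ == "__main__":
    token_levels = mpm_preprocess("abababab")
    print(token_levels)
    print(mpm_reconstruct(token_levels))
```

Under these assumptions, `mpm_reconstruct` plays the role of the reconstruction engine: it consumes the same token lists that the encoder would receive and rebuilds the input, which gives a simple round-trip check of the preprocessing step.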

