JPEG optimization using an entropy-constrained quantization framework

Proceedings DCC '95 Data Compression Conference Pub Date : 1995-03-28 DOI:10.1109/DCC.1995.515524

M. Crouse, K. Ramchandran

{"title":"JPEG optimization using an entropy-constrained quantization framework","authors":"M. Crouse, K. Ramchandran","doi":"10.1109/DCC.1995.515524","DOIUrl":null,"url":null,"abstract":"Previous works, including adaptive quantizer selection and adaptive coefficient thresholding, have addressed the optimization of a baseline-decodable JPEG coder in a rate-distortion (R-D) sense. In this work, by developing an entropy-constrained quantization framework, we show that these previous works do not fully realize the attainable coding gain, and then formulate a computationally efficient way that attempts to fully realize this gain for baseline-JPEG-decodable systems. Interestingly, we find that the gains obtained using the previous algorithms are almost additive. The framework involves viewing a scalar-quantized system with fixed quantizers as a special type of vector quantizer (VQ), and then to use techniques akin to entropy-constrained vector quantization (ECVQ) to optimize the system. In the JPEG case, a computationally efficient algorithm can be derived, without training, by jointly performing coefficient thresholding, quantizer selection, and Huffman table customization, all compatible with the baseline JPEG syntax. Our algorithm achieves significant R-D improvement over standard JPEG (about 2 dB for typical images) with performance comparable to that of more complex \"state-of-the-art\" coders. For example, for the Lenna image coded at 1.0 bits per pixel, our JPEG-compatible coder achieves a PSNR of 39.6 dB, which even slightly exceeds the published performance of Shapiro's wavelet coder. Although PSNR does not guarantee subjective performance, our algorithm can be applied with a flexible range of visually-based distortion metrics.","PeriodicalId":107017,"journal":{"name":"Proceedings DCC '95 Data Compression Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings DCC '95 Data Compression Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DCC.1995.515524","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 15

Abstract

Previous works, including adaptive quantizer selection and adaptive coefficient thresholding, have addressed the optimization of a baseline-decodable JPEG coder in a rate-distortion (R-D) sense. In this work, by developing an entropy-constrained quantization framework, we show that these previous works do not fully realize the attainable coding gain, and then formulate a computationally efficient way that attempts to fully realize this gain for baseline-JPEG-decodable systems. Interestingly, we find that the gains obtained using the previous algorithms are almost additive. The framework involves viewing a scalar-quantized system with fixed quantizers as a special type of vector quantizer (VQ), and then to use techniques akin to entropy-constrained vector quantization (ECVQ) to optimize the system. In the JPEG case, a computationally efficient algorithm can be derived, without training, by jointly performing coefficient thresholding, quantizer selection, and Huffman table customization, all compatible with the baseline JPEG syntax. Our algorithm achieves significant R-D improvement over standard JPEG (about 2 dB for typical images) with performance comparable to that of more complex "state-of-the-art" coders. For example, for the Lenna image coded at 1.0 bits per pixel, our JPEG-compatible coder achieves a PSNR of 39.6 dB, which even slightly exceeds the published performance of Shapiro's wavelet coder. Although PSNR does not guarantee subjective performance, our algorithm can be applied with a flexible range of visually-based distortion metrics.

查看原文本刊更多论文

使用熵约束量化框架的JPEG优化

以前的工作，包括自适应量化器选择和自适应系数阈值，已经在率失真(R-D)意义上解决了基线可解码JPEG编码器的优化问题。在这项工作中，通过开发一个熵约束的量化框架，我们表明这些先前的工作并没有完全实现可实现的编码增益，然后制定了一个计算效率的方法，试图完全实现基线- jpeg可解码系统的这种增益。有趣的是，我们发现使用先前算法获得的增益几乎是加性的。该框架包括将带有固定量化器的标量量化系统视为一种特殊类型的矢量量化器(VQ)，然后使用类似于熵约束矢量量化(ECVQ)的技术来优化系统。在JPEG的情况下，通过联合执行系数阈值、量化器选择和Huffman表定制，可以推导出计算效率高的算法，而无需训练，所有这些都与基线JPEG语法兼容。我们的算法比标准JPEG(典型图像约2 dB)实现了显著的R-D改进，其性能可与更复杂的“最先进”编码器相媲美。例如，对于编码为每像素1.0比特的Lenna图像，我们的jpeg兼容编码器实现了39.6 dB的PSNR，甚至略高于夏皮罗的小波编码器的公开性能。虽然PSNR不能保证主观性能，但我们的算法可以应用于灵活的基于视觉的失真指标。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings DCC '95 Data Compression Conference

自引率

0.00%

发文量