Scalar and Vector Quantization for Learned Image Compression: A Study on the Effects of MSE and GAN Loss in Various Spaces
Jonas Löhdefink, Fabian Hüger, Peter Schlicht, T. Fingscheidt
2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), published 2020-09-20
DOI: 10.1109/ITSC45102.2020.9294350
Citations: 5
Abstract
Recently, learned image compression with neural networks has seen a performance boost from the use of adversarial loss functions. Typically, a generative adversarial network (GAN) is designed with the generator being an autoencoder with a quantizer in the bottleneck for compression and reconstruction. It is well known from rate-distortion theory that vector quantizers provide lower quantization errors than scalar quantizers at the same bitrate. Still, learned image compression approaches often use scalar quantization instead. In this work we provide insights into the image reconstruction quality of the often-employed uniform scalar quantizers, non-uniform scalar quantizers, and the rarely employed but bitrate-efficient vector quantizers, all integrated into backpropagation and operating at exactly the same bitrate. Further insights are obtained from our investigation of an MSE loss and a GAN loss. We show that vector quantization is always beneficial for compression performance, both in the latent space and in the reconstructed image space. However, image samples demonstrate that the GAN loss produces more pleasing reconstructed images, while the non-adversarial MSE loss yields better scores on various instrumental quality measures, both in the latent space and on the reconstructed images.
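To make the contrast between the two quantizer families concrete, the following is a minimal sketch in PyTorch, assuming a straight-through estimator so that both quantizers can sit inside backpropagation as the abstract describes. It is not the authors' implementation: the class names, the latent range of [0, 1], and the codebook size are illustrative assumptions.

```python
import torch
import torch.nn as nn


class UniformScalarQuantizer(nn.Module):
    """Quantizes each latent element independently to one of `levels` values.

    The straight-through estimator passes gradients through the rounding,
    so the quantizer can be trained end-to-end inside backpropagation.
    """

    def __init__(self, levels: int = 8):
        super().__init__()
        self.levels = levels

    def forward(self, z: torch.Tensor) -> torch.Tensor:
        z = torch.clamp(z, 0.0, 1.0)  # assumption: latents lie in [0, 1]
        z_hat = torch.round(z * (self.levels - 1)) / (self.levels - 1)
        return z + (z_hat - z).detach()  # straight-through gradient


class VectorQuantizer(nn.Module):
    """Maps each d-dimensional latent vector to its nearest codebook entry.

    Spending log2(codebook_size) bits per d latent dimensions, VQ can reach
    a lower quantization error than scalar quantization at the same bitrate.
    """

    def __init__(self, codebook_size: int = 512, dim: int = 8):
        super().__init__()
        self.codebook = nn.Parameter(torch.randn(codebook_size, dim))

    def forward(self, z: torch.Tensor) -> torch.Tensor:
        # z: (N, dim); pick the nearest codeword by Euclidean distance.
        dists = torch.cdist(z, self.codebook)  # (N, codebook_size)
        idx = dists.argmin(dim=1)
        z_hat = self.codebook[idx]
        return z + (z_hat - z).detach()  # straight-through gradient
```

For a bitrate-matched comparison of the kind the paper studies, a vector quantizer with K codewords over d-dimensional vectors spends log2(K)/d bits per latent element, so it would be pitted against a scalar quantizer whose number of levels yields the same per-element rate.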