基于离散高斯混合似然和注意模的学习图像压缩

Proposed for presentation at the 2020 Virtual MRS Fall Meeting & Exhibit held November 27 - December 4, 2020. Pub Date : 2021-02-23 DOI:10.36548/JEEA.2020.4.004

G. Ranganathan, Bindhu

{"title":"基于离散高斯混合似然和注意模的学习图像压缩","authors":"G. Ranganathan, Bindhu","doi":"10.36548/JEEA.2020.4.004","DOIUrl":null,"url":null,"abstract":"There have been many compression standards developed during the past few decades and technological advances has resulted in introducing many methodologies with promising results. As far as PSNR metric is concerned, there is a performance gap between reigning compression standards and learned compression algorithms. Based on research, we experimented using an accurate entropy model on the learned compression algorithms to determine the rate-distortion performance. In this paper, discretized Gaussian Mixture likelihood is proposed to determine the latent code parameters in order to attain a more flexible and accurate model of entropy. Moreover, we have also enhanced the performance of the work by introducing recent attention modules in the network architecture. Simulation results indicate that when compared with the previously existing techniques using high-resolution and Kodak datasets, the proposed work achieves a higher rate of performance. When MS-SSIM is used for optimization, our work generates a more visually pleasant image.","PeriodicalId":20643,"journal":{"name":"Proposed for presentation at the 2020 Virtual MRS Fall Meeting & Exhibit held November 27 - December 4, 2020.","volume":"36 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Learned Image Compression with Discretized Gaussian Mixture Likelihoods and Attention Modules\",\"authors\":\"G. Ranganathan, Bindhu\",\"doi\":\"10.36548/JEEA.2020.4.004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There have been many compression standards developed during the past few decades and technological advances has resulted in introducing many methodologies with promising results. As far as PSNR metric is concerned, there is a performance gap between reigning compression standards and learned compression algorithms. Based on research, we experimented using an accurate entropy model on the learned compression algorithms to determine the rate-distortion performance. In this paper, discretized Gaussian Mixture likelihood is proposed to determine the latent code parameters in order to attain a more flexible and accurate model of entropy. Moreover, we have also enhanced the performance of the work by introducing recent attention modules in the network architecture. Simulation results indicate that when compared with the previously existing techniques using high-resolution and Kodak datasets, the proposed work achieves a higher rate of performance. When MS-SSIM is used for optimization, our work generates a more visually pleasant image.\",\"PeriodicalId\":20643,\"journal\":{\"name\":\"Proposed for presentation at the 2020 Virtual MRS Fall Meeting & Exhibit held November 27 - December 4, 2020.\",\"volume\":\"36 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-02-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proposed for presentation at the 2020 Virtual MRS Fall Meeting & Exhibit held November 27 - December 4, 2020.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.36548/JEEA.2020.4.004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proposed for presentation at the 2020 Virtual MRS Fall Meeting & Exhibit held November 27 - December 4, 2020.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36548/JEEA.2020.4.004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 19

摘要

在过去的几十年里，已经开发了许多压缩标准，技术进步导致引入了许多有希望的结果的方法。就PSNR度量而言，主流压缩标准与学习压缩算法之间存在性能差距。在研究的基础上，我们对学习的压缩算法进行了精确熵模型的实验，以确定率失真的性能。为了得到更灵活、准确的熵模型，本文提出了离散高斯混合似然来确定隐码参数。此外，我们还通过在网络架构中引入最新的注意力模块来提高工作的性能。仿真结果表明，与先前使用高分辨率和柯达数据集的现有技术相比，所提出的工作实现了更高的性能。当使用MS-SSIM进行优化时，我们的工作生成了一个视觉上更令人愉快的图像。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Learned Image Compression with Discretized Gaussian Mixture Likelihoods and Attention Modules

There have been many compression standards developed during the past few decades and technological advances has resulted in introducing many methodologies with promising results. As far as PSNR metric is concerned, there is a performance gap between reigning compression standards and learned compression algorithms. Based on research, we experimented using an accurate entropy model on the learned compression algorithms to determine the rate-distortion performance. In this paper, discretized Gaussian Mixture likelihood is proposed to determine the latent code parameters in order to attain a more flexible and accurate model of entropy. Moreover, we have also enhanced the performance of the work by introducing recent attention modules in the network architecture. Simulation results indicate that when compared with the previously existing techniques using high-resolution and Kodak datasets, the proposed work achieves a higher rate of performance. When MS-SSIM is used for optimization, our work generates a more visually pleasant image.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proposed for presentation at the 2020 Virtual MRS Fall Meeting & Exhibit held November 27 - December 4, 2020.

自引率

0.00%

发文量