Learned Image Compression with Discretized Gaussian Mixture Likelihoods and Attention Modules

G. Ranganathan, Bindhu
DOI: 10.36548/JEEA.2020.4.004 (https://doi.org/10.36548/JEEA.2020.4.004)
Published: 2021-02-23
Publication type: Journal Article
Source: Proposed for presentation at the 2020 Virtual MRS Fall Meeting & Exhibit held November 27 - December 4, 2020.
Citations: 19

Abstract

Many compression standards have been developed over the past few decades, and technological advances have introduced many methods with promising results. In terms of the PSNR metric, a performance gap remains between the prevailing compression standards and learned compression algorithms. Building on prior research, we experimented with an accurate entropy model for learned compression algorithms to determine its rate-distortion performance. In this paper, a discretized Gaussian mixture likelihood is proposed to parameterize the latent codes, giving a more flexible and accurate entropy model. We further improve performance by introducing recent attention modules into the network architecture. Simulation results on the Kodak and high-resolution datasets indicate that the proposed work outperforms previously existing techniques. When optimized for MS-SSIM, our method produces more visually pleasing images.
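
The abstract does not spell out the discretized Gaussian mixture likelihood, so the following is only a minimal sketch under stated assumptions: latent values are rounded to integers, each mixture component is integrated over a unit-width bin around the symbol, and the ideal code length is the negative base-2 logarithm of the resulting probability mass. The function names and the example mixture parameters are hypothetical and are not taken from the paper; in a learned codec the weights, means, and scales would be predicted by the network.

```python
import math


def gaussian_cdf(x, mean, scale):
    """CDF of a normal distribution with the given mean and standard deviation."""
    return 0.5 * (1.0 + math.erf((x - mean) / (scale * math.sqrt(2.0))))


def discretized_gmm_likelihood(y_hat, weights, means, scales):
    """Probability mass of the integer symbol y_hat under a Gaussian mixture.

    Each component is integrated over the bin [y_hat - 0.5, y_hat + 0.5]
    (assumed unit-width quantization), then weighted and summed.
    """
    total = 0.0
    for w, mu, sigma in zip(weights, means, scales):
        upper = gaussian_cdf(y_hat + 0.5, mu, sigma)
        lower = gaussian_cdf(y_hat - 0.5, mu, sigma)
        total += w * (upper - lower)
    return total


def estimated_bits(y_hat, weights, means, scales, eps=1e-9):
    """Ideal code length in bits for one symbol: -log2 of its probability mass.

    The mass is clamped to eps so that very unlikely symbols stay finite.
    """
    p = max(discretized_gmm_likelihood(y_hat, weights, means, scales), eps)
    return -math.log2(p)


# Hypothetical three-component mixture for a single latent element.
weights = [0.6, 0.3, 0.1]   # mixture weights, assumed to sum to 1
means = [0.0, 2.0, -3.0]
scales = [0.5, 1.0, 2.0]    # standard deviations, must be positive

# The masses over a wide range of integer symbols should sum to about 1.
total_mass = sum(
    discretized_gmm_likelihood(k, weights, means, scales) for k in range(-60, 61)
)
```

With several components, the mixture can place mass around more than one mode, which a single Gaussian cannot do. Symbols near a component mean, such as `estimated_bits(0, weights, means, scales)`, cost fewer bits than symbols far from every mean.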