Rate-Distortion-Complexity Optimized Framework for Multi-Model Image Compression

IF 13.7

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society Pub Date : 2025-08-21 DOI:10.1109/TIP.2025.3598916

Xinyu Hang;Ziqing Ge;Hongfei Fan;Chuanmin Jia;Siwei Ma;Wen Gao

{"title":"Rate-Distortion-Complexity Optimized Framework for Multi-Model Image Compression","authors":"Xinyu Hang;Ziqing Ge;Hongfei Fan;Chuanmin Jia;Siwei Ma;Wen Gao","doi":"10.1109/TIP.2025.3598916","DOIUrl":null,"url":null,"abstract":"Learned Image Compression (LIC) has experienced rapid growth with the emergence of diverse frameworks. However, the variability in model design and training datasets poses a challenge for the universal application of a single coding model. To address this problem, this paper introduces a pioneering multi-model image coding framework that integrates various image codecs to overcome these limitations. By dynamically allocating codecs to different image regions, our framework optimizes reconstruction quality within the constraints of limited bitrate and decoding time, offering a high-performance, ubiquitous solution for the rate-distortion-complexity trade-off. Our framework features a detailed codec assignment algorithm based on the Simulated Annealing (SA) method, selected for its proven efficacy in managing the discrete and intricate nature of codec assignment optimization. We have implemented a coarse-to-fine strategy, which significantly enhances efficiency. Notably, our framework maintains compatibility with all standard image codecs without necessitating structural modifications. Empirical results indicate that our framework establishes a new standard in LIC, advancing the Pareto frontier for performance-complexity trade-offs. It achieves a significant 70% reduction in decoding time compared to current state-of-the-art methods, without compromising reconstruction quality. Furthermore, under comparable conditions, our approach not only outperforms but significantly eclipses existing Rate-Distortion-Complexity (RDC) optimized codecs, with decoding speeds up to 30 times faster.","PeriodicalId":94032,"journal":{"name":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","volume":"34 ","pages":"5385-5399"},"PeriodicalIF":13.7000,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11131530/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Learned Image Compression (LIC) has experienced rapid growth with the emergence of diverse frameworks. However, the variability in model design and training datasets poses a challenge for the universal application of a single coding model. To address this problem, this paper introduces a pioneering multi-model image coding framework that integrates various image codecs to overcome these limitations. By dynamically allocating codecs to different image regions, our framework optimizes reconstruction quality within the constraints of limited bitrate and decoding time, offering a high-performance, ubiquitous solution for the rate-distortion-complexity trade-off. Our framework features a detailed codec assignment algorithm based on the Simulated Annealing (SA) method, selected for its proven efficacy in managing the discrete and intricate nature of codec assignment optimization. We have implemented a coarse-to-fine strategy, which significantly enhances efficiency. Notably, our framework maintains compatibility with all standard image codecs without necessitating structural modifications. Empirical results indicate that our framework establishes a new standard in LIC, advancing the Pareto frontier for performance-complexity trade-offs. It achieves a significant 70% reduction in decoding time compared to current state-of-the-art methods, without compromising reconstruction quality. Furthermore, under comparable conditions, our approach not only outperforms but significantly eclipses existing Rate-Distortion-Complexity (RDC) optimized codecs, with decoding speeds up to 30 times faster.

查看原文本刊更多论文

多模型图像压缩的速率-失真-复杂度优化框架

随着各种框架的出现，学习图像压缩（LIC）得到了快速发展。然而，模型设计和训练数据集的可变性对单一编码模型的普遍应用提出了挑战。为了解决这个问题，本文介绍了一个开创性的多模型图像编码框架，该框架集成了各种图像编解码器来克服这些限制。通过动态地将编解码器分配到不同的图像区域，我们的框架在有限的比特率和解码时间的约束下优化了重建质量，为速率-失真-复杂性权衡提供了高性能，普遍存在的解决方案。我们的框架具有基于模拟退火（SA）方法的详细编解码器分配算法，该算法因其在管理编解码器分配优化的离散和复杂性质方面的有效性而被选中。我们实施了从粗到精的策略，大大提高了效率。值得注意的是，我们的框架保持了与所有标准图像编解码器的兼容性，而无需进行结构修改。实证结果表明，我们的框架建立了LIC的新标准，推进了性能复杂性权衡的帕累托边界。与目前最先进的方法相比，它实现了解码时间显著减少70%，而不会影响重建质量。此外，在可比较的条件下，我们的方法不仅优于现有的RDC优化编解码器，而且大大超过了现有的RDC优化编解码器，解码速度提高了30倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

自引率

0.00%

发文量