A channel-gained single-model network with variable rate for multispectral image compression in UAV air-to-ground remote sensing

IF 3.1 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Multimedia Systems Pub Date : 2024-07-02 DOI:10.1007/s00530-024-01398-6

Wei Wang, Daiyin Zhu, Kedi Hu

{"title":"A channel-gained single-model network with variable rate for multispectral image compression in UAV air-to-ground remote sensing","authors":"Wei Wang, Daiyin Zhu, Kedi Hu","doi":"10.1007/s00530-024-01398-6","DOIUrl":null,"url":null,"abstract":"<p>Unmanned aerial vehicle (UAV) air-to-ground remote sensing technology, has the advantages of long flight duration, real-time image transmission, wide applicability, low cost, and so on. To better preserve the integrity of image features during transmission and storage, and improve efficiency in the meanwhile, image compression is a very important link. Nowadays the image compressor based on deep learning framework has been updating as the technological development. However, in order to obtain enough bit rates to fit the performance curve, there is always a severe computational burden, especially for multispectral image compression. This problem arises not only because the complexity of the algorithm is deepening, but also repeated training with rate-distortion optimization. In this paper, a channel-gained single-model network with variable rate for multispectral image compression is proposed. First, a channel gained module is introduced to map the channel content of the image to vector domain as amplitude factors, which leads to representation scaling, as well as obtaining the image representation of different bit rates in a single model. Second, after extracting spatial-spectral features, a plug-and-play dynamic response attention mechanism module is applied to take good care of distinguishing the content correlation of features and weighting the important area dynamically without adding extra parameters. Besides, a hyperprior autoencoder is used to make full use of edge information for entropy estimation, which contributes to a more accurate entropy model. The experiments prove that the proposed method greatly reduces the computational cost, while maintaining good compression performance and surpasses JPEG2000 and some other algorithms based on deep learning in PSNR, MSSSIM and MSA.</p>","PeriodicalId":51138,"journal":{"name":"Multimedia Systems","volume":"19 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Multimedia Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00530-024-01398-6","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Unmanned aerial vehicle (UAV) air-to-ground remote sensing technology, has the advantages of long flight duration, real-time image transmission, wide applicability, low cost, and so on. To better preserve the integrity of image features during transmission and storage, and improve efficiency in the meanwhile, image compression is a very important link. Nowadays the image compressor based on deep learning framework has been updating as the technological development. However, in order to obtain enough bit rates to fit the performance curve, there is always a severe computational burden, especially for multispectral image compression. This problem arises not only because the complexity of the algorithm is deepening, but also repeated training with rate-distortion optimization. In this paper, a channel-gained single-model network with variable rate for multispectral image compression is proposed. First, a channel gained module is introduced to map the channel content of the image to vector domain as amplitude factors, which leads to representation scaling, as well as obtaining the image representation of different bit rates in a single model. Second, after extracting spatial-spectral features, a plug-and-play dynamic response attention mechanism module is applied to take good care of distinguishing the content correlation of features and weighting the important area dynamically without adding extra parameters. Besides, a hyperprior autoencoder is used to make full use of edge information for entropy estimation, which contributes to a more accurate entropy model. The experiments prove that the proposed method greatly reduces the computational cost, while maintaining good compression performance and surpasses JPEG2000 and some other algorithms based on deep learning in PSNR, MSSSIM and MSA.

Abstract Image

查看原文本刊更多论文

用于无人机空对地遥感多光谱图像压缩的具有可变速率的信道增益单模型网络

无人机（UAV）空对地遥感技术，具有飞行时间长、图像传输实时、适用性广、成本低等优点。为了在传输和存储过程中更好地保持图像特征的完整性，同时提高效率，图像压缩是一个非常重要的环节。如今，随着技术的发展，基于深度学习框架的图像压缩技术也在不断更新。然而，为了获得足够的比特率以适应性能曲线，始终存在着严重的计算负担，尤其是多光谱图像压缩。出现这一问题的原因不仅在于算法复杂度的不断加深，还在于反复训练的速率失真优化。本文提出了一种用于多光谱图像压缩的速率可变的信道增益单模型网络。首先，引入信道增益模块，将图像的信道内容以振幅因子的形式映射到矢量域，从而实现表示缩放，并在单一模型中获得不同比特率的图像表示。其次，在提取空间光谱特征后，应用即插即用的动态响应关注机制模块，在不增加额外参数的情况下，很好地区分特征的内容相关性，并对重要区域进行动态加权。此外，该方法还采用了超优先自动编码器，充分利用边缘信息进行熵估计，从而建立了更精确的熵模型。实验证明，所提出的方法大大降低了计算成本，同时保持了良好的压缩性能，在 PSNR、MSSSIM 和 MSA 方面超过了 JPEG2000 和其他一些基于深度学习的算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Multimedia Systems 工程技术-计算机：理论方法

CiteScore

5.40

自引率

7.70%

发文量

148

审稿时长

4.5 months

期刊介绍： This journal details innovative research ideas, emerging technologies, state-of-the-art methods and tools in all aspects of multimedia computing, communication, storage, and applications. It features theoretical, experimental, and survey articles.