Lightweight Prompt Learning Implicit Degradation Estimation Network for Blind Super Resolution

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society Pub Date : 2024-08-19 DOI:10.1109/TIP.2024.3442613

Asif Hussain Khan;Christian Micheloni;Niki Martinel

{"title":"Lightweight Prompt Learning Implicit Degradation Estimation Network for Blind Super Resolution","authors":"Asif Hussain Khan;Christian Micheloni;Niki Martinel","doi":"10.1109/TIP.2024.3442613","DOIUrl":null,"url":null,"abstract":"Blind image super-resolution (SR) aims to recover a high-resolution (HR) image from its low-resolution (LR) counterpart under the assumption of unknown degradations. Many existing blind SR methods rely on supervising ground-truth kernels referred to as explicit degradation estimators. However, it is very challenging to obtain the ground-truths for different degradations kernels. Moreover, most of these methods rely on heavy backbone networks, which demand extensive computational resources. Implicit degradation estimators do not require the availability of ground truth kernels, but they see a significant performance gap with the explicit degradation estimators due to such missing information. We present a novel approach that significantly narrows such a gap by means of a lightweight architecture that implicitly learns the degradation kernel with the help of a novel loss component. The kernel is exploited by a learnable Wiener filter that performs efficient deconvolution in the Fourier domain by deriving a closed-form solution. Inspired by prompt-based learning, we also propose a novel degradation-conditioned prompt layer that exploits the estimated kernel to drive the focus on the discriminative contextual information that guides the reconstruction process in recovering the latent HR image. Extensive experiments under different degradation settings demonstrate that our model, named PL-IDENet, yields PSNR and SSIM improvements of more than \n<inline-formula> <tex-math>$0.4dB$ </tex-math></inline-formula>\n and 1.3%, and \n<inline-formula> <tex-math>$1.4dB$ </tex-math></inline-formula>\n and 4.8% to the best implicit and explicit blind-SR method, respectively. These results are achieved while maintaining a substantially lower number of parameters/FLOPs (i.e., 25% and 68% fewer parameters than best implicit and explicit methods, respectively).","PeriodicalId":94032,"journal":{"name":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10639339/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Blind image super-resolution (SR) aims to recover a high-resolution (HR) image from its low-resolution (LR) counterpart under the assumption of unknown degradations. Many existing blind SR methods rely on supervising ground-truth kernels referred to as explicit degradation estimators. However, it is very challenging to obtain the ground-truths for different degradations kernels. Moreover, most of these methods rely on heavy backbone networks, which demand extensive computational resources. Implicit degradation estimators do not require the availability of ground truth kernels, but they see a significant performance gap with the explicit degradation estimators due to such missing information. We present a novel approach that significantly narrows such a gap by means of a lightweight architecture that implicitly learns the degradation kernel with the help of a novel loss component. The kernel is exploited by a learnable Wiener filter that performs efficient deconvolution in the Fourier domain by deriving a closed-form solution. Inspired by prompt-based learning, we also propose a novel degradation-conditioned prompt layer that exploits the estimated kernel to drive the focus on the discriminative contextual information that guides the reconstruction process in recovering the latent HR image. Extensive experiments under different degradation settings demonstrate that our model, named PL-IDENet, yields PSNR and SSIM improvements of more than

$0.4dB$

and 1.3%, and

$1.4dB$

and 4.8% to the best implicit and explicit blind-SR method, respectively. These results are achieved while maintaining a substantially lower number of parameters/FLOPs (i.e., 25% and 68% fewer parameters than best implicit and explicit methods, respectively).

查看原文本刊更多论文

用于盲超解像的轻量级提示学习隐式退化估计网络

盲图像超分辨率（SR）旨在从低分辨率（LR）对应图像中恢复出高分辨率（HR）图像，前提是退化情况未知。许多现有的盲超解像方法都依赖于被称为显式退化估计器的监督地面实况核。然而，要获得不同退化内核的地面实况非常具有挑战性。此外，这些方法大多依赖于庞大的骨干网络，需要大量的计算资源。隐式降解估计器不需要获得地面实况内核，但由于此类信息缺失，它们与显式降解估计器的性能差距很大。我们提出了一种新颖的方法，通过一种轻量级架构，借助新颖的损失组件隐式地学习退化内核，从而大大缩小了这种差距。该内核由可学习的维纳滤波器利用，通过推导闭式解在傅里叶域执行高效的解卷积。受基于提示的学习的启发，我们还提出了一个新颖的降解条件提示层，利用估计的内核来驱动对鉴别性上下文信息的关注，从而在恢复潜在 HR 图像的过程中指导重建过程。在不同降解设置下进行的大量实验表明，我们的模型（命名为 PL-IDENet）的 PSNR 和 SSIM 分别比最佳隐式和显式盲 SR 方法提高了 0.4dB 和 1.3% 以及 1.4dB 和 4.8%。在取得这些结果的同时，还大大降低了参数/FLOP 数量（即分别比最佳隐式和显式方法少 25% 和 68%）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

自引率

0.00%

发文量

文献相关原料

公司名称	产品信息	采购帮参考价格