Lightweight Prompt Learning Implicit Degradation Estimation Network for Blind Super Resolution

Asif Hussain Khan;Christian Micheloni;Niki Martinel
{"title":"Lightweight Prompt Learning Implicit Degradation Estimation Network for Blind Super Resolution","authors":"Asif Hussain Khan;Christian Micheloni;Niki Martinel","doi":"10.1109/TIP.2024.3442613","DOIUrl":null,"url":null,"abstract":"Blind image super-resolution (SR) aims to recover a high-resolution (HR) image from its low-resolution (LR) counterpart under the assumption of unknown degradations. Many existing blind SR methods rely on supervising ground-truth kernels referred to as explicit degradation estimators. However, it is very challenging to obtain the ground-truths for different degradations kernels. Moreover, most of these methods rely on heavy backbone networks, which demand extensive computational resources. Implicit degradation estimators do not require the availability of ground truth kernels, but they see a significant performance gap with the explicit degradation estimators due to such missing information. We present a novel approach that significantly narrows such a gap by means of a lightweight architecture that implicitly learns the degradation kernel with the help of a novel loss component. The kernel is exploited by a learnable Wiener filter that performs efficient deconvolution in the Fourier domain by deriving a closed-form solution. Inspired by prompt-based learning, we also propose a novel degradation-conditioned prompt layer that exploits the estimated kernel to drive the focus on the discriminative contextual information that guides the reconstruction process in recovering the latent HR image. Extensive experiments under different degradation settings demonstrate that our model, named PL-IDENet, yields PSNR and SSIM improvements of more than \n<inline-formula> <tex-math>$0.4dB$ </tex-math></inline-formula>\n and 1.3%, and \n<inline-formula> <tex-math>$1.4dB$ </tex-math></inline-formula>\n and 4.8% to the best implicit and explicit blind-SR method, respectively. These results are achieved while maintaining a substantially lower number of parameters/FLOPs (i.e., 25% and 68% fewer parameters than best implicit and explicit methods, respectively).","PeriodicalId":94032,"journal":{"name":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10639339/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Blind image super-resolution (SR) aims to recover a high-resolution (HR) image from its low-resolution (LR) counterpart under the assumption of unknown degradations. Many existing blind SR methods rely on supervising ground-truth kernels referred to as explicit degradation estimators. However, it is very challenging to obtain the ground-truths for different degradations kernels. Moreover, most of these methods rely on heavy backbone networks, which demand extensive computational resources. Implicit degradation estimators do not require the availability of ground truth kernels, but they see a significant performance gap with the explicit degradation estimators due to such missing information. We present a novel approach that significantly narrows such a gap by means of a lightweight architecture that implicitly learns the degradation kernel with the help of a novel loss component. The kernel is exploited by a learnable Wiener filter that performs efficient deconvolution in the Fourier domain by deriving a closed-form solution. Inspired by prompt-based learning, we also propose a novel degradation-conditioned prompt layer that exploits the estimated kernel to drive the focus on the discriminative contextual information that guides the reconstruction process in recovering the latent HR image. Extensive experiments under different degradation settings demonstrate that our model, named PL-IDENet, yields PSNR and SSIM improvements of more than $0.4dB$ and 1.3%, and $1.4dB$ and 4.8% to the best implicit and explicit blind-SR method, respectively. These results are achieved while maintaining a substantially lower number of parameters/FLOPs (i.e., 25% and 68% fewer parameters than best implicit and explicit methods, respectively).
用于盲超解像的轻量级提示学习隐式退化估计网络
盲图像超分辨率(SR)旨在从低分辨率(LR)对应图像中恢复出高分辨率(HR)图像,前提是退化情况未知。许多现有的盲超解像方法都依赖于被称为显式退化估计器的监督地面实况核。然而,要获得不同退化内核的地面实况非常具有挑战性。此外,这些方法大多依赖于庞大的骨干网络,需要大量的计算资源。隐式降解估计器不需要获得地面实况内核,但由于此类信息缺失,它们与显式降解估计器的性能差距很大。我们提出了一种新颖的方法,通过一种轻量级架构,借助新颖的损失组件隐式地学习退化内核,从而大大缩小了这种差距。该内核由可学习的维纳滤波器利用,通过推导闭式解在傅里叶域执行高效的解卷积。受基于提示的学习的启发,我们还提出了一个新颖的降解条件提示层,利用估计的内核来驱动对鉴别性上下文信息的关注,从而在恢复潜在 HR 图像的过程中指导重建过程。在不同降解设置下进行的大量实验表明,我们的模型(命名为 PL-IDENet)的 PSNR 和 SSIM 分别比最佳隐式和显式盲 SR 方法提高了 0.4dB 和 1.3% 以及 1.4dB 和 4.8%。在取得这些结果的同时,还大大降低了参数/FLOP 数量(即分别比最佳隐式和显式方法少 25% 和 68%)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信