All-in-one weather removal via Multi-Depth Gated Transformer with gradient modulation

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Pattern Recognition Pub Date : 2025-04-03 DOI:10.1016/j.patcog.2025.111643

Xiang Li, Jianwu Li

{"title":"All-in-one weather removal via Multi-Depth Gated Transformer with gradient modulation","authors":"Xiang Li, Jianwu Li","doi":"10.1016/j.patcog.2025.111643","DOIUrl":null,"url":null,"abstract":"<div><div>All-in-one weather removal methods have made impressive progress recently, but their ability to recover finer details from degraded images still needs to be improved, since (1) the difficulty of Convolutional Neural Networks (CNNs) in providing long-distance information interaction or Visual Transformer with simple convolutions in extracting richer local details, makes them unable to effectively utilize similar original texture features in different regions of a degraded image, and (2) under complex weather degradation distributions, their pixel reconstruction loss functions often result in losing high-frequency details in restored images, even when perceptual loss is used. In this paper, we propose a Multi-Depth Gated Transformer Network (MDGTNet) for all-in-one weather removal, with (1) a multi-depth gated module to capture richer background texture details from various weather noises in an input-adaptive manner, (2) self-attentions to reconstruct similar background textures via long-range feature interaction, and (3) a novel Adaptive Smooth <span><math><msub><mrow><mtext>L</mtext></mrow><mrow><mn>1</mn></mrow></msub></math></span>\n (<span><math><msub><mrow><mtext>ASL</mtext></mrow><mrow><mn>1</mn></mrow></msub></math></span>) loss based on gradient modulation to prompt finer detail restoration. Experimental results show that our method achieves superior performance on both synthetic and real-world benchmarks. Source code is available at <span><span>https://github.com/xiangLi-bit/MDGTNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"165 ","pages":"Article 111643"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325003036","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

All-in-one weather removal methods have made impressive progress recently, but their ability to recover finer details from degraded images still needs to be improved, since (1) the difficulty of Convolutional Neural Networks (CNNs) in providing long-distance information interaction or Visual Transformer with simple convolutions in extracting richer local details, makes them unable to effectively utilize similar original texture features in different regions of a degraded image, and (2) under complex weather degradation distributions, their pixel reconstruction loss functions often result in losing high-frequency details in restored images, even when perceptual loss is used. In this paper, we propose a Multi-Depth Gated Transformer Network (MDGTNet) for all-in-one weather removal, with (1) a multi-depth gated module to capture richer background texture details from various weather noises in an input-adaptive manner, (2) self-attentions to reconstruct similar background textures via long-range feature interaction, and (3) a novel Adaptive Smooth

L_{1}

(

{ASL}_{1}

) loss based on gradient modulation to prompt finer detail restoration. Experimental results show that our method achieves superior performance on both synthetic and real-world benchmarks. Source code is available at https://github.com/xiangLi-bit/MDGTNet.

查看原文本刊更多论文

一体化天气去除通过多深度门控变压器与梯度调制

一体化天气去除方法近年来取得了令人印象深刻的进展，但其从退化图像中恢复更精细细节的能力仍有待提高，因为(1)卷积神经网络（cnn）难以提供远距离信息交互或视觉变形（Visual Transformer）用简单卷积提取更丰富的局部细节，这使得它们无法在退化图像的不同区域有效利用相似的原始纹理特征；(2)在复杂的天气退化分布下，即使使用感知损失，它们的像素重建损失函数也会导致恢复图像中的高频细节丢失。在本文中，我们提出了一种多深度门控变压器网络（MDGTNet），用于一体化天气去除，其中(1)多深度门控模块以输入自适应的方式从各种天气噪声中捕获更丰富的背景纹理细节，(2)通过远程特征交互自关注重建相似的背景纹理，以及(3)基于梯度调制的新型自适应平滑L1 （ASL1）损失，以促进更精细的细节恢复。实验结果表明，我们的方法在合成基准和实际基准上都取得了优异的性能。源代码可从https://github.com/xiangLi-bit/MDGTNet获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Pattern Recognition 工程技术-工程：电子与电气

CiteScore

14.40

自引率

16.20%

发文量

683

审稿时长

5.6 months

期刊介绍： The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.