All-in-one weather removal via Multi-Depth Gated Transformer with gradient modulation

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Xiang Li, Jianwu Li
{"title":"All-in-one weather removal via Multi-Depth Gated Transformer with gradient modulation","authors":"Xiang Li,&nbsp;Jianwu Li","doi":"10.1016/j.patcog.2025.111643","DOIUrl":null,"url":null,"abstract":"<div><div>All-in-one weather removal methods have made impressive progress recently, but their ability to recover finer details from degraded images still needs to be improved, since (1) the difficulty of Convolutional Neural Networks (CNNs) in providing long-distance information interaction or Visual Transformer with simple convolutions in extracting richer local details, makes them unable to effectively utilize similar original texture features in different regions of a degraded image, and (2) under complex weather degradation distributions, their pixel reconstruction loss functions often result in losing high-frequency details in restored images, even when perceptual loss is used. In this paper, we propose a Multi-Depth Gated Transformer Network (MDGTNet) for all-in-one weather removal, with (1) a multi-depth gated module to capture richer background texture details from various weather noises in an input-adaptive manner, (2) self-attentions to reconstruct similar background textures via long-range feature interaction, and (3) a novel Adaptive Smooth <span><math><msub><mrow><mtext>L</mtext></mrow><mrow><mn>1</mn></mrow></msub></math></span>\n (<span><math><msub><mrow><mtext>ASL</mtext></mrow><mrow><mn>1</mn></mrow></msub></math></span>) loss based on gradient modulation to prompt finer detail restoration. Experimental results show that our method achieves superior performance on both synthetic and real-world benchmarks. Source code is available at <span><span>https://github.com/xiangLi-bit/MDGTNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"165 ","pages":"Article 111643"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325003036","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

All-in-one weather removal methods have made impressive progress recently, but their ability to recover finer details from degraded images still needs to be improved, since (1) the difficulty of Convolutional Neural Networks (CNNs) in providing long-distance information interaction or Visual Transformer with simple convolutions in extracting richer local details, makes them unable to effectively utilize similar original texture features in different regions of a degraded image, and (2) under complex weather degradation distributions, their pixel reconstruction loss functions often result in losing high-frequency details in restored images, even when perceptual loss is used. In this paper, we propose a Multi-Depth Gated Transformer Network (MDGTNet) for all-in-one weather removal, with (1) a multi-depth gated module to capture richer background texture details from various weather noises in an input-adaptive manner, (2) self-attentions to reconstruct similar background textures via long-range feature interaction, and (3) a novel Adaptive Smooth L1 (ASL1) loss based on gradient modulation to prompt finer detail restoration. Experimental results show that our method achieves superior performance on both synthetic and real-world benchmarks. Source code is available at https://github.com/xiangLi-bit/MDGTNet.
一体化天气去除通过多深度门控变压器与梯度调制
一体化天气去除方法近年来取得了令人印象深刻的进展,但其从退化图像中恢复更精细细节的能力仍有待提高,因为(1)卷积神经网络(cnn)难以提供远距离信息交互或视觉变形(Visual Transformer)用简单卷积提取更丰富的局部细节,这使得它们无法在退化图像的不同区域有效利用相似的原始纹理特征;(2)在复杂的天气退化分布下,即使使用感知损失,它们的像素重建损失函数也会导致恢复图像中的高频细节丢失。在本文中,我们提出了一种多深度门控变压器网络(MDGTNet),用于一体化天气去除,其中(1)多深度门控模块以输入自适应的方式从各种天气噪声中捕获更丰富的背景纹理细节,(2)通过远程特征交互自关注重建相似的背景纹理,以及(3)基于梯度调制的新型自适应平滑L1 (ASL1)损失,以促进更精细的细节恢复。实验结果表明,我们的方法在合成基准和实际基准上都取得了优异的性能。源代码可从https://github.com/xiangLi-bit/MDGTNet获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Pattern Recognition
Pattern Recognition 工程技术-工程:电子与电气
CiteScore
14.40
自引率
16.20%
发文量
683
审稿时长
5.6 months
期刊介绍: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信