Pixel adaptive deep-unfolding neural network with state space model for image deraining.

IF 6.3 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Neural Networks Pub Date : 2025-11-01 Epub Date: 2025-07-14 DOI:10.1016/j.neunet.2025.107845
Yao Xiao, Youshen Xia
{"title":"Pixel adaptive deep-unfolding neural network with state space model for image deraining.","authors":"Yao Xiao, Youshen Xia","doi":"10.1016/j.neunet.2025.107845","DOIUrl":null,"url":null,"abstract":"<p><p>Rain streaks affects the visual quality and interfere with high-level vision tasks on rainy days. Removing raindrops from captured rainy images becomes important in computer vision applications. Recently, deep-unfolding neural networks (DUNs) are shown their effectiveness on image deraining. Yet, there are two issues that need to be further addressed : 1) Deep unfolding networks typically use convolutional neural networks (CNNs), which lack the ability to perceive global structures, thereby limiting the applicability of the network model; 2) Their gradient descent modules usually rely on a scalar step size, which limits the adaptability of the method to different input images. To address the two issues, we proposes a new image de-raining method based on a pixel adaptive deep unfolding network with state space models. The proposed network mainly consists of both the adaptive pixel-wise gradient descent (APGD) module and the stage fusion proximal mapping (SFPM) module. APGD module overcomes scalar step size inflexibility by adaptively adjusting the gradient step size for each pixel based on the previous stage features. SFPM module adopts a dual-branch architecture combining CNNs with state space models (SSMs) to enhance the perception of both local and global structures. Compared to Transformer-based models, SSM enables efficient long-range dependency modeling with linear complexity. In addition, we introduce a stage feature fusion with the Fourier transform mechanism to reduce information loss during the unfolding process, ensuring key features are effectively propagated. Extensive experiments on multiple public datasets demonstrate that our method consistently outperforms state-of-the-art deraining methods in terms of quantitative metrics and visual quality. The source code is available at https://github.com/cassiopeia-yxx/PADUM.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"191 ","pages":"107845"},"PeriodicalIF":6.3000,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1016/j.neunet.2025.107845","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/14 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Rain streaks affects the visual quality and interfere with high-level vision tasks on rainy days. Removing raindrops from captured rainy images becomes important in computer vision applications. Recently, deep-unfolding neural networks (DUNs) are shown their effectiveness on image deraining. Yet, there are two issues that need to be further addressed : 1) Deep unfolding networks typically use convolutional neural networks (CNNs), which lack the ability to perceive global structures, thereby limiting the applicability of the network model; 2) Their gradient descent modules usually rely on a scalar step size, which limits the adaptability of the method to different input images. To address the two issues, we proposes a new image de-raining method based on a pixel adaptive deep unfolding network with state space models. The proposed network mainly consists of both the adaptive pixel-wise gradient descent (APGD) module and the stage fusion proximal mapping (SFPM) module. APGD module overcomes scalar step size inflexibility by adaptively adjusting the gradient step size for each pixel based on the previous stage features. SFPM module adopts a dual-branch architecture combining CNNs with state space models (SSMs) to enhance the perception of both local and global structures. Compared to Transformer-based models, SSM enables efficient long-range dependency modeling with linear complexity. In addition, we introduce a stage feature fusion with the Fourier transform mechanism to reduce information loss during the unfolding process, ensuring key features are effectively propagated. Extensive experiments on multiple public datasets demonstrate that our method consistently outperforms state-of-the-art deraining methods in terms of quantitative metrics and visual quality. The source code is available at https://github.com/cassiopeia-yxx/PADUM.

基于状态空间模型的像素自适应深度展开神经网络图像训练。
雨痕会影响视觉质量,干扰雨天的高水平视觉任务。从捕获的雨图像中去除雨滴在计算机视觉应用中变得非常重要。近年来,深度展开神经网络(deep-展开neural networks, DUNs)在图像训练方面的有效性得到了充分的证明。然而,有两个问题需要进一步解决:1)深度展开网络通常使用卷积神经网络(cnn),缺乏感知全局结构的能力,从而限制了网络模型的适用性;2)它们的梯度下降模块通常依赖于标量步长,这限制了该方法对不同输入图像的适应性。为了解决这两个问题,我们提出了一种基于状态空间模型的像素自适应深度展开网络的图像去训练方法。该网络主要由自适应逐像素梯度下降(APGD)模块和阶段融合近端映射(SFPM)模块组成。APGD模块通过基于前一阶段特征自适应调整每个像素的梯度步长,克服了标量步长不灵活的问题。SFPM模块采用双分支架构,将cnn与状态空间模型(ssm)相结合,增强对局部和全局结构的感知。与基于transformer的模型相比,SSM支持具有线性复杂性的高效远程依赖关系建模。此外,我们引入了傅里叶变换机制的阶段特征融合,以减少展开过程中的信息损失,确保关键特征得到有效传播。在多个公共数据集上进行的大量实验表明,我们的方法在定量指标和视觉质量方面始终优于最先进的脱轨方法。源代码可从https://github.com/cassiopeia-yxx/PADUM获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Neural Networks
Neural Networks 工程技术-计算机:人工智能
CiteScore
13.90
自引率
7.70%
发文量
425
审稿时长
67 days
期刊介绍: Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信