Pixel adaptive deep-unfolding neural network with state space model for image deraining.

IF 6.3 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Neural Networks Pub Date : 2025-11-01 Epub Date: 2025-07-14 DOI:10.1016/j.neunet.2025.107845

Yao Xiao, Youshen Xia

{"title":"Pixel adaptive deep-unfolding neural network with state space model for image deraining.","authors":"Yao Xiao, Youshen Xia","doi":"10.1016/j.neunet.2025.107845","DOIUrl":null,"url":null,"abstract":"<p><p>Rain streaks affects the visual quality and interfere with high-level vision tasks on rainy days. Removing raindrops from captured rainy images becomes important in computer vision applications. Recently, deep-unfolding neural networks (DUNs) are shown their effectiveness on image deraining. Yet, there are two issues that need to be further addressed : 1) Deep unfolding networks typically use convolutional neural networks (CNNs), which lack the ability to perceive global structures, thereby limiting the applicability of the network model; 2) Their gradient descent modules usually rely on a scalar step size, which limits the adaptability of the method to different input images. To address the two issues, we proposes a new image de-raining method based on a pixel adaptive deep unfolding network with state space models. The proposed network mainly consists of both the adaptive pixel-wise gradient descent (APGD) module and the stage fusion proximal mapping (SFPM) module. APGD module overcomes scalar step size inflexibility by adaptively adjusting the gradient step size for each pixel based on the previous stage features. SFPM module adopts a dual-branch architecture combining CNNs with state space models (SSMs) to enhance the perception of both local and global structures. Compared to Transformer-based models, SSM enables efficient long-range dependency modeling with linear complexity. In addition, we introduce a stage feature fusion with the Fourier transform mechanism to reduce information loss during the unfolding process, ensuring key features are effectively propagated. Extensive experiments on multiple public datasets demonstrate that our method consistently outperforms state-of-the-art deraining methods in terms of quantitative metrics and visual quality. The source code is available at https://github.com/cassiopeia-yxx/PADUM.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"191 ","pages":"107845"},"PeriodicalIF":6.3000,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1016/j.neunet.2025.107845","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/14 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Rain streaks affects the visual quality and interfere with high-level vision tasks on rainy days. Removing raindrops from captured rainy images becomes important in computer vision applications. Recently, deep-unfolding neural networks (DUNs) are shown their effectiveness on image deraining. Yet, there are two issues that need to be further addressed : 1) Deep unfolding networks typically use convolutional neural networks (CNNs), which lack the ability to perceive global structures, thereby limiting the applicability of the network model; 2) Their gradient descent modules usually rely on a scalar step size, which limits the adaptability of the method to different input images. To address the two issues, we proposes a new image de-raining method based on a pixel adaptive deep unfolding network with state space models. The proposed network mainly consists of both the adaptive pixel-wise gradient descent (APGD) module and the stage fusion proximal mapping (SFPM) module. APGD module overcomes scalar step size inflexibility by adaptively adjusting the gradient step size for each pixel based on the previous stage features. SFPM module adopts a dual-branch architecture combining CNNs with state space models (SSMs) to enhance the perception of both local and global structures. Compared to Transformer-based models, SSM enables efficient long-range dependency modeling with linear complexity. In addition, we introduce a stage feature fusion with the Fourier transform mechanism to reduce information loss during the unfolding process, ensuring key features are effectively propagated. Extensive experiments on multiple public datasets demonstrate that our method consistently outperforms state-of-the-art deraining methods in terms of quantitative metrics and visual quality. The source code is available at https://github.com/cassiopeia-yxx/PADUM.

查看原文本刊更多论文

基于状态空间模型的像素自适应深度展开神经网络图像训练。

雨痕会影响视觉质量，干扰雨天的高水平视觉任务。从捕获的雨图像中去除雨滴在计算机视觉应用中变得非常重要。近年来，深度展开神经网络（deep-展开neural networks， DUNs）在图像训练方面的有效性得到了充分的证明。然而，有两个问题需要进一步解决：1)深度展开网络通常使用卷积神经网络（cnn），缺乏感知全局结构的能力，从而限制了网络模型的适用性；2)它们的梯度下降模块通常依赖于标量步长，这限制了该方法对不同输入图像的适应性。为了解决这两个问题，我们提出了一种基于状态空间模型的像素自适应深度展开网络的图像去训练方法。该网络主要由自适应逐像素梯度下降（APGD）模块和阶段融合近端映射（SFPM）模块组成。APGD模块通过基于前一阶段特征自适应调整每个像素的梯度步长，克服了标量步长不灵活的问题。SFPM模块采用双分支架构，将cnn与状态空间模型（ssm）相结合，增强对局部和全局结构的感知。与基于transformer的模型相比，SSM支持具有线性复杂性的高效远程依赖关系建模。此外，我们引入了傅里叶变换机制的阶段特征融合，以减少展开过程中的信息损失，确保关键特征得到有效传播。在多个公共数据集上进行的大量实验表明，我们的方法在定量指标和视觉质量方面始终优于最先进的脱轨方法。源代码可从https://github.com/cassiopeia-yxx/PADUM获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neural Networks 工程技术-计算机：人工智能

CiteScore

13.90

自引率

7.70%

发文量

425

审稿时长

67 days

期刊介绍： Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.