{"title":"Dynamic frequency window transformer for single image deraining","authors":"Pengcheng Wang, Yuli Fu, Youjun Xiang, Yufeng Tan","doi":"10.1016/j.patrec.2025.02.009","DOIUrl":null,"url":null,"abstract":"<div><div>Outdoor visual systems often encounter rain streaks leading to the degradation of images that affects later high-level tasks. In contrast to methods that currently perform feature extraction in the spatial domain, this paper proposes a Dynamic Frequency Window Transformer Network(DFWT-Net) by integrating the frequency and spatial domain information of images. Firstly, the Space Frequency Module (SFM) is utilized to extract local features and preliminary frequency characteristics of the image, including amplitude and phase. Subsequently, the residuals of the rain streaks are obtained by a hierarchical encoder–decoder network consisting of dynamic frequency window transformers (DFWTs). The Dynamic Frequency Window Transformer (DFWT) employs learnable masks for dynamic frequency filtering before self-attention computation to emphasize the frequency of rain streaks. To obtain rain-free images that resemble real scenes, this paper propose the Cosine Phase Loss (CPL) function to measure the phase similarity between the recovered image and the ground truth image. The experimental results thoroughly validate the effectiveness and robustness of the proposed network.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"190 ","pages":"Pages 111-117"},"PeriodicalIF":3.9000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167865525000510","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Outdoor visual systems often encounter rain streaks leading to the degradation of images that affects later high-level tasks. In contrast to methods that currently perform feature extraction in the spatial domain, this paper proposes a Dynamic Frequency Window Transformer Network(DFWT-Net) by integrating the frequency and spatial domain information of images. Firstly, the Space Frequency Module (SFM) is utilized to extract local features and preliminary frequency characteristics of the image, including amplitude and phase. Subsequently, the residuals of the rain streaks are obtained by a hierarchical encoder–decoder network consisting of dynamic frequency window transformers (DFWTs). The Dynamic Frequency Window Transformer (DFWT) employs learnable masks for dynamic frequency filtering before self-attention computation to emphasize the frequency of rain streaks. To obtain rain-free images that resemble real scenes, this paper propose the Cosine Phase Loss (CPL) function to measure the phase similarity between the recovered image and the ground truth image. The experimental results thoroughly validate the effectiveness and robustness of the proposed network.
期刊介绍:
Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition.
Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.