{"title":"Boosting Restoration of Turbulence-Degraded Images With State Space Conditional Diffusion","authors":"Yubo Wu;Kuanhong Cheng;Tingting Chai;Gengyu Lyu;Shuping Zhao;Wei Jia","doi":"10.1109/TSP.2025.3580723","DOIUrl":null,"url":null,"abstract":"Recovering fine details from turbulence-distorted images is highly challenging due to the complex, spatially varying, and stochastic nature of the distortion process. Conventional multi-frame methods rely on extracting and averaging clear regions from pre-aligned frames, but their effectiveness is limited due to the rarity of “lucky regions”. In contrast, learning based methods have shown superior performance across various vision tasks. However, existing deep learning approaches still face key limitations: (1) they struggle to efficiently model the global context required for correcting pixel dispersion caused by spatially varying Point Spread Functions (PSFs); (2) they often overlook the physical formation of turbulence, particularly the spatial-frequency relationship between phase distortions and PSFs; and (3) they rely on deterministic architectures that fail to capture the inherent uncertainty in turbulence, leading to visually implausible outputs. To address these issues, we propose the Two-Stage Turbulence Removal Network (TSTRNet). The first stage uses a UNet-based generator built on the State Space Model to perform efficient, coarse global restoration. The second stage refines the output through a Denoising Diffusion Probabilistic Model, introducing stochasticity and edge-guided conditioning for detail enhancement and realism. Both stages incorporate frequency-domain processing to align with the physical characteristics of turbulence. Experimental results on multiple benchmark datasets demonstrate that TSTRNet achieves superior restoration performance compared to state-of-the-art methods, with strong generalization from synthetic to real-world scenarios.","PeriodicalId":13330,"journal":{"name":"IEEE Transactions on Signal Processing","volume":"73 ","pages":"2631-2645"},"PeriodicalIF":4.6000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11046205/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Recovering fine details from turbulence-distorted images is highly challenging due to the complex, spatially varying, and stochastic nature of the distortion process. Conventional multi-frame methods rely on extracting and averaging clear regions from pre-aligned frames, but their effectiveness is limited due to the rarity of “lucky regions”. In contrast, learning based methods have shown superior performance across various vision tasks. However, existing deep learning approaches still face key limitations: (1) they struggle to efficiently model the global context required for correcting pixel dispersion caused by spatially varying Point Spread Functions (PSFs); (2) they often overlook the physical formation of turbulence, particularly the spatial-frequency relationship between phase distortions and PSFs; and (3) they rely on deterministic architectures that fail to capture the inherent uncertainty in turbulence, leading to visually implausible outputs. To address these issues, we propose the Two-Stage Turbulence Removal Network (TSTRNet). The first stage uses a UNet-based generator built on the State Space Model to perform efficient, coarse global restoration. The second stage refines the output through a Denoising Diffusion Probabilistic Model, introducing stochasticity and edge-guided conditioning for detail enhancement and realism. Both stages incorporate frequency-domain processing to align with the physical characteristics of turbulence. Experimental results on multiple benchmark datasets demonstrate that TSTRNet achieves superior restoration performance compared to state-of-the-art methods, with strong generalization from synthetic to real-world scenarios.
期刊介绍:
The IEEE Transactions on Signal Processing covers novel theory, algorithms, performance analyses and applications of techniques for the processing, understanding, learning, retrieval, mining, and extraction of information from signals. The term “signal” includes, among others, audio, video, speech, image, communication, geophysical, sonar, radar, medical and musical signals. Examples of topics of interest include, but are not limited to, information processing and the theory and application of filtering, coding, transmitting, estimating, detecting, analyzing, recognizing, synthesizing, recording, and reproducing signals.