{"title":"A hybrid frequency-spatial domain unsupervised denoising model for Gaussian-Poisson mixed noise in medical imaging","authors":"Cheng Zhang, Kin Sam Yen","doi":"10.1016/j.compbiomed.2025.110374","DOIUrl":null,"url":null,"abstract":"<div><div>This paper proposes an unsupervised denoising model designed to address Gaussian-Poisson hybrid noise in CT, MRI, and X-ray images. Traditional deep image prior (DIP) methods suffer from slow convergence, spectral bias, and overfitting, limiting their clinical applicability. In this paper, by applying the Fourier transform, we incorporate frequency-domain priors extracted from the observed noisy image at the input stage. Instead of using both amplitude and phase, we rely solely on the amplitude spectrum, which captures the energy distribution of various frequency components while avoiding the instability associated with phase information. Meanwhile, we retain the spatial domain information to preserve the image's structural integrity, ensuring the effective capture of both low-frequency details and high-frequency anatomical features. This dual-domain strategy allows fine details to be captured early in training, thereby mitigating spectral bias, accelerating convergence, and improving the preservation of high-frequency anatomical structures. To further enhance diagnostic fidelity, we replace the conventional mean squared error (MSE) loss with an edge-aware L1 loss function that better preserves critical anatomical textures. Additionally, an entropy-based criterion tracks variations in image uncertainty over iterations to determine the optimal stopping point, effectively preventing overfitting without the need for external validation data. Experimental results demonstrate that our model achieves an average improvement of 10.7 % in PSNR and 17.9 % in SSIM compared to DIP, reaching peak performance in just 60 iterations, faster than the 1360 and 2990 iterations required by DIP and DIP-AITV, respectively. These findings highlight the efficiency and effectiveness of our method for medical image denoising.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"193 ","pages":"Article 110374"},"PeriodicalIF":7.0000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525007255","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
This paper proposes an unsupervised denoising model designed to address Gaussian-Poisson hybrid noise in CT, MRI, and X-ray images. Traditional deep image prior (DIP) methods suffer from slow convergence, spectral bias, and overfitting, limiting their clinical applicability. In this paper, by applying the Fourier transform, we incorporate frequency-domain priors extracted from the observed noisy image at the input stage. Instead of using both amplitude and phase, we rely solely on the amplitude spectrum, which captures the energy distribution of various frequency components while avoiding the instability associated with phase information. Meanwhile, we retain the spatial domain information to preserve the image's structural integrity, ensuring the effective capture of both low-frequency details and high-frequency anatomical features. This dual-domain strategy allows fine details to be captured early in training, thereby mitigating spectral bias, accelerating convergence, and improving the preservation of high-frequency anatomical structures. To further enhance diagnostic fidelity, we replace the conventional mean squared error (MSE) loss with an edge-aware L1 loss function that better preserves critical anatomical textures. Additionally, an entropy-based criterion tracks variations in image uncertainty over iterations to determine the optimal stopping point, effectively preventing overfitting without the need for external validation data. Experimental results demonstrate that our model achieves an average improvement of 10.7 % in PSNR and 17.9 % in SSIM compared to DIP, reaching peak performance in just 60 iterations, faster than the 1360 and 2990 iterations required by DIP and DIP-AITV, respectively. These findings highlight the efficiency and effectiveness of our method for medical image denoising.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.