Masuma Aktar , Kuldeep Singh Yadav , Rabul Hussain Laskar
{"title":"频率感知深度网络和小波域单幅图像超分辨率的逐块生成对抗训练","authors":"Masuma Aktar , Kuldeep Singh Yadav , Rabul Hussain Laskar","doi":"10.1016/j.asoc.2025.114035","DOIUrl":null,"url":null,"abstract":"<div><div>With the increasing demand for high-resolution displays, image super-resolution has become essential for enhancing visual quality, especially in the era of 4K and 8K devices. Single image super-resolution (SISR) plays a vital role in numerous computer vision applications because of its ability to enhance image resolution while preserving important details. Recently, deep learning-based super-resolution methods, including methods based on generative adversarial networks (GANs) and transformers, have become mainstream. However, the existing methods face the challenge of preserving high-frequency details while maintaining overall image quality. They often struggle to reconstruct realistic and undistorted high-frequency information. To address this challenge, we propose the frequency-aware perceptual wavelet domain super-resolution (FA-PWSR) model. FA-PWSR deals with images’ low-frequency and high-frequency components separately and in parallel. FA-PWSR employs a divide-and-conquer strategy, utilizing stationary wavelet transform (SWT) to decompose images into low and high-frequency sub-images. Specialized subnetworks are designed to process each component, ensuring targeted optimization for different frequency bands. The reconstructed sub-images are subsequently integrated using the inverse stationary wavelet transform (ISWT) to generate the final SR image. FA-PWSR achieves superior performance over state-of-the-art methods in both fidelity (PSNR and SSIM) and perceptual metrics (PI and LPIPS) across various datasets including Set5, Set14, Urban100, and BSD100. Even in the challenging Urban100 dataset, FA-PWSR achieves a 1.754 dB improvement in PSNR and a 0.049 increase in SSIM compared to ESRGAN. Furthermore, our model significantly enhances perceptual quality, achieving a 12 % reduction in LPIPS on the same dataset. Moreover, visual comparisons confirm that FA-PWSR effectively preserves fine details, enhances edges, and noticeably reduces artifacts, leading to more realistic and high-fidelity super-resolved images.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"185 ","pages":"Article 114035"},"PeriodicalIF":6.6000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Frequency-aware deep networks and patch-wise generative adversarial training for single image super-resolution in wavelet domain\",\"authors\":\"Masuma Aktar , Kuldeep Singh Yadav , Rabul Hussain Laskar\",\"doi\":\"10.1016/j.asoc.2025.114035\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With the increasing demand for high-resolution displays, image super-resolution has become essential for enhancing visual quality, especially in the era of 4K and 8K devices. Single image super-resolution (SISR) plays a vital role in numerous computer vision applications because of its ability to enhance image resolution while preserving important details. Recently, deep learning-based super-resolution methods, including methods based on generative adversarial networks (GANs) and transformers, have become mainstream. However, the existing methods face the challenge of preserving high-frequency details while maintaining overall image quality. They often struggle to reconstruct realistic and undistorted high-frequency information. To address this challenge, we propose the frequency-aware perceptual wavelet domain super-resolution (FA-PWSR) model. FA-PWSR deals with images’ low-frequency and high-frequency components separately and in parallel. FA-PWSR employs a divide-and-conquer strategy, utilizing stationary wavelet transform (SWT) to decompose images into low and high-frequency sub-images. Specialized subnetworks are designed to process each component, ensuring targeted optimization for different frequency bands. The reconstructed sub-images are subsequently integrated using the inverse stationary wavelet transform (ISWT) to generate the final SR image. FA-PWSR achieves superior performance over state-of-the-art methods in both fidelity (PSNR and SSIM) and perceptual metrics (PI and LPIPS) across various datasets including Set5, Set14, Urban100, and BSD100. Even in the challenging Urban100 dataset, FA-PWSR achieves a 1.754 dB improvement in PSNR and a 0.049 increase in SSIM compared to ESRGAN. Furthermore, our model significantly enhances perceptual quality, achieving a 12 % reduction in LPIPS on the same dataset. Moreover, visual comparisons confirm that FA-PWSR effectively preserves fine details, enhances edges, and noticeably reduces artifacts, leading to more realistic and high-fidelity super-resolved images.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"185 \",\"pages\":\"Article 114035\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2025-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625013481\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625013481","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Frequency-aware deep networks and patch-wise generative adversarial training for single image super-resolution in wavelet domain
With the increasing demand for high-resolution displays, image super-resolution has become essential for enhancing visual quality, especially in the era of 4K and 8K devices. Single image super-resolution (SISR) plays a vital role in numerous computer vision applications because of its ability to enhance image resolution while preserving important details. Recently, deep learning-based super-resolution methods, including methods based on generative adversarial networks (GANs) and transformers, have become mainstream. However, the existing methods face the challenge of preserving high-frequency details while maintaining overall image quality. They often struggle to reconstruct realistic and undistorted high-frequency information. To address this challenge, we propose the frequency-aware perceptual wavelet domain super-resolution (FA-PWSR) model. FA-PWSR deals with images’ low-frequency and high-frequency components separately and in parallel. FA-PWSR employs a divide-and-conquer strategy, utilizing stationary wavelet transform (SWT) to decompose images into low and high-frequency sub-images. Specialized subnetworks are designed to process each component, ensuring targeted optimization for different frequency bands. The reconstructed sub-images are subsequently integrated using the inverse stationary wavelet transform (ISWT) to generate the final SR image. FA-PWSR achieves superior performance over state-of-the-art methods in both fidelity (PSNR and SSIM) and perceptual metrics (PI and LPIPS) across various datasets including Set5, Set14, Urban100, and BSD100. Even in the challenging Urban100 dataset, FA-PWSR achieves a 1.754 dB improvement in PSNR and a 0.049 increase in SSIM compared to ESRGAN. Furthermore, our model significantly enhances perceptual quality, achieving a 12 % reduction in LPIPS on the same dataset. Moreover, visual comparisons confirm that FA-PWSR effectively preserves fine details, enhances edges, and noticeably reduces artifacts, leading to more realistic and high-fidelity super-resolved images.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.