Zilu Ying;Wenyu Ke;Yikui Zhai;Zhihao Long;Jianhong Zhou;Hufei Zhu;C. L. Philip Chen
{"title":"DiffuSAR: SAR图像生成的频域感知扩散模型","authors":"Zilu Ying;Wenyu Ke;Yikui Zhai;Zhihao Long;Jianhong Zhou;Hufei Zhu;C. L. Philip Chen","doi":"10.1109/JSTARS.2025.3563798","DOIUrl":null,"url":null,"abstract":"In recent years, deep learning-based synthetic aperture radar (SAR) image detection, recognition, and segmentation models achieve remarkable accuracy when trained on large amounts of SAR image samples. However, the acquisition of SAR images tends to be costly. A practical approach to address this issue involves the use of artificially generated training samples to supplement the dataset. Current SAR image generation methods based on generative adversarial networks have issues with generation fidelity and are challenge to converge. Moreover, most of these methods only consider the spatial domain of SAR images without exploring the characteristics of their frequency domain. To tackle the aforementioned problems, we proposed a lightweight frequency domain-aware SAR image generation model based on the denoising diffusion probabilistic model. The proposed generative model is capable of producing highly realistic artificial SAR image samples while converging stably. Meanwhile, our research reveals that high-frequency (HF) components of SAR images play a crucial role in the generation process, and a frequency adjustment module (FAM) based on the fast Fourier transform is implemented in our generation model to prevent interference of HF components by low-frequency components. Tests on the MSTAR dataset highlight our model's advantages in both generation quality and parameter efficiency compared to other cutting-edge image generation models. An additional ablation study also confirms the effectiveness of our proposed FAM, which reduces the Fréchet inception distance by 2.85%, confirming our hypothesis that HF components are more crucial to SAR image generation.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"18 ","pages":"11851-11866"},"PeriodicalIF":4.7000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10974573","citationCount":"0","resultStr":"{\"title\":\"DiffuSAR: Frequency Domain-Aware Diffusion Model for SAR Image Generation\",\"authors\":\"Zilu Ying;Wenyu Ke;Yikui Zhai;Zhihao Long;Jianhong Zhou;Hufei Zhu;C. L. Philip Chen\",\"doi\":\"10.1109/JSTARS.2025.3563798\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, deep learning-based synthetic aperture radar (SAR) image detection, recognition, and segmentation models achieve remarkable accuracy when trained on large amounts of SAR image samples. However, the acquisition of SAR images tends to be costly. A practical approach to address this issue involves the use of artificially generated training samples to supplement the dataset. Current SAR image generation methods based on generative adversarial networks have issues with generation fidelity and are challenge to converge. Moreover, most of these methods only consider the spatial domain of SAR images without exploring the characteristics of their frequency domain. To tackle the aforementioned problems, we proposed a lightweight frequency domain-aware SAR image generation model based on the denoising diffusion probabilistic model. The proposed generative model is capable of producing highly realistic artificial SAR image samples while converging stably. Meanwhile, our research reveals that high-frequency (HF) components of SAR images play a crucial role in the generation process, and a frequency adjustment module (FAM) based on the fast Fourier transform is implemented in our generation model to prevent interference of HF components by low-frequency components. Tests on the MSTAR dataset highlight our model's advantages in both generation quality and parameter efficiency compared to other cutting-edge image generation models. An additional ablation study also confirms the effectiveness of our proposed FAM, which reduces the Fréchet inception distance by 2.85%, confirming our hypothesis that HF components are more crucial to SAR image generation.\",\"PeriodicalId\":13116,\"journal\":{\"name\":\"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing\",\"volume\":\"18 \",\"pages\":\"11851-11866\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2025-04-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10974573\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10974573/\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10974573/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
DiffuSAR: Frequency Domain-Aware Diffusion Model for SAR Image Generation
In recent years, deep learning-based synthetic aperture radar (SAR) image detection, recognition, and segmentation models achieve remarkable accuracy when trained on large amounts of SAR image samples. However, the acquisition of SAR images tends to be costly. A practical approach to address this issue involves the use of artificially generated training samples to supplement the dataset. Current SAR image generation methods based on generative adversarial networks have issues with generation fidelity and are challenge to converge. Moreover, most of these methods only consider the spatial domain of SAR images without exploring the characteristics of their frequency domain. To tackle the aforementioned problems, we proposed a lightweight frequency domain-aware SAR image generation model based on the denoising diffusion probabilistic model. The proposed generative model is capable of producing highly realistic artificial SAR image samples while converging stably. Meanwhile, our research reveals that high-frequency (HF) components of SAR images play a crucial role in the generation process, and a frequency adjustment module (FAM) based on the fast Fourier transform is implemented in our generation model to prevent interference of HF components by low-frequency components. Tests on the MSTAR dataset highlight our model's advantages in both generation quality and parameter efficiency compared to other cutting-edge image generation models. An additional ablation study also confirms the effectiveness of our proposed FAM, which reduces the Fréchet inception distance by 2.85%, confirming our hypothesis that HF components are more crucial to SAR image generation.
期刊介绍:
The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.