Yusen ZHANG, Min Li, Wei Cai, Yao Gou, Shuaibing Shi
{"title":"SARCUT:基于自注意和相对论辨别的光学sar图像翻译的对比学习","authors":"Yusen ZHANG, Min Li, Wei Cai, Yao Gou, Shuaibing Shi","doi":"10.1117/12.2669086","DOIUrl":null,"url":null,"abstract":"Focusing on how to obtain high-quality and sufficient synthetic aperture radar (SAR) data in deep learning, this paper proposed a new mothed named SARCUT (Self-Attention Relativistic Contrastive Learning for Unpaired Image-to-Image Translation) to translate optical images into SAR images. In order to improve the coordination of generated images and stabilize the training process, we constructed a generator with the self-attention mechanism and spectral normalization operation. Meanwhile, relativistic discrimination adversarial loss function was designed to accelerate the model convergence and improved the authenticity of the generated images. Experiments on open datasets with 6 image quantitative evaluation metrics showed our model can learn the deeper internal relations and main features between multiple source images. Compared with the classical methods, SARCUT has more advantages in establishing the real image domain mapping, both the quality and authenticity of the generated image are significantly improved.","PeriodicalId":236099,"journal":{"name":"International Workshop on Frontiers of Graphics and Image Processing","volume":"12644 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SARCUT: Contrastive learning for optical-SAR image translation with self-attention and relativistic discrimination\",\"authors\":\"Yusen ZHANG, Min Li, Wei Cai, Yao Gou, Shuaibing Shi\",\"doi\":\"10.1117/12.2669086\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Focusing on how to obtain high-quality and sufficient synthetic aperture radar (SAR) data in deep learning, this paper proposed a new mothed named SARCUT (Self-Attention Relativistic Contrastive Learning for Unpaired Image-to-Image Translation) to translate optical images into SAR images. In order to improve the coordination of generated images and stabilize the training process, we constructed a generator with the self-attention mechanism and spectral normalization operation. Meanwhile, relativistic discrimination adversarial loss function was designed to accelerate the model convergence and improved the authenticity of the generated images. Experiments on open datasets with 6 image quantitative evaluation metrics showed our model can learn the deeper internal relations and main features between multiple source images. Compared with the classical methods, SARCUT has more advantages in establishing the real image domain mapping, both the quality and authenticity of the generated image are significantly improved.\",\"PeriodicalId\":236099,\"journal\":{\"name\":\"International Workshop on Frontiers of Graphics and Image Processing\",\"volume\":\"12644 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Workshop on Frontiers of Graphics and Image Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2669086\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Workshop on Frontiers of Graphics and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2669086","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
针对如何在深度学习中获得高质量和充足的合成孔径雷达(SAR)数据,本文提出了一种新的方法SARCUT (Self-Attention Relativistic contrast learning for Unpaired Image-to-Image Translation),将光学图像转化为SAR图像。为了提高生成图像的协调性,稳定训练过程,我们构造了一个具有自关注机制和谱归一化操作的生成器。同时,设计了相对论性判别对抗损失函数,加快了模型的收敛速度,提高了生成图像的真实性。在具有6个图像定量评价指标的开放数据集上的实验表明,该模型可以学习到多源图像之间更深层次的内部关系和主要特征。与经典方法相比,SARCUT在建立真实图像域映射方面更有优势,生成的图像质量和真实性都有显著提高。
SARCUT: Contrastive learning for optical-SAR image translation with self-attention and relativistic discrimination
Focusing on how to obtain high-quality and sufficient synthetic aperture radar (SAR) data in deep learning, this paper proposed a new mothed named SARCUT (Self-Attention Relativistic Contrastive Learning for Unpaired Image-to-Image Translation) to translate optical images into SAR images. In order to improve the coordination of generated images and stabilize the training process, we constructed a generator with the self-attention mechanism and spectral normalization operation. Meanwhile, relativistic discrimination adversarial loss function was designed to accelerate the model convergence and improved the authenticity of the generated images. Experiments on open datasets with 6 image quantitative evaluation metrics showed our model can learn the deeper internal relations and main features between multiple source images. Compared with the classical methods, SARCUT has more advantages in establishing the real image domain mapping, both the quality and authenticity of the generated image are significantly improved.