Information-Cycle Dual Contractive Learning with Regularization for Unsupervised Image-to-Image Translation

Xin Sun, Zhuang Yin, Siqi Zhang
DOI: 10.1109/ISAIEE57420.2022.00047
Published in: 2022 International Symposium on Advances in Informatics, Electronics and Education (ISAIEE), December 2022
Citations: 0

Abstract

In image-to-image translation, the goal is to impose the appearance features of the target domain while retaining the structural features of the input image. Contrastive learning methods have gradually been applied to image translation tasks. Despite prior progress in image translation models, it remains challenging to design an efficient learning setting that captures image features from multiple perspectives to build cross-domain mappings. In this paper, we propose a novel method based on information-cycle dual contrastive learning with regularization (ICDCLGAN-R) to improve overall image quality and visual observability. It consists of three main components: a dual-domain GAN, contrastive regularization (CR), and an information cycle. Specifically, the dual-domain GAN learns cross-domain image mappings and captures the domain gap more efficiently by maximizing the mutual information between corresponding patches of the input and output. Simultaneously, CR takes two images from one domain (one real and one generated) and one generated image from the other domain as the positive, anchor, and negative, respectively, pulling positive pairs together in a metric space while pushing apart the representations of negative pairs. CR thus compares multi-source image features while enhancing information reusability. In addition, we feed generated images back into the dual-domain GAN to perform an additional iteration, which reinforces and verifies the reconfigurability between images from different domains. In our experiments, we demonstrate that our method improves the quality of generated images and utilizes image features more efficiently.
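The triplet arrangement described in the abstract (anchor = generated image, positive = real image from the same domain, negative = generated image from the other domain) can be sketched as an InfoNCE-style objective over feature vectors. This is a minimal, pure-Python illustration of the contrastive idea only; the function names, the cosine-similarity measure, and the temperature value are assumptions, not the paper's exact formulation.

```python
import math

def cosine_sim(u, v):
    """Cosine similarity between two feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def contrastive_regularization(anchor, positive, negative, tau=0.07):
    """InfoNCE-style loss over one (anchor, positive, negative) triplet:
    small when the anchor aligns with the positive, large when it
    aligns with the negative."""
    s_pos = math.exp(cosine_sim(anchor, positive) / tau)
    s_neg = math.exp(cosine_sim(anchor, negative) / tau)
    return -math.log(s_pos / (s_pos + s_neg))
```

With orthogonal toy features, the loss is near zero when the anchor matches the positive and large when it matches the negative, which is the pull/push behavior the abstract describes.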