Multi-Component Image Translation for Deep Domain Generalization

2019 IEEE Winter Conference on Applications of Computer Vision (WACV) Pub Date : 2018-12-21 DOI:10.1109/WACV.2019.00067

Mohammad Mahfujur Rahman, C. Fookes, Mahsa Baktash, S. Sridharan

{"title":"Multi-Component Image Translation for Deep Domain Generalization","authors":"Mohammad Mahfujur Rahman, C. Fookes, Mahsa Baktash, S. Sridharan","doi":"10.1109/WACV.2019.00067","DOIUrl":null,"url":null,"abstract":"Domain adaption (DA) and domain generalization (DG) are two closely related methods which are both concerned with the task of assigning labels to an unlabeled data set. The only dissimilarity between these approaches is that DA can access the target data during the training phase, while the target data is totally unseen during the training phase in DG. The task of DG is challenging as we have no earlier knowledge of the target samples. If DA methods are applied directly to DG by a simple exclusion of the target data from training, poor performance will result for a given task. In this paper, we tackle the domain generalization challenge in two ways. In our first approach, we propose a novel deep domain generalization architecture utilizing synthetic data generated by a Generative Adversarial Network (GAN). The discrepancy between the generated images and synthetic images is minimized using existing domain discrepancy metrics such as maximum mean discrepancy or correlation alignment. In our second approach, we introduce a protocol for applying DA methods to a DG scenario by excluding the target data from the training phase, splitting the source data to training and validation parts, and treating the validation data as target data for DA. We conduct extensive experiments on four cross-domain benchmark datasets. Experimental results signify our proposed model outperforms the current state-of-the-art methods for DG.","PeriodicalId":436637,"journal":{"name":"2019 IEEE Winter Conference on Applications of Computer Vision (WACV)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"54","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Winter Conference on Applications of Computer Vision (WACV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WACV.2019.00067","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 54

Abstract

Domain adaption (DA) and domain generalization (DG) are two closely related methods which are both concerned with the task of assigning labels to an unlabeled data set. The only dissimilarity between these approaches is that DA can access the target data during the training phase, while the target data is totally unseen during the training phase in DG. The task of DG is challenging as we have no earlier knowledge of the target samples. If DA methods are applied directly to DG by a simple exclusion of the target data from training, poor performance will result for a given task. In this paper, we tackle the domain generalization challenge in two ways. In our first approach, we propose a novel deep domain generalization architecture utilizing synthetic data generated by a Generative Adversarial Network (GAN). The discrepancy between the generated images and synthetic images is minimized using existing domain discrepancy metrics such as maximum mean discrepancy or correlation alignment. In our second approach, we introduce a protocol for applying DA methods to a DG scenario by excluding the target data from the training phase, splitting the source data to training and validation parts, and treating the validation data as target data for DA. We conduct extensive experiments on four cross-domain benchmark datasets. Experimental results signify our proposed model outperforms the current state-of-the-art methods for DG.

查看原文本刊更多论文

基于深度域泛化的多分量图像平移

领域自适应(DA)和领域概化(DG)是两种密切相关的方法，它们都涉及到为未标记的数据集分配标签的任务。这两种方法的唯一不同之处在于，数据挖掘可以在训练阶段访问目标数据，而在数据挖掘中，目标数据在训练阶段是完全看不见的。DG的任务是具有挑战性的，因为我们对目标样品没有更早的了解。如果通过简单地将目标数据从训练中排除，直接将数据分析方法应用于DG，则会导致给定任务的性能不佳。在本文中，我们从两方面解决了领域泛化的挑战。在我们的第一种方法中，我们提出了一种新的深度域泛化架构，利用生成对抗网络(GAN)生成的合成数据。使用现有的域差异度量(如最大平均差异或相关对齐)最小化生成图像和合成图像之间的差异。在我们的第二种方法中，我们引入了一种协议，通过将目标数据从训练阶段排除，将源数据分割为训练和验证部分，并将验证数据作为数据处理的目标数据，将数据处理方法应用于DG场景。我们在四个跨域基准数据集上进行了广泛的实验。实验结果表明，我们提出的模型优于目前最先进的DG方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 IEEE Winter Conference on Applications of Computer Vision (WACV)

自引率

0.00%

发文量