基于对比学习的语义分割领域自适应

2022 National Conference on Communications (NCC) Pub Date : 2022-05-24 DOI:10.1109/NCC55593.2022.9806740

Rishika Bhagwatkar, Saurabh Kemekar, Vinay Domatoti, Khursheed Munir Khan, Anamika Singh

{"title":"基于对比学习的语义分割领域自适应","authors":"Rishika Bhagwatkar, Saurabh Kemekar, Vinay Domatoti, Khursheed Munir Khan, Anamika Singh","doi":"10.1109/NCC55593.2022.9806740","DOIUrl":null,"url":null,"abstract":"Semantic segmentation is a crucial algorithm for identifying various objects in the surrounding of an autonomous vehicle. However, due to the limited size of real-world datasets, domain adaptation is employed. Hence, the models are made to adapt to real-world settings while being trained on large-scale synthetic datasets. In domain adaptation, domain-invariant features play a significant role in learning domain agnostic representations for each predefined category. While most of the prior work focuses on decreasing the distance between the domains, the works that utilize contrastive objectives for learning domain-invariant features depend heavily on the augmentations used. In this work, we completely eradicate the requirement of explicit data augmentations. We hypothesize that real-world images and their corresponding synthetic images are different views of the same abstract representation. To enhance the quality of domain-invariant features, we increase the mutual information between the two inputs. We first validate our hypothesis on the classification task using the standard datasets; Office31 and VisDA-2017. Further, we perform quantitative and qualitative analysis on the segmentation task using SYNTHIA, GTA and Cityscapes datasets.","PeriodicalId":403870,"journal":{"name":"2022 National Conference on Communications (NCC)","volume":"222 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Contrastive Learning-Based Domain Adaptation for Semantic Segmentation\",\"authors\":\"Rishika Bhagwatkar, Saurabh Kemekar, Vinay Domatoti, Khursheed Munir Khan, Anamika Singh\",\"doi\":\"10.1109/NCC55593.2022.9806740\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Semantic segmentation is a crucial algorithm for identifying various objects in the surrounding of an autonomous vehicle. However, due to the limited size of real-world datasets, domain adaptation is employed. Hence, the models are made to adapt to real-world settings while being trained on large-scale synthetic datasets. In domain adaptation, domain-invariant features play a significant role in learning domain agnostic representations for each predefined category. While most of the prior work focuses on decreasing the distance between the domains, the works that utilize contrastive objectives for learning domain-invariant features depend heavily on the augmentations used. In this work, we completely eradicate the requirement of explicit data augmentations. We hypothesize that real-world images and their corresponding synthetic images are different views of the same abstract representation. To enhance the quality of domain-invariant features, we increase the mutual information between the two inputs. We first validate our hypothesis on the classification task using the standard datasets; Office31 and VisDA-2017. Further, we perform quantitative and qualitative analysis on the segmentation task using SYNTHIA, GTA and Cityscapes datasets.\",\"PeriodicalId\":403870,\"journal\":{\"name\":\"2022 National Conference on Communications (NCC)\",\"volume\":\"222 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 National Conference on Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC55593.2022.9806740\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC55593.2022.9806740","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

语义分割是自动驾驶汽车识别周围各种物体的关键算法。然而，由于实际数据集的规模有限，因此采用了领域自适应。因此，在大规模合成数据集上进行训练时，模型可以适应现实世界的设置。在领域自适应中，领域不变特征在学习每个预定义类别的领域不可知表示方面起着重要作用。虽然大多数先前的工作侧重于减少域之间的距离，但利用对比目标来学习域不变特征的工作在很大程度上依赖于所使用的增强。在这项工作中，我们完全消除了显式数据增强的需求。我们假设真实世界的图像及其相应的合成图像是同一抽象表征的不同视图。为了提高域不变特征的质量，我们增加了两个输入之间的互信息。我们首先使用标准数据集验证我们对分类任务的假设;Office31和vista -2017。此外，我们使用SYNTHIA、GTA和cityscape数据集对分割任务进行了定量和定性分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Contrastive Learning-Based Domain Adaptation for Semantic Segmentation

Semantic segmentation is a crucial algorithm for identifying various objects in the surrounding of an autonomous vehicle. However, due to the limited size of real-world datasets, domain adaptation is employed. Hence, the models are made to adapt to real-world settings while being trained on large-scale synthetic datasets. In domain adaptation, domain-invariant features play a significant role in learning domain agnostic representations for each predefined category. While most of the prior work focuses on decreasing the distance between the domains, the works that utilize contrastive objectives for learning domain-invariant features depend heavily on the augmentations used. In this work, we completely eradicate the requirement of explicit data augmentations. We hypothesize that real-world images and their corresponding synthetic images are different views of the same abstract representation. To enhance the quality of domain-invariant features, we increase the mutual information between the two inputs. We first validate our hypothesis on the classification task using the standard datasets; Office31 and VisDA-2017. Further, we perform quantitative and qualitative analysis on the segmentation task using SYNTHIA, GTA and Cityscapes datasets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 National Conference on Communications (NCC)

自引率

0.00%

发文量