{"title":"Contrastive Learning-Based Domain Adaptation for Semantic Segmentation","authors":"Rishika Bhagwatkar, Saurabh Kemekar, Vinay Domatoti, Khursheed Munir Khan, Anamika Singh","doi":"10.1109/NCC55593.2022.9806740","DOIUrl":null,"url":null,"abstract":"Semantic segmentation is a crucial algorithm for identifying various objects in the surrounding of an autonomous vehicle. However, due to the limited size of real-world datasets, domain adaptation is employed. Hence, the models are made to adapt to real-world settings while being trained on large-scale synthetic datasets. In domain adaptation, domain-invariant features play a significant role in learning domain agnostic representations for each predefined category. While most of the prior work focuses on decreasing the distance between the domains, the works that utilize contrastive objectives for learning domain-invariant features depend heavily on the augmentations used. In this work, we completely eradicate the requirement of explicit data augmentations. We hypothesize that real-world images and their corresponding synthetic images are different views of the same abstract representation. To enhance the quality of domain-invariant features, we increase the mutual information between the two inputs. We first validate our hypothesis on the classification task using the standard datasets; Office31 and VisDA-2017. Further, we perform quantitative and qualitative analysis on the segmentation task using SYNTHIA, GTA and Cityscapes datasets.","PeriodicalId":403870,"journal":{"name":"2022 National Conference on Communications (NCC)","volume":"222 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC55593.2022.9806740","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Semantic segmentation is a crucial algorithm for identifying various objects in the surrounding of an autonomous vehicle. However, due to the limited size of real-world datasets, domain adaptation is employed. Hence, the models are made to adapt to real-world settings while being trained on large-scale synthetic datasets. In domain adaptation, domain-invariant features play a significant role in learning domain agnostic representations for each predefined category. While most of the prior work focuses on decreasing the distance between the domains, the works that utilize contrastive objectives for learning domain-invariant features depend heavily on the augmentations used. In this work, we completely eradicate the requirement of explicit data augmentations. We hypothesize that real-world images and their corresponding synthetic images are different views of the same abstract representation. To enhance the quality of domain-invariant features, we increase the mutual information between the two inputs. We first validate our hypothesis on the classification task using the standard datasets; Office31 and VisDA-2017. Further, we perform quantitative and qualitative analysis on the segmentation task using SYNTHIA, GTA and Cityscapes datasets.