Lijun Gou , Jinrong Yang , Hangcheng Yu , Pan Wang , Xiaoping Li , Tuo Shi
{"title":"A semantic consistent object detection model for domain adaptation based on mixed-class distribution metrics","authors":"Lijun Gou , Jinrong Yang , Hangcheng Yu , Pan Wang , Xiaoping Li , Tuo Shi","doi":"10.1016/j.neucom.2024.128944","DOIUrl":null,"url":null,"abstract":"<div><div>Unsupervised domain adaptation is crucial for mitigating the performance degradation caused by domain bias in object detection tasks. In previous studies, the focus has been on pixel-level and instance-level shift alignment to minimize domain discrepancy. However, it is important to note that this method may inadvertently align single-class instance features with mixed-class instance features that belong to multiple categories within the same image during image-level domain adaptation. This challenge arises because each image in object detection tasks contains objects of multiple categories. To achieve the same category feature alignment between single-class and mixed-class, our method considers features with different mixed categories as a new class and proposes a mixed-classes <span><math><mi>H</mi></math></span>-divergence to reduce domain bias for object detection. To enhance both single-class and mixed-class semantic information, and to achieve semantic separation for the mixed-classes in <span><math><mi>H</mi></math></span>-divergence, we employ Semantic Prediction Models (SPM) and Semantic Bridging Components (SBC). Furthermore, we reweigh the loss of the pixel domain discriminator based on the SPM results to reduce sample imbalance. Our extensive experiments on widely used datasets illustrate how our method can robustly improve object detection in domain bias settings.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"616 ","pages":"Article 128944"},"PeriodicalIF":5.5000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231224017156","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Unsupervised domain adaptation is crucial for mitigating the performance degradation caused by domain bias in object detection tasks. In previous studies, the focus has been on pixel-level and instance-level shift alignment to minimize domain discrepancy. However, it is important to note that this method may inadvertently align single-class instance features with mixed-class instance features that belong to multiple categories within the same image during image-level domain adaptation. This challenge arises because each image in object detection tasks contains objects of multiple categories. To achieve the same category feature alignment between single-class and mixed-class, our method considers features with different mixed categories as a new class and proposes a mixed-classes -divergence to reduce domain bias for object detection. To enhance both single-class and mixed-class semantic information, and to achieve semantic separation for the mixed-classes in -divergence, we employ Semantic Prediction Models (SPM) and Semantic Bridging Components (SBC). Furthermore, we reweigh the loss of the pixel domain discriminator based on the SPM results to reduce sample imbalance. Our extensive experiments on widely used datasets illustrate how our method can robustly improve object detection in domain bias settings.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.