{"title":"D2AP: Double Debiasing with Adaptive Proxies for Domain Generalization in Noisy Environment","authors":"Yunyun Wang, Xiaodong Liu, Yi Guo","doi":"10.1016/j.knosys.2025.113458","DOIUrl":null,"url":null,"abstract":"<div><div>Domain Generalization (DG) methods commonly rely on multiple source domains with correct annotations, while it is usually difficult to obtain a large amount of clean source samples in practice, which greatly limits their application in real noisy environments. Hence, we consider a more realistic setting of Noisy Domain Generalization (NDG), which learns with noisy data from multiple source domains. A simple solution is to introduce previous noise learning strategies into individual domains to detect and correct the noisy samples independently. However, knowledge from other domains can also help the noise detection and correction for the current domain. Moreover, there will be domain bias of domain imbalance in clean samples after noise detection, and confirmation bias of inaccurate pseudo-labels in noisy samples after noise correction, which greatly affect the learning performance. To this end, we propose a novel Double Debiasing method with Adaptive Proxies (D2AP) for NDG learning. In D2AP, an adaptive sample disentangling module is first developed with multi-scale Gaussian Mixture Model, so as to enable a mutual noise disentangling across domains. Further, double debiasing is proposed with the assistance of adaptive proxies, in order to make domain-invariant prototypes less sensitive to domain imbalance, and calibrate the pseudo-labels for noisy data, so as to address the both biases. Finally, empirical results over three benchmark datasets demonstrate the effectiveness of our D2AP.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"318 ","pages":"Article 113458"},"PeriodicalIF":7.2000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125005052","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Domain Generalization (DG) methods commonly rely on multiple source domains with correct annotations, while it is usually difficult to obtain a large amount of clean source samples in practice, which greatly limits their application in real noisy environments. Hence, we consider a more realistic setting of Noisy Domain Generalization (NDG), which learns with noisy data from multiple source domains. A simple solution is to introduce previous noise learning strategies into individual domains to detect and correct the noisy samples independently. However, knowledge from other domains can also help the noise detection and correction for the current domain. Moreover, there will be domain bias of domain imbalance in clean samples after noise detection, and confirmation bias of inaccurate pseudo-labels in noisy samples after noise correction, which greatly affect the learning performance. To this end, we propose a novel Double Debiasing method with Adaptive Proxies (D2AP) for NDG learning. In D2AP, an adaptive sample disentangling module is first developed with multi-scale Gaussian Mixture Model, so as to enable a mutual noise disentangling across domains. Further, double debiasing is proposed with the assistance of adaptive proxies, in order to make domain-invariant prototypes less sensitive to domain imbalance, and calibrate the pseudo-labels for noisy data, so as to address the both biases. Finally, empirical results over three benchmark datasets demonstrate the effectiveness of our D2AP.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.