D2AP: Double Debiasing with Adaptive Proxies for Domain Generalization in Noisy Environment

IF 7.2 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Yunyun Wang, Xiaodong Liu, Yi Guo
{"title":"D2AP: Double Debiasing with Adaptive Proxies for Domain Generalization in Noisy Environment","authors":"Yunyun Wang,&nbsp;Xiaodong Liu,&nbsp;Yi Guo","doi":"10.1016/j.knosys.2025.113458","DOIUrl":null,"url":null,"abstract":"<div><div>Domain Generalization (DG) methods commonly rely on multiple source domains with correct annotations, while it is usually difficult to obtain a large amount of clean source samples in practice, which greatly limits their application in real noisy environments. Hence, we consider a more realistic setting of Noisy Domain Generalization (NDG), which learns with noisy data from multiple source domains. A simple solution is to introduce previous noise learning strategies into individual domains to detect and correct the noisy samples independently. However, knowledge from other domains can also help the noise detection and correction for the current domain. Moreover, there will be domain bias of domain imbalance in clean samples after noise detection, and confirmation bias of inaccurate pseudo-labels in noisy samples after noise correction, which greatly affect the learning performance. To this end, we propose a novel Double Debiasing method with Adaptive Proxies (D2AP) for NDG learning. In D2AP, an adaptive sample disentangling module is first developed with multi-scale Gaussian Mixture Model, so as to enable a mutual noise disentangling across domains. Further, double debiasing is proposed with the assistance of adaptive proxies, in order to make domain-invariant prototypes less sensitive to domain imbalance, and calibrate the pseudo-labels for noisy data, so as to address the both biases. Finally, empirical results over three benchmark datasets demonstrate the effectiveness of our D2AP.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"318 ","pages":"Article 113458"},"PeriodicalIF":7.2000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125005052","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Domain Generalization (DG) methods commonly rely on multiple source domains with correct annotations, while it is usually difficult to obtain a large amount of clean source samples in practice, which greatly limits their application in real noisy environments. Hence, we consider a more realistic setting of Noisy Domain Generalization (NDG), which learns with noisy data from multiple source domains. A simple solution is to introduce previous noise learning strategies into individual domains to detect and correct the noisy samples independently. However, knowledge from other domains can also help the noise detection and correction for the current domain. Moreover, there will be domain bias of domain imbalance in clean samples after noise detection, and confirmation bias of inaccurate pseudo-labels in noisy samples after noise correction, which greatly affect the learning performance. To this end, we propose a novel Double Debiasing method with Adaptive Proxies (D2AP) for NDG learning. In D2AP, an adaptive sample disentangling module is first developed with multi-scale Gaussian Mixture Model, so as to enable a mutual noise disentangling across domains. Further, double debiasing is proposed with the assistance of adaptive proxies, in order to make domain-invariant prototypes less sensitive to domain imbalance, and calibrate the pseudo-labels for noisy data, so as to address the both biases. Finally, empirical results over three benchmark datasets demonstrate the effectiveness of our D2AP.
基于自适应代理的双去偏噪声环境下的领域泛化
领域泛化(DG)方法通常依赖于多个具有正确注释的源域,而在实际应用中通常难以获得大量干净的源样本,这极大地限制了其在实际噪声环境中的应用。因此,我们考虑了一个更现实的噪声域泛化(NDG)设置,它使用来自多个源域的噪声数据进行学习。一种简单的解决方案是将以前的噪声学习策略引入到各个域中,独立地检测和纠正噪声样本。然而,来自其他领域的知识也可以帮助当前领域的噪声检测和校正。此外,在噪声检测后的干净样本中会存在域不平衡的域偏差,在噪声校正后的有噪声样本中会存在伪标签不准确的确认偏差,极大地影响了学习性能。为此,我们提出了一种新的基于自适应代理(D2AP)的双去偏方法用于NDG学习。在D2AP中,首先利用多尺度高斯混合模型开发了自适应样本解纠缠模块,实现了跨域的相互噪声解纠缠。在此基础上,提出了基于自适应代理的双去偏方法,以降低域不变原型对域不平衡的敏感性,并对噪声数据的伪标签进行校正,以解决这两种偏差。最后,在三个基准数据集上的实证结果证明了我们的D2AP的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Knowledge-Based Systems
Knowledge-Based Systems 工程技术-计算机:人工智能
CiteScore
14.80
自引率
12.50%
发文量
1245
审稿时长
7.8 months
期刊介绍: Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信