Domain generalization for image classification with dynamic decision boundary

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Zhiming Cheng , Mingxia Liu , Defu Yang , Zhidong Zhao , Chenggang Yan , Shuai Wang
{"title":"Domain generalization for image classification with dynamic decision boundary","authors":"Zhiming Cheng ,&nbsp;Mingxia Liu ,&nbsp;Defu Yang ,&nbsp;Zhidong Zhao ,&nbsp;Chenggang Yan ,&nbsp;Shuai Wang","doi":"10.1016/j.patcog.2025.111678","DOIUrl":null,"url":null,"abstract":"<div><div>Domain Generalization (DG) has been widely used in image classification tasks to effectively handle distribution shifts between source and target domains without accessing target domain data. Traditional DG methods typically rely on static models trained on the source domain for inference on unseen target domains, limiting their ability to fully leverage target domain characteristics. Test-Time Adaptation (TTA)-based DG methods improve generalization performance by adapting the model during inference using target domain samples. However, this often requires parameter fine-tuning on unseen target domains during inference, which may lead to forgetting of source domain knowledge or reduce real-time performance. To address this limitation, we propose a Dynamic Decision Boundary-based DG (DDB-DG) method for image classification, which effectively leverages target domain characteristics during inference without requiring additional training. In the proposed DDB-DG, we first introduce a Prototype-guide Multi-lever Prediction (PMP) module, which guides the dynamic adjustment of the decision boundary learned from the source domain by leveraging the correlation between test samples and prototypes. To enhance the accuracy of prototype computation, we also propose a data augmentation method called Uncertainty Style Mixture (USM), which expands the diversity of training samples to improve model generalization performance and enhance the accuracy of pseudo-labeling for target domain samples in prototypes. We validate DDB-DG using different backbone networks on three publicly available benchmark datasets: PACS, Office-Home, and VLCS. Experimental results demonstrate that our method achieves superior performance on both ResNet-18 and ResNet-50, surpassing the state-of-the-art DG and TTA methods.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"166 ","pages":"Article 111678"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325003383","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Domain Generalization (DG) has been widely used in image classification tasks to effectively handle distribution shifts between source and target domains without accessing target domain data. Traditional DG methods typically rely on static models trained on the source domain for inference on unseen target domains, limiting their ability to fully leverage target domain characteristics. Test-Time Adaptation (TTA)-based DG methods improve generalization performance by adapting the model during inference using target domain samples. However, this often requires parameter fine-tuning on unseen target domains during inference, which may lead to forgetting of source domain knowledge or reduce real-time performance. To address this limitation, we propose a Dynamic Decision Boundary-based DG (DDB-DG) method for image classification, which effectively leverages target domain characteristics during inference without requiring additional training. In the proposed DDB-DG, we first introduce a Prototype-guide Multi-lever Prediction (PMP) module, which guides the dynamic adjustment of the decision boundary learned from the source domain by leveraging the correlation between test samples and prototypes. To enhance the accuracy of prototype computation, we also propose a data augmentation method called Uncertainty Style Mixture (USM), which expands the diversity of training samples to improve model generalization performance and enhance the accuracy of pseudo-labeling for target domain samples in prototypes. We validate DDB-DG using different backbone networks on three publicly available benchmark datasets: PACS, Office-Home, and VLCS. Experimental results demonstrate that our method achieves superior performance on both ResNet-18 and ResNet-50, surpassing the state-of-the-art DG and TTA methods.
基于动态决策边界的图像分类领域泛化
在不访问目标域数据的情况下,域概化(DG)可以有效地处理源域和目标域之间的分布变化,已广泛应用于图像分类任务中。传统的DG方法通常依赖于在源域上训练的静态模型来推断未知的目标域,限制了它们充分利用目标域特征的能力。基于测试时间自适应(TTA)的DG方法通过在目标域样本的推理过程中自适应模型来提高泛化性能。然而,这通常需要在推理过程中对未知的目标域进行参数微调,这可能导致源域知识的遗忘或降低实时性能。为了解决这一限制,我们提出了一种基于动态决策边界的DG (DDB-DG)图像分类方法,该方法在推理过程中有效地利用了目标域的特征,而不需要额外的训练。在提出的DDB-DG中,我们首先引入了原型引导多级预测(PMP)模块,该模块利用测试样本和原型之间的相关性,指导从源域学习到的决策边界的动态调整。为了提高原型计算的准确性,我们还提出了一种称为不确定性风格混合(USM)的数据增强方法,该方法扩大了训练样本的多样性,以提高模型泛化性能,并提高了原型中目标域样本的伪标记准确性。我们在三个公开可用的基准数据集(PACS、Office-Home和VLCS)上使用不同的骨干网验证了dbb - dg。实验结果表明,我们的方法在ResNet-18和ResNet-50上都取得了优异的性能,超过了最先进的DG和TTA方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Pattern Recognition
Pattern Recognition 工程技术-工程:电子与电气
CiteScore
14.40
自引率
16.20%
发文量
683
审稿时长
5.6 months
期刊介绍: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信