Zhiming Cheng , Mingxia Liu , Defu Yang , Zhidong Zhao , Chenggang Yan , Shuai Wang
{"title":"Domain generalization for image classification with dynamic decision boundary","authors":"Zhiming Cheng , Mingxia Liu , Defu Yang , Zhidong Zhao , Chenggang Yan , Shuai Wang","doi":"10.1016/j.patcog.2025.111678","DOIUrl":null,"url":null,"abstract":"<div><div>Domain Generalization (DG) has been widely used in image classification tasks to effectively handle distribution shifts between source and target domains without accessing target domain data. Traditional DG methods typically rely on static models trained on the source domain for inference on unseen target domains, limiting their ability to fully leverage target domain characteristics. Test-Time Adaptation (TTA)-based DG methods improve generalization performance by adapting the model during inference using target domain samples. However, this often requires parameter fine-tuning on unseen target domains during inference, which may lead to forgetting of source domain knowledge or reduce real-time performance. To address this limitation, we propose a Dynamic Decision Boundary-based DG (DDB-DG) method for image classification, which effectively leverages target domain characteristics during inference without requiring additional training. In the proposed DDB-DG, we first introduce a Prototype-guide Multi-lever Prediction (PMP) module, which guides the dynamic adjustment of the decision boundary learned from the source domain by leveraging the correlation between test samples and prototypes. To enhance the accuracy of prototype computation, we also propose a data augmentation method called Uncertainty Style Mixture (USM), which expands the diversity of training samples to improve model generalization performance and enhance the accuracy of pseudo-labeling for target domain samples in prototypes. We validate DDB-DG using different backbone networks on three publicly available benchmark datasets: PACS, Office-Home, and VLCS. Experimental results demonstrate that our method achieves superior performance on both ResNet-18 and ResNet-50, surpassing the state-of-the-art DG and TTA methods.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"166 ","pages":"Article 111678"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325003383","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Domain Generalization (DG) has been widely used in image classification tasks to effectively handle distribution shifts between source and target domains without accessing target domain data. Traditional DG methods typically rely on static models trained on the source domain for inference on unseen target domains, limiting their ability to fully leverage target domain characteristics. Test-Time Adaptation (TTA)-based DG methods improve generalization performance by adapting the model during inference using target domain samples. However, this often requires parameter fine-tuning on unseen target domains during inference, which may lead to forgetting of source domain knowledge or reduce real-time performance. To address this limitation, we propose a Dynamic Decision Boundary-based DG (DDB-DG) method for image classification, which effectively leverages target domain characteristics during inference without requiring additional training. In the proposed DDB-DG, we first introduce a Prototype-guide Multi-lever Prediction (PMP) module, which guides the dynamic adjustment of the decision boundary learned from the source domain by leveraging the correlation between test samples and prototypes. To enhance the accuracy of prototype computation, we also propose a data augmentation method called Uncertainty Style Mixture (USM), which expands the diversity of training samples to improve model generalization performance and enhance the accuracy of pseudo-labeling for target domain samples in prototypes. We validate DDB-DG using different backbone networks on three publicly available benchmark datasets: PACS, Office-Home, and VLCS. Experimental results demonstrate that our method achieves superior performance on both ResNet-18 and ResNet-50, surpassing the state-of-the-art DG and TTA methods.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.