A simple yet lightweight module for enhancing domain generalization through relative representation

IF 7.6 | CAS Tier 1 (Computer Science) | JCR Q1 (COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE)
Meng Cao, Songcan Chen
Pattern Recognition, Volume 172, Article 112423. Published 2025-09-15. DOI: 10.1016/j.patcog.2025.112423

Abstract

Domain Generalization (DG) learns a model from multiple source domains to combat individual domain differences and ensure generalization to unseen domains. Most existing methods focus on learning domain-invariant absolute representations. However, we empirically observe that such representations often suffer from notable distribution divergence, leading to unstable performance in diverse unseen domains. In contrast, relative representations, constructed w.r.t. a set of anchors, naturally capture geometric relationships and exhibit intrinsic stability within a dataset. Despite this potential, their application to DG remains largely unexplored, due to their common transductive assumption that anchors require access to target-domain data, which is incompatible with the inductive setting of DG. To address this issue, we design Re2SL, a simple and lightweight plug-in module that follows a pre-trained encoder and constructs anchors solely from source-domain prototypes, thereby ensuring a completely inductive design. To our knowledge, Re2SL is the first to explore relative representation for DG. This design is inspired by the insight that ReSidual differences between absolute and domain-specific representations can spontaneously seek stable representations within the same distribution shared across all domains. Leveraging these stable representations, we construct cross-domain ReLative representation to enhance stability and transferability without accessing any target data during training or anchor computation. Empirical studies show that our constructed representation exhibits minimal H-divergence, confirming its stability. Notably, Re2SL achieves up to 4.3% improvement while reducing computational cost by 90%, demonstrating its efficiency.
Cited by: 0

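The anchor-based construction described in the abstract can be illustrated with a minimal sketch. This is a hypothetical NumPy illustration, not the authors' Re2SL code: it assumes anchors are taken as source-domain class prototypes (per-class mean embeddings), and each sample is re-encoded by its cosine similarities to those anchors, as in standard relative-representation schemes.

```python
import numpy as np

def class_prototypes(embeddings, labels):
    """Anchor set: one prototype (mean embedding) per source-domain class."""
    classes = np.unique(labels)
    return np.stack([embeddings[labels == c].mean(axis=0) for c in classes])

def relative_representation(embeddings, anchors, eps=1e-8):
    """Re-encode each absolute embedding as cosine similarities to the anchors."""
    e = embeddings / (np.linalg.norm(embeddings, axis=1, keepdims=True) + eps)
    a = anchors / (np.linalg.norm(anchors, axis=1, keepdims=True) + eps)
    return e @ a.T  # shape: (n_samples, n_anchors)

# Toy usage: 6 source samples with 4-dim encoder features and 2 classes.
rng = np.random.default_rng(0)
feats = rng.normal(size=(6, 4))
labels = np.array([0, 0, 0, 1, 1, 1])
anchors = class_prototypes(feats, labels)
rel = relative_representation(feats, anchors)
print(rel.shape)  # (6, 2): one cosine similarity per anchor
```

Because the anchors are computed only from source-domain data, the construction stays fully inductive: no target-domain samples are needed at training time or when the anchors are formed.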
Source journal: Pattern Recognition (Engineering Technology: Electrical & Electronic Engineering)
CiteScore: 14.40
Self-citation rate: 16.20%
Articles per year: 683
Review time: 5.6 months
Journal overview: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.