Using dynamic knowledge for kernel modulation: Towards image generation via one-shot multi-domain adaptation

IF 7.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Pattern Recognition Pub Date : 2025-10-04 DOI:10.1016/j.patcog.2025.112489

Yusen Zhang, Min Li, Xianjie Zhang, Song Yan, Yujie He

{"title":"Using dynamic knowledge for kernel modulation: Towards image generation via one-shot multi-domain adaptation","authors":"Yusen Zhang, Min Li, Xianjie Zhang, Song Yan, Yujie He","doi":"10.1016/j.patcog.2025.112489","DOIUrl":null,"url":null,"abstract":"<div><div>One-shot domain adaptation across multiple image domains aims to learn complex image distributions using just one training image from each target domain. Existing methods often select, preserve and transfer prior knowledge from the source domain pre-trained model to learn the target model with distinct styles. However, they always neglect the cross-domain shared knowledge and fail to consider the adaptation relationships in selecting source knowledge when facing varying styles in multi-target domain scenarios, casting doubt on their suitability. In this paper, we propose a novel one-shot multi-target image domain adaptation model based on kernel modulation. By leveraging the similarity information inherent in cross-domain knowledge to guide the correlation of feature learning, our method allows for precise control over the transfer of task-relevant knowledge while minimizing irrelevant information. Furthermore, we propose a novel cross-domain contrastive loss that incorporates dual constraints of structure and style. By effectively mining strong negative samples from cross-domain knowledge, it aims to maximize the structural feature from the source domain and accurately reflect the unique attributes of various style target domains. Extensive experiments on several datasets show that our method offers significant advantages in concurrently establishing multiple image domain mapping relationships. Moreover, it can effectively explore the potential for knowledge transfer in cross-domain feature learning, thereby generating higher-quality domain-adapted images.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"172 ","pages":"Article 112489"},"PeriodicalIF":7.6000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325011525","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

One-shot domain adaptation across multiple image domains aims to learn complex image distributions using just one training image from each target domain. Existing methods often select, preserve and transfer prior knowledge from the source domain pre-trained model to learn the target model with distinct styles. However, they always neglect the cross-domain shared knowledge and fail to consider the adaptation relationships in selecting source knowledge when facing varying styles in multi-target domain scenarios, casting doubt on their suitability. In this paper, we propose a novel one-shot multi-target image domain adaptation model based on kernel modulation. By leveraging the similarity information inherent in cross-domain knowledge to guide the correlation of feature learning, our method allows for precise control over the transfer of task-relevant knowledge while minimizing irrelevant information. Furthermore, we propose a novel cross-domain contrastive loss that incorporates dual constraints of structure and style. By effectively mining strong negative samples from cross-domain knowledge, it aims to maximize the structural feature from the source domain and accurately reflect the unique attributes of various style target domains. Extensive experiments on several datasets show that our method offers significant advantages in concurrently establishing multiple image domain mapping relationships. Moreover, it can effectively explore the potential for knowledge transfer in cross-domain feature learning, thereby generating higher-quality domain-adapted images.

查看原文本刊更多论文

利用动态知识进行核调制：实现一次多域自适应图像生成

跨多个图像域的一次性域自适应旨在仅使用来自每个目标域的一个训练图像来学习复杂的图像分布。现有的方法通常是从源域预训练模型中选择、保留和转移先验知识来学习具有不同风格的目标模型。然而，在面对多目标领域场景中不同风格的源知识选择时，往往忽略了跨领域共享知识，没有考虑源知识的自适应关系，使其适用性受到质疑。本文提出了一种基于核调制的单镜头多目标图像域自适应模型。通过利用跨领域知识中固有的相似性信息来指导特征学习的相关性，我们的方法可以在最小化不相关信息的同时精确控制任务相关知识的转移。此外，我们提出了一种新的跨域对比损失，它包含结构和风格的双重约束。通过有效地从跨领域知识中挖掘强负样本，最大限度地提取源领域的结构特征，准确地反映各种风格目标领域的独特属性。在多个数据集上的大量实验表明，该方法在同时建立多个图像域映射关系方面具有显著的优势。此外，它可以有效地挖掘跨领域特征学习中知识转移的潜力，从而生成更高质量的领域适应图像。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Pattern Recognition 工程技术-工程：电子与电气

CiteScore

14.40

自引率

16.20%

发文量

683

审稿时长

5.6 months

期刊介绍： The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.