Keep and Extent: Unified Knowledge Embedding for Few-Shot Image Generation

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society Pub Date : 2025-04-15 DOI:10.1109/TIP.2025.3557578

Chenghao Xu;Jiexi Yan;Cheng Deng

{"title":"Keep and Extent: Unified Knowledge Embedding for Few-Shot Image Generation","authors":"Chenghao Xu;Jiexi Yan;Cheng Deng","doi":"10.1109/TIP.2025.3557578","DOIUrl":null,"url":null,"abstract":"Training Generative Adversarial Networks (GANs) with few-shot data has been a challenging task, which is prevalently solved by adapting a deep generative model pre-trained on the large-scale data in a source domain to small target domains with limited training data. In practice, most of the existing methods focus on designing task-specific fine-tuning strategies or regularization terms to select and preserve compatible knowledge across the source and target domain. However, the compatible knowledge greatly depends on the target domain and is entangled with the incompatible one. For the few-shot image generation task, without accurate compatible knowledge as prior, the generated images will strongly overfit the scarce target images. From a different perspective, we propose a unified learning paradigm for better knowledge transfer, i.e., keep and extent (KAE). Specifically, we orthogonally decompose the latent space of GANs, where the resting direction that has an unnoticeable impact on the generated images is adopted to extend the new target latent subspace while the remaining directions keep intact to reconstruct the source latent subspace. In this way, the whole source domain knowledge is included in the source latent subspace and the compatible knowledge will be automatically transferred to the target domain along the resting direction, rather than manually selecting. Extensive experimental results on several benchmark datasets demonstrate the superiority of our method.","PeriodicalId":94032,"journal":{"name":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","volume":"34 ","pages":"2315-2324"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10965854/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Training Generative Adversarial Networks (GANs) with few-shot data has been a challenging task, which is prevalently solved by adapting a deep generative model pre-trained on the large-scale data in a source domain to small target domains with limited training data. In practice, most of the existing methods focus on designing task-specific fine-tuning strategies or regularization terms to select and preserve compatible knowledge across the source and target domain. However, the compatible knowledge greatly depends on the target domain and is entangled with the incompatible one. For the few-shot image generation task, without accurate compatible knowledge as prior, the generated images will strongly overfit the scarce target images. From a different perspective, we propose a unified learning paradigm for better knowledge transfer, i.e., keep and extent (KAE). Specifically, we orthogonally decompose the latent space of GANs, where the resting direction that has an unnoticeable impact on the generated images is adopted to extend the new target latent subspace while the remaining directions keep intact to reconstruct the source latent subspace. In this way, the whole source domain knowledge is included in the source latent subspace and the compatible knowledge will be automatically transferred to the target domain along the resting direction, rather than manually selecting. Extensive experimental results on several benchmark datasets demonstrate the superiority of our method.

查看原文本刊更多论文

保持与扩展：面向少拍图像生成的统一知识嵌入

训练具有少量数据的生成对抗网络（GANs）一直是一项具有挑战性的任务，通常通过将在源域的大规模数据上预训练的深度生成模型适应于训练数据有限的小目标域来解决这一问题。在实践中，大多数现有方法侧重于设计特定于任务的微调策略或正则化术语，以选择和保留跨源和目标领域的兼容知识。然而，相容知识对目标领域的依赖很大，并且与不相容知识纠缠在一起。对于少镜头图像生成任务，如果没有像之前那样精确的兼容知识，生成的图像将会对稀缺的目标图像产生强烈的过拟合。从另一个角度，我们提出了一个统一的学习范式，即保持和程度（KAE）。具体来说，我们对gan的隐隐空间进行正交分解，其中采用对生成图像影响不明显的静止方向扩展新的目标隐隐子空间，其余方向保持不变重建源隐隐子空间。这样，整个源领域的知识都被包含在源潜子空间中，兼容的知识将沿着静止方向自动转移到目标领域，而不是人工选择。在多个基准数据集上的大量实验结果证明了该方法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

自引率

0.00%

发文量