{"title":"Bias-in-debias-out: Hierarchical channel-spatial bias calibration for cross-domain few-shot classification","authors":"Minghui Li , Hongxun Yao","doi":"10.1016/j.knosys.2025.114475","DOIUrl":null,"url":null,"abstract":"<div><div>The core challenge of cross-domain few-shot learning (CD-FSL) stems from models’ inability to generalize source-domain inductive biases to target domains under significant distribution shifts. While existing methods predominantly employ strategies like auxiliary target data adaptation, feature disentanglement, or metric space alignment, they overlook two inherent biases entrenched during source-domain training: (1) channel-wise dependency on source-specific feature patterns and (2) spatial-wise preference for source-typical structures, both of which hinder cross-domain transfer. We propose the first unified <u><em>C</em></u>hannel-<u><em>S</em></u>patial <u><em>D</em></u>ual-dimensional <u><em>B</em></u>ias <u><em>C</em></u>alibration (CSDBC) framework to systematically address these biases through progressive dilution, recomposition, and alignment. Our approach integrates three key innovations: (1) a parameter-free <u><em>S</em></u>tatic <u><em>B</em></u>ase-class <u><em>B</em></u>ias <u><em>D</em></u>ilution (SBBD) module that dilutes source-specific channel-spatial biases through layer-wise and point-wise modulation, effectively suppressing overfitting to source-specific patterns; (2) a <u><em>D</em></u>ynamic <u><em>N</em></u>ovel-class <u><em>B</em></u>ias <u><em>R</em></u>ecomposition (DNBR) module that generates target-adaptive channel-spatial soft masks via meta-optimized lightweight depthwise separable convolutions, enabling target-domain channel reweighting and spatial preference adjustment; and (3) a <u><em>N</em></u>ovel-class <u><em>C</em></u>ross-image <u><em>S</em></u>emantic <u><em>A</em></u>lignment (NCSA) module that establishes channel correlations and spatial correspondences between support-query pairs, significantly enhancing both discriminability and semantic consistency of target-domain features. Extensive experiments across eight CD-FSL benchmarks demonstrate consistent improvements, outperforming SOTA methods by 1.35 % (5-way 1-shot) and 2.00 % (5-way 5-shot) in average accuracy under varying domain shifts.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"330 ","pages":"Article 114475"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095070512501514X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The core challenge of cross-domain few-shot learning (CD-FSL) stems from models’ inability to generalize source-domain inductive biases to target domains under significant distribution shifts. While existing methods predominantly employ strategies like auxiliary target data adaptation, feature disentanglement, or metric space alignment, they overlook two inherent biases entrenched during source-domain training: (1) channel-wise dependency on source-specific feature patterns and (2) spatial-wise preference for source-typical structures, both of which hinder cross-domain transfer. We propose the first unified Channel-Spatial Dual-dimensional Bias Calibration (CSDBC) framework to systematically address these biases through progressive dilution, recomposition, and alignment. Our approach integrates three key innovations: (1) a parameter-free Static Base-class Bias Dilution (SBBD) module that dilutes source-specific channel-spatial biases through layer-wise and point-wise modulation, effectively suppressing overfitting to source-specific patterns; (2) a Dynamic Novel-class Bias Recomposition (DNBR) module that generates target-adaptive channel-spatial soft masks via meta-optimized lightweight depthwise separable convolutions, enabling target-domain channel reweighting and spatial preference adjustment; and (3) a Novel-class Cross-image Semantic Alignment (NCSA) module that establishes channel correlations and spatial correspondences between support-query pairs, significantly enhancing both discriminability and semantic consistency of target-domain features. Extensive experiments across eight CD-FSL benchmarks demonstrate consistent improvements, outperforming SOTA methods by 1.35 % (5-way 1-shot) and 2.00 % (5-way 5-shot) in average accuracy under varying domain shifts.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.