Wentao Liu , Zhiwei Ni , Xuhui Zhu , Qian Chen , Liping Ni , Pingfan Xia
{"title":"Spectrum intervention based invariant causal representation learning for single-domain generalizable medical image segmentation","authors":"Wentao Liu , Zhiwei Ni , Xuhui Zhu , Qian Chen , Liping Ni , Pingfan Xia","doi":"10.1016/j.media.2025.103741","DOIUrl":null,"url":null,"abstract":"<div><div>The performance of a well-trained segmentation model is often trapped by domain shift caused by acquisition variance. Existing efforts are devoted to expanding the diversity of single-source samples, as well as learning domain-invariant representations. Essentially, they are still modeling the statistical dependence between sample-label pairs to achieve a superficial portrayal of reality. On the contrary, we propose a Spectrum Intervention based Invariant Causal Representation Learning (SI<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>CRL) framework, to unify the data generation and representation learning from causal view. Specifically, for the data generation, the unknown object elements can be reified in frequency domain as phase variables, then we propose an amplitude-based intervention module to generate low-frequency perturbations via random-weighted multilayer convolutional network. For the causal representations, a two-stage causal synergy modeling process is proposed to derive unobservable causal factors. In the first stage, the style-sensitive non-causal factors lying in the shallow layer of encoder are filtered out by contrastive-based causal decoupling mechanism. In the second stage, the hierarchical features in decoder are first factorized with cross-covariance regularization to ensure channel-wise independence; Subsequently, we introduce an adversarial-based causal purification module, which encourages the decoder to iteratively update causally sufficient information and make domain-robust predictions. We evaluate our SI<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>CRL against the state-of-the-art methods on cross-site prostate MRI segmentation, cross-modality (CT-MRI) abdominal multi-organ segmentation, and cross-sequence (MRI) cardiac segmentation. Our approach achieves consistent performance gains compared to these peer methods.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"105 ","pages":"Article 103741"},"PeriodicalIF":11.8000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525002889","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The performance of a well-trained segmentation model is often trapped by domain shift caused by acquisition variance. Existing efforts are devoted to expanding the diversity of single-source samples, as well as learning domain-invariant representations. Essentially, they are still modeling the statistical dependence between sample-label pairs to achieve a superficial portrayal of reality. On the contrary, we propose a Spectrum Intervention based Invariant Causal Representation Learning (SICRL) framework, to unify the data generation and representation learning from causal view. Specifically, for the data generation, the unknown object elements can be reified in frequency domain as phase variables, then we propose an amplitude-based intervention module to generate low-frequency perturbations via random-weighted multilayer convolutional network. For the causal representations, a two-stage causal synergy modeling process is proposed to derive unobservable causal factors. In the first stage, the style-sensitive non-causal factors lying in the shallow layer of encoder are filtered out by contrastive-based causal decoupling mechanism. In the second stage, the hierarchical features in decoder are first factorized with cross-covariance regularization to ensure channel-wise independence; Subsequently, we introduce an adversarial-based causal purification module, which encourages the decoder to iteratively update causally sufficient information and make domain-robust predictions. We evaluate our SICRL against the state-of-the-art methods on cross-site prostate MRI segmentation, cross-modality (CT-MRI) abdominal multi-organ segmentation, and cross-sequence (MRI) cardiac segmentation. Our approach achieves consistent performance gains compared to these peer methods.
期刊介绍:
Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.