FSDA-DG: Improving cross-domain generalizability of medical image segmentation with few source domain annotations

IF 11.8 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Medical image analysis Pub Date : 2025-06-27 DOI:10.1016/j.media.2025.103704

Zanting Ye , Ke Wang , Wenbing Lv , Qianjin Feng , Lijun Lu

{"title":"FSDA-DG: Improving cross-domain generalizability of medical image segmentation with few source domain annotations","authors":"Zanting Ye , Ke Wang , Wenbing Lv , Qianjin Feng , Lijun Lu","doi":"10.1016/j.media.2025.103704","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning-based medical image segmentation faces significant challenges arising from limited labeled data and domain shifts. While prior approaches have primarily addressed these issues independently, their simultaneous occurrence is common in medical imaging. A method that generalizes to unseen domains using only minimal annotations offers significant practical value due to reduced data annotation and development costs. In pursuit of this goal, we propose FSDA-DG, a novel solution to improve cross-domain generalizability of medical image segmentation with few single-source domain annotations. Specifically, our approach introduces semantics-guided semi-supervised data augmentation. This method divides images into global broad regions and semantics-guided local regions, and applies distinct augmentation strategies to enrich data distribution. Within this framework, both labeled and unlabeled data are transformed into extensive domain knowledge while preserving domain-invariant semantic information. Additionally, FSDA-DG employs a multi-decoder U-Net pipeline semi-supervised learning (SSL) network to improve domain-invariant representation learning through consistent prior assumption across multiple perturbations. By integrating data-level and model-level designs, FSDA-DG achieves superior performance compared to state-of-the-art methods in two challenging single domain generalization (SDG) tasks with limited annotations. The code is publicly available at <span><span>https://github.com/yezanting/FSDA-DG</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"105 ","pages":"Article 103704"},"PeriodicalIF":11.8000,"publicationDate":"2025-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525002518","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Deep learning-based medical image segmentation faces significant challenges arising from limited labeled data and domain shifts. While prior approaches have primarily addressed these issues independently, their simultaneous occurrence is common in medical imaging. A method that generalizes to unseen domains using only minimal annotations offers significant practical value due to reduced data annotation and development costs. In pursuit of this goal, we propose FSDA-DG, a novel solution to improve cross-domain generalizability of medical image segmentation with few single-source domain annotations. Specifically, our approach introduces semantics-guided semi-supervised data augmentation. This method divides images into global broad regions and semantics-guided local regions, and applies distinct augmentation strategies to enrich data distribution. Within this framework, both labeled and unlabeled data are transformed into extensive domain knowledge while preserving domain-invariant semantic information. Additionally, FSDA-DG employs a multi-decoder U-Net pipeline semi-supervised learning (SSL) network to improve domain-invariant representation learning through consistent prior assumption across multiple perturbations. By integrating data-level and model-level designs, FSDA-DG achieves superior performance compared to state-of-the-art methods in two challenging single domain generalization (SDG) tasks with limited annotations. The code is publicly available at https://github.com/yezanting/FSDA-DG.

查看原文本刊更多论文

FSDA-DG：利用较少的源域注释提高医学图像分割的跨域泛化性

基于深度学习的医学图像分割面临着有限的标记数据和领域转移带来的重大挑战。虽然先前的方法主要是独立解决这些问题，但它们同时发生在医学成像中很常见。由于减少了数据注释和开发成本，一种只使用最少注释就可以泛化到未知领域的方法具有重要的实用价值。为了实现这一目标，我们提出了一种新的解决方案FSDA-DG，它可以在较少的单源域注释的情况下提高医学图像分割的跨域泛化性。具体来说，我们的方法引入了语义引导的半监督数据增强。该方法将图像划分为全局广义区域和语义引导局部区域，并采用不同的增强策略丰富数据分布。在该框架中，标记和未标记的数据都被转换为广泛的领域知识，同时保留了领域不变的语义信息。此外，FSDA-DG采用多解码器U-Net管道半监督学习（SSL）网络，通过跨多个扰动的一致先验假设来改进域不变表示学习。通过集成数据级和模型级设计，FSDA-DG在具有有限注释的两个具有挑战性的单域泛化（SDG）任务中实现了优于最先进方法的性能。该代码可在https://github.com/yezanting/FSDA-DG上公开获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.