Towards Generic Abdominal Multi-Organ Segmentation with multiple partially labeled datasets

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL

Computerized Medical Imaging and Graphics Pub Date : 2025-09-01 DOI:10.1016/j.compmedimag.2025.102642

Xiang Li , Faming Fang , Liyan Ma , Tieyong Zeng , Guixu Zhang , Ming Xu

{"title":"Towards Generic Abdominal Multi-Organ Segmentation with multiple partially labeled datasets","authors":"Xiang Li , Faming Fang , Liyan Ma , Tieyong Zeng , Guixu Zhang , Ming Xu","doi":"10.1016/j.compmedimag.2025.102642","DOIUrl":null,"url":null,"abstract":"<div><div>An increasing number of publicly available datasets have facilitated the exploration of building universal medical segmentation models. Existing approaches address partially labeled problem of each dataset by harmonizing labels across datasets and independently focusing on the labeled foreground regions. However, significant challenges persist, particularly in the form of cross-site domain shifts and the limited utilization of partially labeled datasets. In this paper, we propose a GAMOS (<strong>G</strong>eneric <strong>A</strong>bdominal <strong>M</strong>ulti-<strong>O</strong>rgan <strong>S</strong>egmentation) framework. Specifically, GAMOS integrates a self-guidance strategy to adopt diffusion models for partial labeling issue, while employing a self-distillation mechanism to effectively leverage unlabeled data. A sparse semantic memory is introduced to mitigate domain shifts by ensuring consistent representations in the latent space. To further enhance performance, we design a sparse similarity loss to align multi-view memory representations and enhance the discriminability and compactness of the memory vectors. Extensive experiments on real-world medical datasets demonstrate the superiority and generalization ability of GAMOS. It achieves a mean Dice Similarity Coefficient (DSC) of 91.33% and a mean 95th percentile Hausdorff Distance (HD95) of 1.83 on labeled foreground regions. For unlabeled foreground regions, GAMOS obtains a mean DSC of 86.88% and a mean HD95 of 3.85, outperforming existing state-of-the-art methods.</div></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":"125 ","pages":"Article 102642"},"PeriodicalIF":4.9000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computerized Medical Imaging and Graphics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S089561112500151X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

An increasing number of publicly available datasets have facilitated the exploration of building universal medical segmentation models. Existing approaches address partially labeled problem of each dataset by harmonizing labels across datasets and independently focusing on the labeled foreground regions. However, significant challenges persist, particularly in the form of cross-site domain shifts and the limited utilization of partially labeled datasets. In this paper, we propose a GAMOS (Generic Abdominal Multi-Organ Segmentation) framework. Specifically, GAMOS integrates a self-guidance strategy to adopt diffusion models for partial labeling issue, while employing a self-distillation mechanism to effectively leverage unlabeled data. A sparse semantic memory is introduced to mitigate domain shifts by ensuring consistent representations in the latent space. To further enhance performance, we design a sparse similarity loss to align multi-view memory representations and enhance the discriminability and compactness of the memory vectors. Extensive experiments on real-world medical datasets demonstrate the superiority and generalization ability of GAMOS. It achieves a mean Dice Similarity Coefficient (DSC) of 91.33% and a mean 95th percentile Hausdorff Distance (HD95) of 1.83 on labeled foreground regions. For unlabeled foreground regions, GAMOS obtains a mean DSC of 86.88% and a mean HD95 of 3.85, outperforming existing state-of-the-art methods.

查看原文本刊更多论文

基于多个部分标记数据集的通用腹部多器官分割

越来越多的公开数据集促进了构建通用医学分割模型的探索。现有方法通过协调数据集之间的标签和独立关注标记的前景区域来解决每个数据集的部分标记问题。然而，重大的挑战仍然存在，特别是在跨站点域转移和部分标记数据集的有限利用方面。在本文中，我们提出了一个GAMOS（通用腹部多器官分割）框架。具体而言，GAMOS集成了自引导策略，采用扩散模型解决部分标记问题，同时采用自蒸馏机制有效利用未标记数据。引入了稀疏语义记忆，通过确保潜在空间中的一致表示来缓解域偏移。为了进一步提高性能，我们设计了一个稀疏相似损失来对齐多视图内存表示，并增强了内存向量的可辨别性和紧凑性。在实际医疗数据集上的大量实验证明了GAMOS的优越性和泛化能力。在标记的前景区域上，平均骰子相似系数（DSC）为91.33%，平均第95百分位豪斯多夫距离（HD95）为1.83。对于未标记的前景区域，GAMOS的平均DSC为86.88%，平均HD95为3.85，优于现有的最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computerized Medical Imaging and Graphics 医学-核医学

CiteScore

10.70

自引率

3.50%

发文量

审稿时长

26 days

期刊介绍： The purpose of the journal Computerized Medical Imaging and Graphics is to act as a source for the exchange of research results concerning algorithmic advances, development, and application of digital imaging in disease detection, diagnosis, intervention, prevention, precision medicine, and population health. Included in the journal will be articles on novel computerized imaging or visualization techniques, including artificial intelligence and machine learning, augmented reality for surgical planning and guidance, big biomedical data visualization, computer-aided diagnosis, computerized-robotic surgery, image-guided therapy, imaging scanning and reconstruction, mobile and tele-imaging, radiomics, and imaging integration and modeling with other information relevant to digital health. The types of biomedical imaging include: magnetic resonance, computed tomography, ultrasound, nuclear medicine, X-ray, microwave, optical and multi-photon microscopy, video and sensory imaging, and the convergence of biomedical images with other non-imaging datasets.