Xiang Li , Faming Fang , Liyan Ma , Tieyong Zeng , Guixu Zhang , Ming Xu
{"title":"Towards Generic Abdominal Multi-Organ Segmentation with multiple partially labeled datasets","authors":"Xiang Li , Faming Fang , Liyan Ma , Tieyong Zeng , Guixu Zhang , Ming Xu","doi":"10.1016/j.compmedimag.2025.102642","DOIUrl":null,"url":null,"abstract":"<div><div>An increasing number of publicly available datasets have facilitated the exploration of building universal medical segmentation models. Existing approaches address partially labeled problem of each dataset by harmonizing labels across datasets and independently focusing on the labeled foreground regions. However, significant challenges persist, particularly in the form of cross-site domain shifts and the limited utilization of partially labeled datasets. In this paper, we propose a GAMOS (<strong>G</strong>eneric <strong>A</strong>bdominal <strong>M</strong>ulti-<strong>O</strong>rgan <strong>S</strong>egmentation) framework. Specifically, GAMOS integrates a self-guidance strategy to adopt diffusion models for partial labeling issue, while employing a self-distillation mechanism to effectively leverage unlabeled data. A sparse semantic memory is introduced to mitigate domain shifts by ensuring consistent representations in the latent space. To further enhance performance, we design a sparse similarity loss to align multi-view memory representations and enhance the discriminability and compactness of the memory vectors. Extensive experiments on real-world medical datasets demonstrate the superiority and generalization ability of GAMOS. It achieves a mean Dice Similarity Coefficient (DSC) of 91.33% and a mean 95th percentile Hausdorff Distance (HD95) of 1.83 on labeled foreground regions. For unlabeled foreground regions, GAMOS obtains a mean DSC of 86.88% and a mean HD95 of 3.85, outperforming existing state-of-the-art methods.</div></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":"125 ","pages":"Article 102642"},"PeriodicalIF":4.9000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computerized Medical Imaging and Graphics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S089561112500151X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
An increasing number of publicly available datasets have facilitated the exploration of building universal medical segmentation models. Existing approaches address partially labeled problem of each dataset by harmonizing labels across datasets and independently focusing on the labeled foreground regions. However, significant challenges persist, particularly in the form of cross-site domain shifts and the limited utilization of partially labeled datasets. In this paper, we propose a GAMOS (Generic Abdominal Multi-Organ Segmentation) framework. Specifically, GAMOS integrates a self-guidance strategy to adopt diffusion models for partial labeling issue, while employing a self-distillation mechanism to effectively leverage unlabeled data. A sparse semantic memory is introduced to mitigate domain shifts by ensuring consistent representations in the latent space. To further enhance performance, we design a sparse similarity loss to align multi-view memory representations and enhance the discriminability and compactness of the memory vectors. Extensive experiments on real-world medical datasets demonstrate the superiority and generalization ability of GAMOS. It achieves a mean Dice Similarity Coefficient (DSC) of 91.33% and a mean 95th percentile Hausdorff Distance (HD95) of 1.83 on labeled foreground regions. For unlabeled foreground regions, GAMOS obtains a mean DSC of 86.88% and a mean HD95 of 3.85, outperforming existing state-of-the-art methods.
期刊介绍:
The purpose of the journal Computerized Medical Imaging and Graphics is to act as a source for the exchange of research results concerning algorithmic advances, development, and application of digital imaging in disease detection, diagnosis, intervention, prevention, precision medicine, and population health. Included in the journal will be articles on novel computerized imaging or visualization techniques, including artificial intelligence and machine learning, augmented reality for surgical planning and guidance, big biomedical data visualization, computer-aided diagnosis, computerized-robotic surgery, image-guided therapy, imaging scanning and reconstruction, mobile and tele-imaging, radiomics, and imaging integration and modeling with other information relevant to digital health. The types of biomedical imaging include: magnetic resonance, computed tomography, ultrasound, nuclear medicine, X-ray, microwave, optical and multi-photon microscopy, video and sensory imaging, and the convergence of biomedical images with other non-imaging datasets.