Diff-UNet: A diffusion embedded network for robust 3D medical image segmentation

IF 11.8 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Medical image analysis Pub Date : 2025-06-16 DOI:10.1016/j.media.2025.103654

Zhaohu Xing , Liang Wan , Huazhu Fu , Guang Yang , Yijun Yang , Lequan Yu , Baiying Lei , Lei Zhu

{"title":"Diff-UNet: A diffusion embedded network for robust 3D medical image segmentation","authors":"Zhaohu Xing , Liang Wan , Huazhu Fu , Guang Yang , Yijun Yang , Lequan Yu , Baiying Lei , Lei Zhu","doi":"10.1016/j.media.2025.103654","DOIUrl":null,"url":null,"abstract":"<div><div>Benefiting from the powerful generative capabilities of diffusion models, recent studies have utilized these models to address 2D medical image segmentation problems. However, directly extending these methods to 3D medical image segmentation slice-by-slice does not yield satisfactory results. The reason is that these approaches often ignore the inter-slice relations of 3D medical data and require significant computational costs. To overcome these challenges, we devise the first diffusion-based model (i.e., Diff-UNet) with two branches for general 3D medical image segmentation. Specifically, we devise an additional boundary-prediction branch to predict the auxiliary boundary information of the target segmentation region, which assists the diffusion-denoising branch in predicting 3D segmentation results. Furthermore, we design a Multi-granularity Boundary Aggregation (MBA) module to embed both low-level and high-level boundary features into the diffusion denoising process. Then, we propose a Monte Carlo Diffusion (MC-Diff) module to generate an uncertainty map and define an uncertainty-guided segmentation loss to improve the segmentation results of uncertain pixels. Moreover, during our diffusion inference stage, we develop a Progressive Uncertainty-driven REfinement (PURE) strategy to fuse intermediate segmentation results at each diffusion inference step. Experimental results on the three latest large-scale datasets (i.e., BraTS2023, SegRap2023, and AIIB2023) with diverse organs and modalities show that our Diff-UNet quantitatively and qualitatively outperforms state-of-the-art 3D medical segmentation methods, especially on regions with small or complex structures. Our code is available at the following link: <span><span>https://github.com/ge-xing/DiffUNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"105 ","pages":"Article 103654"},"PeriodicalIF":11.8000,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525002014","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Benefiting from the powerful generative capabilities of diffusion models, recent studies have utilized these models to address 2D medical image segmentation problems. However, directly extending these methods to 3D medical image segmentation slice-by-slice does not yield satisfactory results. The reason is that these approaches often ignore the inter-slice relations of 3D medical data and require significant computational costs. To overcome these challenges, we devise the first diffusion-based model (i.e., Diff-UNet) with two branches for general 3D medical image segmentation. Specifically, we devise an additional boundary-prediction branch to predict the auxiliary boundary information of the target segmentation region, which assists the diffusion-denoising branch in predicting 3D segmentation results. Furthermore, we design a Multi-granularity Boundary Aggregation (MBA) module to embed both low-level and high-level boundary features into the diffusion denoising process. Then, we propose a Monte Carlo Diffusion (MC-Diff) module to generate an uncertainty map and define an uncertainty-guided segmentation loss to improve the segmentation results of uncertain pixels. Moreover, during our diffusion inference stage, we develop a Progressive Uncertainty-driven REfinement (PURE) strategy to fuse intermediate segmentation results at each diffusion inference step. Experimental results on the three latest large-scale datasets (i.e., BraTS2023, SegRap2023, and AIIB2023) with diverse organs and modalities show that our Diff-UNet quantitatively and qualitatively outperforms state-of-the-art 3D medical segmentation methods, especially on regions with small or complex structures. Our code is available at the following link: https://github.com/ge-xing/DiffUNet.

查看原文本刊更多论文

Diff-UNet：一种用于鲁棒三维医学图像分割的扩散嵌入式网络

得益于扩散模型强大的生成能力，最近的研究利用这些模型来解决二维医学图像分割问题。然而，将这些方法直接扩展到三维医学图像的切片分割中，并不能得到令人满意的结果。原因是这些方法往往忽略了三维医疗数据的层间关系，并且需要大量的计算成本。为了克服这些挑战，我们设计了第一个基于扩散的模型（即Diff-UNet），该模型具有两个分支，用于一般3D医学图像分割。具体来说，我们设计了一个额外的边界预测分支来预测目标分割区域的辅助边界信息，这有助于扩散去噪分支预测三维分割结果。此外，我们设计了一个多粒度边界聚集（MBA）模块，将低级和高级边界特征嵌入到扩散去噪过程中。然后，我们提出了蒙特卡罗扩散（MC-Diff）模块来生成不确定性映射，并定义了不确定性引导的分割损失，以改善不确定像素的分割结果。此外，在我们的扩散推理阶段，我们开发了一种渐进不确定性驱动的细化（PURE）策略来融合每个扩散推理步骤的中间分割结果。在三个最新的具有不同器官和模式的大型数据集（即BraTS2023， SegRap2023和AIIB2023）上的实验结果表明，我们的Diff-UNet在定量和定性上优于最先进的3D医学分割方法，特别是在具有小或复杂结构的区域上。我们的代码可从以下链接获得：https://github.com/ge-xing/DiffUNet。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.