{"title":"FCFDiff-Net:全条件特征扩散嵌入网络,用于三维脑肿瘤分割。","authors":"Xiaosheng Wu, Qingyi Hou, Zhaozhao Xu, Chaosheng Tang, Shuihua Wang, Junding Sun, Yudong Zhang","doi":"10.21037/qims-24-2300","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Brain tumor segmentation (BraTS) plays a critical role in medical imaging for early diagnosis and treatment planning. Recently, diffusion models have provided new insights into image segmentation, achieving significant success due to their ability to model nonlinearities. However, existing methods still face challenges, such as false negatives and false positives, caused by image blurring and noise interference, which remain major obstacles. This study aimed to develop a novel neural network approach to address these challenges in three-dimensional (3D) BraTS.</p><p><strong>Methods: </strong>We propose a novel full-conditional feature diffusion embedded network (FCFDiff-Net) for 3D BraTS. This model enhances segmentation accuracy and robustness, particularly in noisy or ambiguous regions. This model introduces the full-conditional feature embedding (FCFE) module and employs a more comprehensive conditional embedding approach, fully integrating feature information from the original image into the diffusion model. It establishes an effective connection between the decoder side of the denoising network and the encoder side of the diffusion model, thereby improving the model's ability to capture the tumor target region and its boundaries. To further optimize performance and minimize discrepancies between conditional features and the denoising module, we introduce the multi-head attention residual fusion (MHARF) module. The MHARF module integrates features from the FCFE with noisy features generated during the denoising process. Using multi-head attention aligns semantic and noise information refining the segmentation results. This fusion enhances segmentation accuracy and stability by reducing noise impact and ensuring greater consistency across tumor regions.</p><p><strong>Results: </strong>The BraTS 2020 and BraTS 2021 datasets served as the primary training and evaluation datasets. The proposed architecture was assessed using metrics such as Dice similarity coefficient (DSC), Hausdorff distance at the 95th percentile (HD95), specificity, and false positive rate (FPR). For the BraTS 2020 dataset, the DSC scores for the whole tumor (WT), tumor core (TC), and enhancing tumor (ET) were 0.916, 0.860, and 0.786, respectively. The HD95 values were 1.917, 2.571, and 2.581 mm, whereas specificity values were 0.998, 0.999, and 0.999, and FPR values were 0.002, 0.001, and 0.001, respectively. On the BraTS 2021 dataset, the DSC scores for the same regions were 0.926, 0.903, and 0.869, with HD95 values of 2.156, 1.834, and 1.583 mm, respectively. Specificity and FPR values were 0.999 across the board, and FPR values were consistently low at 0.001. These results demonstrate the model's excellent performance across the three regions.</p><p><strong>Conclusions: </strong>The proposed FCFDiff-Net provides an efficient and robust solution for 3D BraTS, outperforming existing models in terms of accuracy and robustness. 
Future work will focus on exploring the model's generalization capabilities and conducting lightweight experiments to further enhance its applicability.</p>","PeriodicalId":54267,"journal":{"name":"Quantitative Imaging in Medicine and Surgery","volume":"15 5","pages":"4217-4234"},"PeriodicalIF":2.9000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12084721/pdf/","citationCount":"0","resultStr":"{\"title\":\"FCFDiff-Net: full-conditional feature diffusion embedded network for 3D brain tumor segmentation.\",\"authors\":\"Xiaosheng Wu, Qingyi Hou, Zhaozhao Xu, Chaosheng Tang, Shuihua Wang, Junding Sun, Yudong Zhang\",\"doi\":\"10.21037/qims-24-2300\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Brain tumor segmentation (BraTS) plays a critical role in medical imaging for early diagnosis and treatment planning. Recently, diffusion models have provided new insights into image segmentation, achieving significant success due to their ability to model nonlinearities. However, existing methods still face challenges, such as false negatives and false positives, caused by image blurring and noise interference, which remain major obstacles. This study aimed to develop a novel neural network approach to address these challenges in three-dimensional (3D) BraTS.</p><p><strong>Methods: </strong>We propose a novel full-conditional feature diffusion embedded network (FCFDiff-Net) for 3D BraTS. This model enhances segmentation accuracy and robustness, particularly in noisy or ambiguous regions. This model introduces the full-conditional feature embedding (FCFE) module and employs a more comprehensive conditional embedding approach, fully integrating feature information from the original image into the diffusion model. It establishes an effective connection between the decoder side of the denoising network and the encoder side of the diffusion model, thereby improving the model's ability to capture the tumor target region and its boundaries. To further optimize performance and minimize discrepancies between conditional features and the denoising module, we introduce the multi-head attention residual fusion (MHARF) module. The MHARF module integrates features from the FCFE with noisy features generated during the denoising process. Using multi-head attention aligns semantic and noise information refining the segmentation results. This fusion enhances segmentation accuracy and stability by reducing noise impact and ensuring greater consistency across tumor regions.</p><p><strong>Results: </strong>The BraTS 2020 and BraTS 2021 datasets served as the primary training and evaluation datasets. The proposed architecture was assessed using metrics such as Dice similarity coefficient (DSC), Hausdorff distance at the 95th percentile (HD95), specificity, and false positive rate (FPR). For the BraTS 2020 dataset, the DSC scores for the whole tumor (WT), tumor core (TC), and enhancing tumor (ET) were 0.916, 0.860, and 0.786, respectively. The HD95 values were 1.917, 2.571, and 2.581 mm, whereas specificity values were 0.998, 0.999, and 0.999, and FPR values were 0.002, 0.001, and 0.001, respectively. On the BraTS 2021 dataset, the DSC scores for the same regions were 0.926, 0.903, and 0.869, with HD95 values of 2.156, 1.834, and 1.583 mm, respectively. Specificity and FPR values were 0.999 across the board, and FPR values were consistently low at 0.001. 
These results demonstrate the model's excellent performance across the three regions.</p><p><strong>Conclusions: </strong>The proposed FCFDiff-Net provides an efficient and robust solution for 3D BraTS, outperforming existing models in terms of accuracy and robustness. Future work will focus on exploring the model's generalization capabilities and conducting lightweight experiments to further enhance its applicability.</p>\",\"PeriodicalId\":54267,\"journal\":{\"name\":\"Quantitative Imaging in Medicine and Surgery\",\"volume\":\"15 5\",\"pages\":\"4217-4234\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12084721/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Quantitative Imaging in Medicine and Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.21037/qims-24-2300\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/4/25 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Imaging in Medicine and Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/qims-24-2300","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/25 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
FCFDiff-Net: full-conditional feature diffusion embedded network for 3D brain tumor segmentation.
Background: Brain tumor segmentation (BraTS) plays a critical role in medical imaging for early diagnosis and treatment planning. Recently, diffusion models have provided new insights into image segmentation, achieving significant success due to their ability to model nonlinearities. However, existing methods still suffer from false negatives and false positives caused by image blurring and noise interference, which remain major obstacles. This study aimed to develop a novel neural network approach to address these challenges in three-dimensional (3D) BraTS.
Methods: We propose a novel full-conditional feature diffusion embedded network (FCFDiff-Net) for 3D BraTS that enhances segmentation accuracy and robustness, particularly in noisy or ambiguous regions. The model introduces a full-conditional feature embedding (FCFE) module, which employs a more comprehensive conditional embedding approach and fully integrates feature information from the original image into the diffusion model. It establishes an effective connection between the decoder side of the denoising network and the encoder side of the diffusion model, thereby improving the model's ability to capture the tumor target region and its boundaries. To further optimize performance and minimize discrepancies between the conditional features and the denoising module, we introduce a multi-head attention residual fusion (MHARF) module, which integrates features from the FCFE with the noisy features generated during denoising. The multi-head attention aligns semantic and noise information, refining the segmentation results. This fusion improves segmentation accuracy and stability by reducing the impact of noise and ensuring greater consistency across tumor regions.
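Since the abstract does not include implementation details, the following PyTorch sketch is only one plausible reading of the multi-head attention residual fusion idea described above: conditional (image-derived) features act as keys/values, noisy diffusion features act as queries, and the result is added back residually. All names (MHARFBlock, cond_feat, noisy_feat, embed_dim, num_heads) are hypothetical and not taken from the authors' code.

```python
# Illustrative sketch only, not the published FCFDiff-Net implementation.
import torch
import torch.nn as nn


class MHARFBlock(nn.Module):
    """Fuses conditional features with noisy diffusion features via
    multi-head cross-attention plus a residual connection."""

    def __init__(self, embed_dim: int = 64, num_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(embed_dim)

    def forward(self, noisy_feat: torch.Tensor, cond_feat: torch.Tensor) -> torch.Tensor:
        # noisy_feat, cond_feat: (batch, tokens, embed_dim), e.g. flattened 3D voxel patches.
        # Queries come from the denoising path, keys/values from the conditional path,
        # so semantic information steers how the noisy features are refined.
        fused, _ = self.attn(query=noisy_feat, key=cond_feat, value=cond_feat)
        # Residual fusion keeps the original noisy features and adds the aligned context.
        return self.norm(noisy_feat + fused)


# Toy usage with 512 tokens of dimension 64.
if __name__ == "__main__":
    block = MHARFBlock(embed_dim=64, num_heads=4)
    noisy = torch.randn(2, 512, 64)
    cond = torch.randn(2, 512, 64)
    print(block(noisy, cond).shape)  # torch.Size([2, 512, 64])
```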
Results: The BraTS 2020 and BraTS 2021 datasets served as the primary training and evaluation datasets. The proposed architecture was assessed using the Dice similarity coefficient (DSC), the Hausdorff distance at the 95th percentile (HD95), specificity, and the false positive rate (FPR). For the BraTS 2020 dataset, the DSC scores for the whole tumor (WT), tumor core (TC), and enhancing tumor (ET) were 0.916, 0.860, and 0.786, respectively. The HD95 values were 1.917, 2.571, and 2.581 mm, whereas specificity values were 0.998, 0.999, and 0.999, and FPR values were 0.002, 0.001, and 0.001, respectively. On the BraTS 2021 dataset, the DSC scores for the same regions were 0.926, 0.903, and 0.869, with HD95 values of 2.156, 1.834, and 1.583 mm, respectively. Specificity values were 0.999 across the board, and FPR values were consistently low at 0.001. These results demonstrate the model's excellent performance across all three regions.
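For orientation, the snippet below sketches how the voxel-wise metrics behave on binary masks; it is a generic illustration, not the authors' evaluation pipeline. Note that FPR = FP/(FP + TN) = 1 − specificity, consistent with the paired values reported above (e.g., 0.998/0.002 and 0.999/0.001); HD95 is usually computed with a dedicated library (e.g., MedPy) and is omitted here.

```python
# Generic metric sketch for binary segmentation masks (not the paper's code).
import numpy as np


def dice(pred: np.ndarray, gt: np.ndarray, eps: float = 1e-8) -> float:
    """DSC = 2|P ∩ G| / (|P| + |G|)."""
    inter = np.logical_and(pred, gt).sum()
    return float(2.0 * inter / (pred.sum() + gt.sum() + eps))


def specificity_fpr(pred: np.ndarray, gt: np.ndarray, eps: float = 1e-8):
    """Specificity = TN / (TN + FP); FPR = 1 - specificity."""
    tn = np.logical_and(~pred, ~gt).sum()
    fp = np.logical_and(pred, ~gt).sum()
    spec = float(tn / (tn + fp + eps))
    return spec, 1.0 - spec


# Toy 3D example: specificity is near 1 (and FPR near 0) largely because tumor
# voxels occupy only a tiny fraction of the brain volume.
pred = np.zeros((8, 8, 8), dtype=bool)
gt = np.zeros((8, 8, 8), dtype=bool)
pred[2:4, 2:4, 2:4] = True
gt[2:4, 2:5, 2:4] = True
print(dice(pred, gt), specificity_fpr(pred, gt))
```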
Conclusions: The proposed FCFDiff-Net provides an efficient solution for 3D BraTS, outperforming existing models in both accuracy and robustness. Future work will focus on exploring the model's generalization capabilities and on model-lightweighting experiments to further enhance its applicability.