DFA-UNet: dual-stream feature-fusion attention U-Net for lymph node segmentation in lung cancer diagnosis

Frontiers in Neuroscience Pub Date : 2024-07-15 DOI:10.3389/fnins.2024.1448294

Qi Zhou, Yingwen Zhou, Nailong Hou, Yaxuan Zhang, Guanyu Zhu, Liang Li

{"title":"DFA-UNet: dual-stream feature-fusion attention U-Net for lymph node segmentation in lung cancer diagnosis","authors":"Qi Zhou, Yingwen Zhou, Nailong Hou, Yaxuan Zhang, Guanyu Zhu, Liang Li","doi":"10.3389/fnins.2024.1448294","DOIUrl":null,"url":null,"abstract":"In bronchial ultrasound elastography, accurately segmenting mediastinal lymph nodes is of great significance for diagnosing whether lung cancer has metastasized. However, due to the ill-defined margin of ultrasound images and the complexity of lymph node structure, accurate segmentation of fine contours is still challenging. Therefore, we propose a dual-stream feature-fusion attention U-Net (DFA-UNet). Firstly, a dual-stream encoder (DSE) is designed by combining ConvNext with a lightweight vision transformer (ViT) to extract the local information and global information of images; Secondly, we propose a hybrid attention module (HAM) at the bottleneck, which incorporates spatial and channel attention to optimize the features transmission process by optimizing high-dimensional features at the bottom of the network. Finally, the feature-enhanced residual decoder (FRD) is developed to improve the fusion of features obtained from the encoder and decoder, ensuring a more comprehensive integration. Extensive experiments on the ultrasound elasticity image dataset show the superiority of our DFA-UNet over 9 state-of-the-art image segmentation models. Additionally, visual analysis, ablation studies, and generalization assessments highlight the significant enhancement effects of DFA-UNet. Comprehensive experiments confirm the excellent segmentation effectiveness of the DFA-UNet combined attention mechanism for ultrasound images, underscoring its important significance for future research on medical images.","PeriodicalId":509131,"journal":{"name":"Frontiers in Neuroscience","volume":"49 49","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Neuroscience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fnins.2024.1448294","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In bronchial ultrasound elastography, accurately segmenting mediastinal lymph nodes is of great significance for diagnosing whether lung cancer has metastasized. However, due to the ill-defined margin of ultrasound images and the complexity of lymph node structure, accurate segmentation of fine contours is still challenging. Therefore, we propose a dual-stream feature-fusion attention U-Net (DFA-UNet). Firstly, a dual-stream encoder (DSE) is designed by combining ConvNext with a lightweight vision transformer (ViT) to extract the local information and global information of images; Secondly, we propose a hybrid attention module (HAM) at the bottleneck, which incorporates spatial and channel attention to optimize the features transmission process by optimizing high-dimensional features at the bottom of the network. Finally, the feature-enhanced residual decoder (FRD) is developed to improve the fusion of features obtained from the encoder and decoder, ensuring a more comprehensive integration. Extensive experiments on the ultrasound elasticity image dataset show the superiority of our DFA-UNet over 9 state-of-the-art image segmentation models. Additionally, visual analysis, ablation studies, and generalization assessments highlight the significant enhancement effects of DFA-UNet. Comprehensive experiments confirm the excellent segmentation effectiveness of the DFA-UNet combined attention mechanism for ultrasound images, underscoring its important significance for future research on medical images.

查看原文本刊更多论文

DFA-UNet：用于肺癌诊断中淋巴结分割的双流特征融合注意 U-Net

在支气管超声弹性成像中，准确分割纵隔淋巴结对诊断肺癌是否转移意义重大。然而，由于超声图像边缘不清晰，淋巴结结构复杂，准确分割精细轮廓仍具有挑战性。因此，我们提出了一种双流特征融合注意力 U-Net （DFA-UNet）。首先，结合 ConvNext 和轻量级视觉变换器（ViT）设计了双流编码器（DSE），以提取图像的局部信息和全局信息；其次，我们在瓶颈处提出了混合注意力模块（HAM），它结合了空间注意力和通道注意力，通过优化网络底部的高维特征来优化特征传输过程。最后，开发了特征增强残差解码器（FRD），以改进编码器和解码器所获特征的融合，确保更全面的整合。在超声弹性图像数据集上的广泛实验表明，我们的 DFA-UNet 优于 9 种最先进的图像分割模型。此外，视觉分析、消融研究和泛化评估也凸显了 DFA-UNet 的显著增强效果。综合实验证实了 DFA-UNet 联合注意力机制在超声波图像中的出色分割效果，突出了它对未来医学图像研究的重要意义。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Frontiers in Neuroscience

自引率

0.00%

发文量