FDB-Net: Fusion double branch network combining CNN and transformer for medical image segmentation.

IF 1.7 3区医学 Q3 INSTRUMENTS & INSTRUMENTATION

Journal of X-Ray Science and Technology Pub Date : 2024-01-01 DOI:10.3233/XST-230413

Zhongchuan Jiang, Yun Wu, Lei Huang, Maohua Gu

{"title":"FDB-Net: Fusion double branch network combining CNN and transformer for medical image segmentation.","authors":"Zhongchuan Jiang, Yun Wu, Lei Huang, Maohua Gu","doi":"10.3233/XST-230413","DOIUrl":null,"url":null,"abstract":"Background: The rapid development of deep learning techniques has greatly improved the performance of medical image segmentation, and medical image segmentation networks based on convolutional neural networks and Transformer have been widely used in this field. However, due to the limitation of the restricted receptive field of convolutional operation and the lack of local fine information extraction ability of the self-attention mechanism in Transformer, the current neural networks with pure convolutional or Transformer structure as the backbone still perform poorly in medical image segmentation.Methods: In this paper, we propose FDB-Net (Fusion Double Branch Network, FDB-Net), a double branch medical image segmentation network combining CNN and Transformer, by using a CNN containing gnConv blocks and a Transformer containing Varied-Size Window Attention (VWA) blocks as the feature extraction backbone network, the dual-path encoder ensures that the network has a global receptive field as well as access to the target local detail features. We also propose a new feature fusion module (Deep Feature Fusion, DFF), which helps the image to simultaneously fuse features from two different structural encoders during the encoding process, ensuring the effective fusion of global and local information of the image.Conclusion: Our model achieves advanced results in all three typical tasks of medical image segmentation, which fully validates the effectiveness of FDB-Net.","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":" ","pages":"931-951"},"PeriodicalIF":1.7000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of X-Ray Science and Technology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3233/XST-230413","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}

引用次数: 0

Abstract

Background: The rapid development of deep learning techniques has greatly improved the performance of medical image segmentation, and medical image segmentation networks based on convolutional neural networks and Transformer have been widely used in this field. However, due to the limitation of the restricted receptive field of convolutional operation and the lack of local fine information extraction ability of the self-attention mechanism in Transformer, the current neural networks with pure convolutional or Transformer structure as the backbone still perform poorly in medical image segmentation.

Methods: In this paper, we propose FDB-Net (Fusion Double Branch Network, FDB-Net), a double branch medical image segmentation network combining CNN and Transformer, by using a CNN containing gnConv blocks and a Transformer containing Varied-Size Window Attention (VWA) blocks as the feature extraction backbone network, the dual-path encoder ensures that the network has a global receptive field as well as access to the target local detail features. We also propose a new feature fusion module (Deep Feature Fusion, DFF), which helps the image to simultaneously fuse features from two different structural encoders during the encoding process, ensuring the effective fusion of global and local information of the image.

Conclusion: Our model achieves advanced results in all three typical tasks of medical image segmentation, which fully validates the effectiveness of FDB-Net.

查看原文本刊更多论文

FDB-Net：结合 CNN 和变换器的融合双分支网络，用于医学图像分割。

背景：深度学习技术的快速发展极大地提高了医学图像分割的性能，基于卷积神经网络和Transformer的医学图像分割网络在该领域得到了广泛应用。然而，由于卷积运算的感受野受限和Transformer中自注意机制缺乏局部精细信息提取能力的限制，目前以纯卷积或Transformer结构为骨干的神经网络在医学图像分割中的表现仍然不佳：本文提出了 FDB-Net（融合双分支网络，Fusion Double Branch Network，FDB-Net），这是一种结合了 CNN 和 Transformer 的双分支医学图像分割网络，通过使用包含 gnConv 块的 CNN 和包含 Varied-Size Window Attention（VWA）块的 Transformer 作为特征提取骨干网络，双路径编码器确保了网络既有全局感受野，又能获取目标局部细节特征。我们还提出了一个新的特征融合模块（深度特征融合，DFF），帮助图像在编码过程中同时融合来自两个不同结构编码器的特征，确保图像的全局和局部信息得到有效融合：我们的模型在医学图像分割的三个典型任务中都取得了先进的结果，充分验证了 FDB-Net 的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of X-Ray Science and Technology 工程技术-光学

CiteScore

4.90

自引率

23.30%

发文量

150

审稿时长

3 months

期刊介绍： Research areas within the scope of the journal include: Interaction of x-rays with matter: x-ray phenomena, biological effects of radiation, radiation safety and optical constants X-ray sources: x-rays from synchrotrons, x-ray lasers, plasmas, and other sources, conventional or unconventional Optical elements: grazing incidence optics, multilayer mirrors, zone plates, gratings, other diffraction optics Optical instruments: interferometers, spectrometers, microscopes, telescopes, microprobes