IMDF-Net: Iterative U-Net With Multi-Kernel Dilated Convolution and Fusion Modules for Enhanced Retinal Vessel Segmentation

IF 3 4区计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

International Journal of Imaging Systems and Technology Pub Date : 2025-03-22 DOI:10.1002/ima.70073

Jiale Deng, Lina Yang, Yuwen Lin

{"title":"IMDF-Net: Iterative U-Net With Multi-Kernel Dilated Convolution and Fusion Modules for Enhanced Retinal Vessel Segmentation","authors":"Jiale Deng, Lina Yang, Yuwen Lin","doi":"10.1002/ima.70073","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In the early diagnosis of diabetic retinopathy, the morphological properties of blood vessels serve as an important reference for doctors to assess a patient's condition, facilitating scientific diagnostic and therapeutic interventions. However, vascular deformations, proliferation, and rupture caused by retinal diseases are often difficult to detect in the early stages. The assessment of retinal vessel morphology is subjective, time-consuming, and heavily dependent on the professional experience of the physician. Therefore, computer-aided diagnostic systems have gradually played a significant role in this field. Existing neural networks, particularly U-Net and its variants, have shown promising results in retinal vessel segmentation. However, due to the information loss caused by multiple pooling operations and the insufficient handling of local contextual features in skip connections, most segmentation methods still face challenges in accurately detecting microvessels. To address these limitations and assist medical staff in the early diagnosis of retinal diseases, we propose an iterative retinal vessel segmentation network with multi-dimensional attention and multi-scale feature fusion, named IMDF-Net. The network consists of a backbone network and an iterative refinement network. In the backbone network, we have designed a cascaded multi-kernel dilated convolution module and a multi-scale feature fusion module during the upsampling phase. These components expand the receptive field, effectively combine global information and local features, and propagate deep features to the shallow layers. Additionally, we have designed an iterative network to further capture missing information and correct erroneous segmentation results. Experimental results demonstrate that IMDF-Net outperforms several state-of-the-art methods on the DRIVE dataset, achieving the best performance across all evaluation metrics. On the CHASE_DB1 dataset, it achieves optimal performance in four metrics. It demonstrates its superiority in both overall performance and visual results, with a significant improvement in the segmentation of microvessels.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 2","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70073","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

In the early diagnosis of diabetic retinopathy, the morphological properties of blood vessels serve as an important reference for doctors to assess a patient's condition, facilitating scientific diagnostic and therapeutic interventions. However, vascular deformations, proliferation, and rupture caused by retinal diseases are often difficult to detect in the early stages. The assessment of retinal vessel morphology is subjective, time-consuming, and heavily dependent on the professional experience of the physician. Therefore, computer-aided diagnostic systems have gradually played a significant role in this field. Existing neural networks, particularly U-Net and its variants, have shown promising results in retinal vessel segmentation. However, due to the information loss caused by multiple pooling operations and the insufficient handling of local contextual features in skip connections, most segmentation methods still face challenges in accurately detecting microvessels. To address these limitations and assist medical staff in the early diagnosis of retinal diseases, we propose an iterative retinal vessel segmentation network with multi-dimensional attention and multi-scale feature fusion, named IMDF-Net. The network consists of a backbone network and an iterative refinement network. In the backbone network, we have designed a cascaded multi-kernel dilated convolution module and a multi-scale feature fusion module during the upsampling phase. These components expand the receptive field, effectively combine global information and local features, and propagate deep features to the shallow layers. Additionally, we have designed an iterative network to further capture missing information and correct erroneous segmentation results. Experimental results demonstrate that IMDF-Net outperforms several state-of-the-art methods on the DRIVE dataset, achieving the best performance across all evaluation metrics. On the CHASE_DB1 dataset, it achieves optimal performance in four metrics. It demonstrates its superiority in both overall performance and visual results, with a significant improvement in the segmentation of microvessels.

查看原文本刊更多论文

基于多核扩展卷积和融合模块的迭代U-Net增强视网膜血管分割

在糖尿病视网膜病变的早期诊断中，血管形态学特征是医生评估患者病情的重要依据，便于科学的诊断和治疗干预。然而，视网膜疾病引起的血管变形、增生和破裂在早期往往难以发现。视网膜血管形态的评估是主观的，耗时的，并且严重依赖于医生的专业经验。因此，计算机辅助诊断系统逐渐在这一领域发挥了重要作用。现有的神经网络，特别是U-Net及其变体，在视网膜血管分割方面已经显示出很好的结果。然而，由于多次池化操作造成的信息丢失以及跳跃连接中对局部上下文特征的处理不足，大多数分割方法在准确检测微血管方面仍然面临挑战。为了解决这些局限性，帮助医务人员早期诊断视网膜疾病，我们提出了一个具有多维关注和多尺度特征融合的迭代视网膜血管分割网络，命名为IMDF-Net。该网络由一个骨干网络和一个迭代细化网络组成。在骨干网中，我们设计了级联多核扩展卷积模块和上采样阶段的多尺度特征融合模块。这些成分扩展了接受域，有效地结合了全局信息和局部特征，并将深层特征传播到浅层。此外，我们还设计了一个迭代网络来进一步捕获缺失信息并纠正错误的分割结果。实验结果表明，在DRIVE数据集上，IMDF-Net优于几种最先进的方法，在所有评估指标上都取得了最佳性能。在CHASE_DB1数据集上，它在四个指标上实现了最佳性能。它在整体性能和视觉效果上都显示出优越性，在微血管分割方面有明显的改善。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Imaging Systems and Technology 工程技术-成像科学与照相技术

CiteScore

6.90

自引率

6.10%

发文量

138

审稿时长

3 months

期刊介绍： The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals. IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging. The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered. The scope of the journal includes, but is not limited to, the following in the context of biomedical research: Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.; Neuromodulation and brain stimulation techniques such as TMS and tDCS; Software and hardware for imaging, especially related to human and animal health; Image segmentation in normal and clinical populations; Pattern analysis and classification using machine learning techniques; Computational modeling and analysis; Brain connectivity and connectomics; Systems-level characterization of brain function; Neural networks and neurorobotics; Computer vision, based on human/animal physiology; Brain-computer interface (BCI) technology; Big data, databasing and data mining.