{"title":"IMDF-Net: Iterative U-Net With Multi-Kernel Dilated Convolution and Fusion Modules for Enhanced Retinal Vessel Segmentation","authors":"Jiale Deng, Lina Yang, Yuwen Lin","doi":"10.1002/ima.70073","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In the early diagnosis of diabetic retinopathy, the morphological properties of blood vessels serve as an important reference for doctors to assess a patient's condition, facilitating scientific diagnostic and therapeutic interventions. However, vascular deformations, proliferation, and rupture caused by retinal diseases are often difficult to detect in the early stages. The assessment of retinal vessel morphology is subjective, time-consuming, and heavily dependent on the professional experience of the physician. Therefore, computer-aided diagnostic systems have gradually played a significant role in this field. Existing neural networks, particularly U-Net and its variants, have shown promising results in retinal vessel segmentation. However, due to the information loss caused by multiple pooling operations and the insufficient handling of local contextual features in skip connections, most segmentation methods still face challenges in accurately detecting microvessels. To address these limitations and assist medical staff in the early diagnosis of retinal diseases, we propose an iterative retinal vessel segmentation network with multi-dimensional attention and multi-scale feature fusion, named IMDF-Net. The network consists of a backbone network and an iterative refinement network. In the backbone network, we have designed a cascaded multi-kernel dilated convolution module and a multi-scale feature fusion module during the upsampling phase. These components expand the receptive field, effectively combine global information and local features, and propagate deep features to the shallow layers. Additionally, we have designed an iterative network to further capture missing information and correct erroneous segmentation results. Experimental results demonstrate that IMDF-Net outperforms several state-of-the-art methods on the DRIVE dataset, achieving the best performance across all evaluation metrics. On the CHASE_DB1 dataset, it achieves optimal performance in four metrics. It demonstrates its superiority in both overall performance and visual results, with a significant improvement in the segmentation of microvessels.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 2","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70073","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
In the early diagnosis of diabetic retinopathy, the morphological properties of blood vessels serve as an important reference for doctors to assess a patient's condition, facilitating scientific diagnostic and therapeutic interventions. However, vascular deformations, proliferation, and rupture caused by retinal diseases are often difficult to detect in the early stages. The assessment of retinal vessel morphology is subjective, time-consuming, and heavily dependent on the professional experience of the physician. Therefore, computer-aided diagnostic systems have gradually played a significant role in this field. Existing neural networks, particularly U-Net and its variants, have shown promising results in retinal vessel segmentation. However, due to the information loss caused by multiple pooling operations and the insufficient handling of local contextual features in skip connections, most segmentation methods still face challenges in accurately detecting microvessels. To address these limitations and assist medical staff in the early diagnosis of retinal diseases, we propose an iterative retinal vessel segmentation network with multi-dimensional attention and multi-scale feature fusion, named IMDF-Net. The network consists of a backbone network and an iterative refinement network. In the backbone network, we have designed a cascaded multi-kernel dilated convolution module and a multi-scale feature fusion module during the upsampling phase. These components expand the receptive field, effectively combine global information and local features, and propagate deep features to the shallow layers. Additionally, we have designed an iterative network to further capture missing information and correct erroneous segmentation results. Experimental results demonstrate that IMDF-Net outperforms several state-of-the-art methods on the DRIVE dataset, achieving the best performance across all evaluation metrics. On the CHASE_DB1 dataset, it achieves optimal performance in four metrics. It demonstrates its superiority in both overall performance and visual results, with a significant improvement in the segmentation of microvessels.
期刊介绍:
The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals.
IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging.
The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered.
The scope of the journal includes, but is not limited to, the following in the context of biomedical research:
Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.;
Neuromodulation and brain stimulation techniques such as TMS and tDCS;
Software and hardware for imaging, especially related to human and animal health;
Image segmentation in normal and clinical populations;
Pattern analysis and classification using machine learning techniques;
Computational modeling and analysis;
Brain connectivity and connectomics;
Systems-level characterization of brain function;
Neural networks and neurorobotics;
Computer vision, based on human/animal physiology;
Brain-computer interface (BCI) technology;
Big data, databasing and data mining.