CBAM Attention Gate-Based Lightweight Deep Neural Network Model for Improved Retinal Vessel Segmentation

IF 3 4区计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

International Journal of Imaging Systems and Technology Pub Date : 2025-01-22 DOI:10.1002/ima.70031

Kashif Fareed, Anas Khan, Musaed Alhussein, Khursheed Aurangzeb, Aamir Shahzad, Mazhar Islam

{"title":"CBAM Attention Gate-Based Lightweight Deep Neural Network Model for Improved Retinal Vessel Segmentation","authors":"Kashif Fareed, Anas Khan, Musaed Alhussein, Khursheed Aurangzeb, Aamir Shahzad, Mazhar Islam","doi":"10.1002/ima.70031","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Over the years, researchers have been using deep learning in different fields of science including disease diagnosis. Retinal vessel segmentation has seen significant advancements through deep learning techniques, resulting in high accuracy. Despite this progress, challenges remain in automating the segmentation process. One of the most pressing and often overlooked issues is computational complexity, which is critical for developing portable diagnostic systems. To address this, this study introduces a CBAM-Attention Gate-based U-Netmodel aimed at reducing computational complexity without sacrificing performance on evaluation metrics. The performance of the model was analyzed using four publicly available fundus image datasets: CHASE_DB1, DRIVE, STARE, and HRF, and it achieved sensitivity, specificity, accuracy, AUC, and MCC performances (0.7909, 0.9975, 0.9723, 0.9867, and 0.8011), (0.8217, 0.9816, 0.9674, 0.9849, and 0.9778), (0.8346, 0.9790, 0.9680, 0.9855, and 0.7810), and (0.8082, 0.9769, 0.9638, 0.9723, and 0.7575), respectively. Moreover, this model comprises of only 0.8 million parameters, which makes it one of the lightest available models used for retinal vessel segmentation. This lightweight yet efficient model is most suitable for use in low-end hardware devices. The attributes of significantly lower computational complexity along with improved evaluation metrics advocates for its deployment in portable embedded devices to be used for population-level screening programs.</p>\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 1","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70031","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Over the years, researchers have been using deep learning in different fields of science including disease diagnosis. Retinal vessel segmentation has seen significant advancements through deep learning techniques, resulting in high accuracy. Despite this progress, challenges remain in automating the segmentation process. One of the most pressing and often overlooked issues is computational complexity, which is critical for developing portable diagnostic systems. To address this, this study introduces a CBAM-Attention Gate-based U-Netmodel aimed at reducing computational complexity without sacrificing performance on evaluation metrics. The performance of the model was analyzed using four publicly available fundus image datasets: CHASE_DB1, DRIVE, STARE, and HRF, and it achieved sensitivity, specificity, accuracy, AUC, and MCC performances (0.7909, 0.9975, 0.9723, 0.9867, and 0.8011), (0.8217, 0.9816, 0.9674, 0.9849, and 0.9778), (0.8346, 0.9790, 0.9680, 0.9855, and 0.7810), and (0.8082, 0.9769, 0.9638, 0.9723, and 0.7575), respectively. Moreover, this model comprises of only 0.8 million parameters, which makes it one of the lightest available models used for retinal vessel segmentation. This lightweight yet efficient model is most suitable for use in low-end hardware devices. The attributes of significantly lower computational complexity along with improved evaluation metrics advocates for its deployment in portable embedded devices to be used for population-level screening programs.

查看原文本刊更多论文

基于CBAM注意门的改进视网膜血管分割轻量级深度神经网络模型

多年来，研究人员一直在包括疾病诊断在内的不同科学领域使用深度学习。通过深度学习技术，视网膜血管分割取得了重大进展，从而提高了准确性。尽管取得了这些进展，但在自动化分割过程中仍然存在挑战。最紧迫的问题之一是计算复杂性，这对于开发便携式诊断系统至关重要。为了解决这个问题，本研究引入了一个基于cbam -注意力门的u - net模型，旨在降低计算复杂性，同时不牺牲评估指标的性能。利用CHASE_DB1、DRIVE、STARE和HRF 4个公开的眼底图像数据集对模型进行性能分析，获得了灵敏度、特异度、准确度、AUC和MCC性能分别为0.7909、0.9975、0.9723、0.9867和0.8011、0.8217、0.9816、0.9674、0.9849和0.9778、0.8346、0.9790、0.9680、0.9855和0.7810、0.8082、0.9769、0.9638、0.9723和0.7575。此外，该模型仅包含80万个参数，这使其成为用于视网膜血管分割的最轻的模型之一。这种轻量级但高效的模型最适合用于低端硬件设备。显著降低计算复杂性的特性以及改进的评估指标提倡将其部署在便携式嵌入式设备中，用于人口水平的筛选程序。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Imaging Systems and Technology 工程技术-成像科学与照相技术

CiteScore

6.90

自引率

6.10%

发文量

138

审稿时长

3 months

期刊介绍： The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals. IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging. The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered. The scope of the journal includes, but is not limited to, the following in the context of biomedical research: Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.; Neuromodulation and brain stimulation techniques such as TMS and tDCS; Software and hardware for imaging, especially related to human and animal health; Image segmentation in normal and clinical populations; Pattern analysis and classification using machine learning techniques; Computational modeling and analysis; Brain connectivity and connectomics; Systems-level characterization of brain function; Neural networks and neurorobotics; Computer vision, based on human/animal physiology; Brain-computer interface (BCI) technology; Big data, databasing and data mining.