Global–Local Hybrid Modulation Network for Retinal Vessel and Coronary Angiograph Segmentation
Pengfei Cai, Biyuan Li, Jinying Ma, Xiao Tian, Jun Yan
Journal of Bionic Engineering, Vol. 22, No. 4, pp. 2050–2074. Published 2025-08-06. DOI: 10.1007/s42235-025-00727-3
https://link.springer.com/article/10.1007/s42235-025-00727-3
Abstract
The segmentation of retinal vessels and coronary angiographs is essential for diagnosing conditions such as glaucoma, diabetes, hypertension, and coronary artery disease. However, retinal vessels and coronary angiographs are characterized by low contrast and complex structures, which makes vessel segmentation challenging. Moreover, CNN-based approaches are limited in capturing long-range pixel relationships because of their focus on local feature extraction, while ViT-based approaches struggle to capture fine local details, which hampers tasks such as vessel segmentation that require precise boundary detection. To address these issues, we propose the Global–Local Hybrid Modulation Network (GLHM-Net), a dual-encoder architecture that combines the strengths of CNNs and ViTs for vessel segmentation. First, the Hybrid Non-Local Transformer Block (HNLTB) is proposed to efficiently consolidate long-range spatial dependencies into a compact feature representation, providing a global perspective while significantly reducing computational overhead. Second, the Collaborative Attention Fusion Block (CAFB) is proposed to more effectively integrate local and global vessel features at the same hierarchical level during the encoding phase. Finally, the proposed Feature Cross-Modulation Block (FCMB) complements the local and global features in the decoding stage, enhancing feature learning and minimizing information loss. Experiments on the DRIVE, CHASEDB1, DCA1, and XCAD datasets yield AUC values of 0.9811, 0.9864, 0.9915, and 0.9919, F1 scores of 0.8288, 0.8202, 0.8040, and 0.8150, and IoU values of 0.7076, 0.6952, 0.6723, and 0.6878, respectively, demonstrating the strong performance of the proposed network for vessel segmentation.
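The abstract names the HNLTB, CAFB, and FCMB modules but does not give their internals, so the following is a minimal, hypothetical PyTorch sketch of the general dual-branch idea it describes: a convolutional (local) branch and a self-attention (global) branch whose outputs are fused by a learned gate. The class name GlobalLocalFusionSketch, the gating scheme, and all hyperparameters are assumptions for illustration only and are not the paper's actual blocks.

```python
# Hypothetical sketch of a dual-branch global-local fusion block in PyTorch.
# This is NOT the paper's HNLTB/CAFB/FCMB; it only illustrates combining a
# convolutional (local) branch with an attention-based (global) branch and
# mixing them with a learned per-pixel gate. All names are assumptions.
import torch
import torch.nn as nn


class GlobalLocalFusionSketch(nn.Module):
    def __init__(self, channels: int, num_heads: int = 4):
        super().__init__()
        # Local branch: a plain 3x3 convolution captures fine vessel detail.
        self.local_branch = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
        )
        # Global branch: multi-head self-attention over flattened spatial
        # positions captures long-range dependencies.
        self.norm = nn.LayerNorm(channels)
        self.attn = nn.MultiheadAttention(channels, num_heads, batch_first=True)
        # Fusion gate: a 1x1 convolution produces a per-pixel weight that
        # mixes the two branches (a crude stand-in for attention fusion).
        self.gate = nn.Sequential(
            nn.Conv2d(2 * channels, channels, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        local = self.local_branch(x)

        # Flatten spatial dims to a token sequence for self-attention.
        tokens = self.norm(x.flatten(2).transpose(1, 2))      # (B, H*W, C)
        global_feat, _ = self.attn(tokens, tokens, tokens)
        global_feat = global_feat.transpose(1, 2).reshape(b, c, h, w)

        # Gated fusion of the local and global branches.
        g = self.gate(torch.cat([local, global_feat], dim=1))
        return g * local + (1.0 - g) * global_feat


if __name__ == "__main__":
    feat = torch.randn(1, 32, 32, 32)    # small feature map for a quick check
    fused = GlobalLocalFusionSketch(32)(feat)
    print(fused.shape)                   # torch.Size([1, 32, 32, 32])
```

In an actual dual-encoder design such as the one the abstract outlines, a block of this kind would sit at each encoder level (and a cross-modulation counterpart in the decoder); the sigmoid gate here is only one of several plausible fusion choices.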
Journal introduction:
The Journal of Bionic Engineering (JBE) is a peer-reviewed journal that publishes original research papers and reviews that apply the knowledge learned from nature and biological systems to solve concrete engineering problems. The topics that JBE covers include but are not limited to:
Mechanisms, kinematical mechanics and control of animal locomotion, development of mobile robots with walking (running and crawling), swimming or flying abilities inspired by animal locomotion.
Structures, morphologies, composition and physical properties of natural and biomaterials; fabrication of new materials mimicking the properties and functions of natural and biomaterials.
Biomedical materials, artificial organs and tissue engineering for medical applications; rehabilitation equipment and devices.
Development of bioinspired computation methods and artificial intelligence for engineering applications.