Xinxu Wei, Xi Lin, Haiyun Liu, Shixuan Zhao, Yongjie Li
{"title":"Retinal Vessel Segmentation with Deep Graph and Capsule Reasoning","authors":"Xinxu Wei, Xi Lin, Haiyun Liu, Shixuan Zhao, Yongjie Li","doi":"arxiv-2409.11508","DOIUrl":null,"url":null,"abstract":"Effective retinal vessel segmentation requires a sophisticated integration of\nglobal contextual awareness and local vessel continuity. To address this\nchallenge, we propose the Graph Capsule Convolution Network (GCC-UNet), which\nmerges capsule convolutions with CNNs to capture both local and global\nfeatures. The Graph Capsule Convolution operator is specifically designed to\nenhance the representation of global context, while the Selective Graph\nAttention Fusion module ensures seamless integration of local and global\ninformation. To further improve vessel continuity, we introduce the Bottleneck\nGraph Attention module, which incorporates Channel-wise and Spatial Graph\nAttention mechanisms. The Multi-Scale Graph Fusion module adeptly combines\nfeatures from various scales. Our approach has been rigorously validated\nthrough experiments on widely used public datasets, with ablation studies\nconfirming the efficacy of each component. Comparative results highlight\nGCC-UNet's superior performance over existing methods, setting a new benchmark\nin retinal vessel segmentation. Notably, this work represents the first\nintegration of vanilla, graph, and capsule convolutional techniques in the\ndomain of medical image segmentation.","PeriodicalId":501289,"journal":{"name":"arXiv - EE - Image and Video Processing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Image and Video Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11508","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Effective retinal vessel segmentation requires a sophisticated integration of
global contextual awareness and local vessel continuity. To address this
challenge, we propose the Graph Capsule Convolution Network (GCC-UNet), which
merges capsule convolutions with CNNs to capture both local and global
features. The Graph Capsule Convolution operator is specifically designed to
enhance the representation of global context, while the Selective Graph
Attention Fusion module ensures seamless integration of local and global
information. To further improve vessel continuity, we introduce the Bottleneck
Graph Attention module, which incorporates Channel-wise and Spatial Graph
Attention mechanisms. The Multi-Scale Graph Fusion module adeptly combines
features from various scales. Our approach has been rigorously validated
through experiments on widely used public datasets, with ablation studies
confirming the efficacy of each component. Comparative results highlight
GCC-UNet's superior performance over existing methods, setting a new benchmark
in retinal vessel segmentation. Notably, this work represents the first
integration of vanilla, graph, and capsule convolutional techniques in the
domain of medical image segmentation.