Yang Li;Lingfu Xu;Yizhu Jin;Xihe Kuang;Yue Zhang;Weigang Cui;Teng Zhang
{"title":"Diffusion Probabilistic Learning With Gate-Fusion Transformer and Edge-Frequency Attention for Retinal Vessel Segmentation","authors":"Yang Li;Lingfu Xu;Yizhu Jin;Xihe Kuang;Yue Zhang;Weigang Cui;Teng Zhang","doi":"10.1109/TIM.2024.3420264","DOIUrl":null,"url":null,"abstract":"Retinal vessel topology provides unique biological information for the diagnosis of fundus diseases. However, most existing deep learning-based vessel segmentation methods mainly focus on global fundus structure, which may suffer from generalization errors and blurring caused by lesions and image noise. Besides, vessel edge details and feature channel information are generally ignored or not considered simultaneously, and this insufficiency commonly leads to suboptimal segmentation performance. To tackle these issues, we propose a novel diffusion probabilistic learning with gate-fusion transformer and edge-frequency attention (DPL-GFT-EFA) for retinal vessel segmentation. Specifically, the DPL leverages the image denoising as a proxy task to pretrain the segmentation model, which enhances the anti-interference ability by learning noise-related information. Then, the gate-fusion transformer (GFT) block fuses high-level representations from condition and diffusion encoders (DEs) with a gate mechanism, highlighting the mutual features between fundus patterns and noisy images. Finally, the edge-frequency attention (EFA) block is introduced to further consolidate the vessel edge details and discriminative channel features. We conduct the experiments on five public retinal image datasets, and achieve the accuracies of 97.05%, 97.70%, 97.71%, 97.16%, and 97.26% on DRIVE, STARE, CHASE_DB1, HRF, and IOSTAR datasets, respectively. These results demonstrate that the proposed method outperforms state-of-the-art models and achieve promising segmentation performance even in complex images containing fundus lesions and noise. Our source code is available at \n<uri>https://github.com/YangLibuaa/DPL-GTF-EFA</uri>\n.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"73 ","pages":"1-13"},"PeriodicalIF":5.6000,"publicationDate":"2024-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10577222/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Retinal vessel topology provides unique biological information for the diagnosis of fundus diseases. However, most existing deep learning-based vessel segmentation methods mainly focus on global fundus structure, which may suffer from generalization errors and blurring caused by lesions and image noise. Besides, vessel edge details and feature channel information are generally ignored or not considered simultaneously, and this insufficiency commonly leads to suboptimal segmentation performance. To tackle these issues, we propose a novel diffusion probabilistic learning with gate-fusion transformer and edge-frequency attention (DPL-GFT-EFA) for retinal vessel segmentation. Specifically, the DPL leverages the image denoising as a proxy task to pretrain the segmentation model, which enhances the anti-interference ability by learning noise-related information. Then, the gate-fusion transformer (GFT) block fuses high-level representations from condition and diffusion encoders (DEs) with a gate mechanism, highlighting the mutual features between fundus patterns and noisy images. Finally, the edge-frequency attention (EFA) block is introduced to further consolidate the vessel edge details and discriminative channel features. We conduct the experiments on five public retinal image datasets, and achieve the accuracies of 97.05%, 97.70%, 97.71%, 97.16%, and 97.26% on DRIVE, STARE, CHASE_DB1, HRF, and IOSTAR datasets, respectively. These results demonstrate that the proposed method outperforms state-of-the-art models and achieve promising segmentation performance even in complex images containing fundus lesions and noise. Our source code is available at
https://github.com/YangLibuaa/DPL-GTF-EFA
.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.