Retinal vessel segmentation driven by structure prior tokens

IF 3.2 2区医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Medical physics Pub Date : 2025-09-03 DOI:10.1002/mp.18018

Jiaqi Guo, Xinyu Guo, Quanyong Yi, Huaying Hao, Yitian Zhao

{"title":"Retinal vessel segmentation driven by structure prior tokens","authors":"Jiaqi Guo, Xinyu Guo, Quanyong Yi, Huaying Hao, Yitian Zhao","doi":"10.1002/mp.18018","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Accurate retinal vessel segmentation from Optical Coherence Tomography Angiography (OCTA) images is vital in ophthalmic medicine, particularly for the early diagnosis and monitoring of diseases, such as diabetic retinopathy and hypertensive retinopathy. The retinal vascular system exhibits complex characteristics, including branching, crossing, and continuity, which are crucial for precise segmentation and subsequent medical analysis. However, traditional pixel-wise vessel segmentation methods focus on learning how to effectively divide each pixel into different categories, relying mainly on local features, such as intensity and texture, and often neglecting the intrinsic structural properties of vessels. This can cause suboptimal segmentation accuracy and robustness, particularly when handling low-contrast, noisy, or pathological images.</p>\n </section>\n \n <section>\n \n <h3> Purpose</h3>\n \n <p>This study aims to integrate structural priors into a segmentation framework. Prior embeddings are used to guide the segmentation process, which encode the typical morphology and topological structure of blood vessels. Incorporating these embeddings can improve the accuracy of retinal vessel segmentation, particularly in challenging areas such as small vessels and regions with ambiguous boundaries. This approach could help to preserve the integrity and continuity of the vascular structure, resulting in more reliable and precise segmentation.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>This study adopts a generative image segmentation framework. A structured representation in a latent embedding space is presented to explore retinal vessel priors. On this basis, a prior-driven retinal vessel segmentation network is introduced. First, the retinal vessel priors from ground truth data are learned, which are encoded as embedding tokens through a residual quantization reconstruction network. The learned priors are stored in a codebook. In our network, a raw OCTA image is transformed into semantic features using an encoder. Each semantic feature is subsequently represented by a set of embedding tokens from the codebook. Finally, the retinal vessels are reconstructed, preserving the integrity and continuity of the vascular structures using the learned structural priors.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The performance of our proposed network was assessed across three OCTA datasets: publically available ROSE-1 and ROSE-2, and the private dataset OCTA-Z. Both quantitative and qualitative evaluations revealed that our network outperformed current state-of-the-art methods. In particular, our approach achieved the average Dice scores of 77.63, 71.01, and 81.11% for ROSE-1, ROSE-2, and OCTA-Z, respectively.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>The experimental results demonstrated that the proposed network effectively learned and utilized implicit vessel priors for OCTA vessel segmentation. Extensive evaluations indicated that our network surpassed the current state-of-the-art methods. Specifically, our reconstruction approach of applying latent prior tokens offered a promising solution for the pattern representation of retinal vessels. In the future, we will extend this method to support more comprehensive retinal structure segmentation and eye-related disease classification tasks. Additionally, we plan to integrate diverse and detailed prior knowledge, such as anatomical and pathological information, to enhance the performance and versatility of our approach.</p>\n </section>\n </div>","PeriodicalId":18384,"journal":{"name":"Medical physics","volume":"52 8","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"3","ListUrlMain":"https://aapm.onlinelibrary.wiley.com/doi/10.1002/mp.18018","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Background

Accurate retinal vessel segmentation from Optical Coherence Tomography Angiography (OCTA) images is vital in ophthalmic medicine, particularly for the early diagnosis and monitoring of diseases, such as diabetic retinopathy and hypertensive retinopathy. The retinal vascular system exhibits complex characteristics, including branching, crossing, and continuity, which are crucial for precise segmentation and subsequent medical analysis. However, traditional pixel-wise vessel segmentation methods focus on learning how to effectively divide each pixel into different categories, relying mainly on local features, such as intensity and texture, and often neglecting the intrinsic structural properties of vessels. This can cause suboptimal segmentation accuracy and robustness, particularly when handling low-contrast, noisy, or pathological images.

Purpose

This study aims to integrate structural priors into a segmentation framework. Prior embeddings are used to guide the segmentation process, which encode the typical morphology and topological structure of blood vessels. Incorporating these embeddings can improve the accuracy of retinal vessel segmentation, particularly in challenging areas such as small vessels and regions with ambiguous boundaries. This approach could help to preserve the integrity and continuity of the vascular structure, resulting in more reliable and precise segmentation.

Methods

This study adopts a generative image segmentation framework. A structured representation in a latent embedding space is presented to explore retinal vessel priors. On this basis, a prior-driven retinal vessel segmentation network is introduced. First, the retinal vessel priors from ground truth data are learned, which are encoded as embedding tokens through a residual quantization reconstruction network. The learned priors are stored in a codebook. In our network, a raw OCTA image is transformed into semantic features using an encoder. Each semantic feature is subsequently represented by a set of embedding tokens from the codebook. Finally, the retinal vessels are reconstructed, preserving the integrity and continuity of the vascular structures using the learned structural priors.

Results

The performance of our proposed network was assessed across three OCTA datasets: publically available ROSE-1 and ROSE-2, and the private dataset OCTA-Z. Both quantitative and qualitative evaluations revealed that our network outperformed current state-of-the-art methods. In particular, our approach achieved the average Dice scores of 77.63, 71.01, and 81.11% for ROSE-1, ROSE-2, and OCTA-Z, respectively.

Conclusions

The experimental results demonstrated that the proposed network effectively learned and utilized implicit vessel priors for OCTA vessel segmentation. Extensive evaluations indicated that our network surpassed the current state-of-the-art methods. Specifically, our reconstruction approach of applying latent prior tokens offered a promising solution for the pattern representation of retinal vessels. In the future, we will extend this method to support more comprehensive retinal structure segmentation and eye-related disease classification tasks. Additionally, we plan to integrate diverse and detailed prior knowledge, such as anatomical and pathological information, to enhance the performance and versatility of our approach.

Abstract Image

查看原文本刊更多论文

基于结构先验标记的视网膜血管分割

从光学相干断层扫描血管造影（OCTA）图像中准确分割视网膜血管在眼科医学中至关重要，特别是对于糖尿病视网膜病变和高血压视网膜病变等疾病的早期诊断和监测。视网膜血管系统表现出复杂的特征，包括分支、交叉和连续性，这对精确分割和随后的医学分析至关重要。然而，传统的基于像素的血管分割方法侧重于学习如何有效地将每个像素划分为不同的类别，主要依赖于局部特征，如强度和纹理，而往往忽略了血管的内在结构特性。这可能导致次优分割精度和鲁棒性，特别是在处理低对比度、噪声或病理图像时。本研究旨在将结构先验整合到分割框架中。利用先验嵌入来指导分割过程，对血管的典型形态和拓扑结构进行编码。结合这些嵌入可以提高视网膜血管分割的准确性，特别是在具有挑战性的区域，如小血管和边界模糊的区域。这种方法有助于保持血管结构的完整性和连续性，从而获得更可靠和精确的分割。方法采用生成式图像分割框架。在潜在嵌入空间中提出了一种结构化表示来探索视网膜血管先验。在此基础上，提出了一种先验驱动的视网膜血管分割网络。首先，从基真数据中学习视网膜血管先验，并通过残差量化重建网络将其编码为嵌入标记。学习到的先验被存储在一个代码本中。在我们的网络中，使用编码器将原始OCTA图像转换为语义特征。每个语义特征随后由码本中的一组嵌入令牌表示。最后，利用学习到的结构先验对视网膜血管进行重构，保持血管结构的完整性和连续性。我们提出的网络性能在三个OCTA数据集上进行了评估：公开可用的ROSE-1和ROSE-2，以及私有数据集OCTA- z。定量和定性评估都表明，我们的网络优于目前最先进的方法。特别是，我们的方法在ROSE-1、ROSE-2和OCTA-Z上的平均Dice得分分别为77.63、71.01和81.11%。结论该网络能够有效地学习和利用隐式血管先验进行OCTA血管分割。广泛的评估表明，我们的网络超越了目前最先进的方法。具体来说，我们应用潜在先验标记的重建方法为视网膜血管的模式表示提供了一个有希望的解决方案。在未来，我们将扩展该方法，以支持更全面的视网膜结构分割和眼相关疾病分类任务。此外，我们计划整合各种详细的先验知识，如解剖和病理信息，以提高我们方法的性能和通用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical physics 医学-核医学

CiteScore

6.80

自引率

15.80%

发文量

660

审稿时长

1.7 months

期刊介绍： Medical Physics publishes original, high impact physics, imaging science, and engineering research that advances patient diagnosis and therapy through contributions in 1) Basic science developments with high potential for clinical translation 2) Clinical applications of cutting edge engineering and physics innovations 3) Broadly applicable and innovative clinical physics developments Medical Physics is a journal of global scope and reach. By publishing in Medical Physics your research will reach an international, multidisciplinary audience including practicing medical physicists as well as physics- and engineering based translational scientists. We work closely with authors of promising articles to improve their quality.