{"title":"基于结构先验标记的视网膜血管分割","authors":"Jiaqi Guo, Xinyu Guo, Quanyong Yi, Huaying Hao, Yitian Zhao","doi":"10.1002/mp.18018","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Accurate retinal vessel segmentation from Optical Coherence Tomography Angiography (OCTA) images is vital in ophthalmic medicine, particularly for the early diagnosis and monitoring of diseases, such as diabetic retinopathy and hypertensive retinopathy. The retinal vascular system exhibits complex characteristics, including branching, crossing, and continuity, which are crucial for precise segmentation and subsequent medical analysis. However, traditional pixel-wise vessel segmentation methods focus on learning how to effectively divide each pixel into different categories, relying mainly on local features, such as intensity and texture, and often neglecting the intrinsic structural properties of vessels. This can cause suboptimal segmentation accuracy and robustness, particularly when handling low-contrast, noisy, or pathological images.</p>\n </section>\n \n <section>\n \n <h3> Purpose</h3>\n \n <p>This study aims to integrate structural priors into a segmentation framework. Prior embeddings are used to guide the segmentation process, which encode the typical morphology and topological structure of blood vessels. Incorporating these embeddings can improve the accuracy of retinal vessel segmentation, particularly in challenging areas such as small vessels and regions with ambiguous boundaries. This approach could help to preserve the integrity and continuity of the vascular structure, resulting in more reliable and precise segmentation.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>This study adopts a generative image segmentation framework. A structured representation in a latent embedding space is presented to explore retinal vessel priors. On this basis, a prior-driven retinal vessel segmentation network is introduced. First, the retinal vessel priors from ground truth data are learned, which are encoded as embedding tokens through a residual quantization reconstruction network. The learned priors are stored in a codebook. In our network, a raw OCTA image is transformed into semantic features using an encoder. Each semantic feature is subsequently represented by a set of embedding tokens from the codebook. Finally, the retinal vessels are reconstructed, preserving the integrity and continuity of the vascular structures using the learned structural priors.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The performance of our proposed network was assessed across three OCTA datasets: publically available ROSE-1 and ROSE-2, and the private dataset OCTA-Z. Both quantitative and qualitative evaluations revealed that our network outperformed current state-of-the-art methods. In particular, our approach achieved the average Dice scores of 77.63, 71.01, and 81.11% for ROSE-1, ROSE-2, and OCTA-Z, respectively.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>The experimental results demonstrated that the proposed network effectively learned and utilized implicit vessel priors for OCTA vessel segmentation. Extensive evaluations indicated that our network surpassed the current state-of-the-art methods. Specifically, our reconstruction approach of applying latent prior tokens offered a promising solution for the pattern representation of retinal vessels. In the future, we will extend this method to support more comprehensive retinal structure segmentation and eye-related disease classification tasks. Additionally, we plan to integrate diverse and detailed prior knowledge, such as anatomical and pathological information, to enhance the performance and versatility of our approach.</p>\n </section>\n </div>","PeriodicalId":18384,"journal":{"name":"Medical physics","volume":"52 8","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Retinal vessel segmentation driven by structure prior tokens\",\"authors\":\"Jiaqi Guo, Xinyu Guo, Quanyong Yi, Huaying Hao, Yitian Zhao\",\"doi\":\"10.1002/mp.18018\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Background</h3>\\n \\n <p>Accurate retinal vessel segmentation from Optical Coherence Tomography Angiography (OCTA) images is vital in ophthalmic medicine, particularly for the early diagnosis and monitoring of diseases, such as diabetic retinopathy and hypertensive retinopathy. The retinal vascular system exhibits complex characteristics, including branching, crossing, and continuity, which are crucial for precise segmentation and subsequent medical analysis. However, traditional pixel-wise vessel segmentation methods focus on learning how to effectively divide each pixel into different categories, relying mainly on local features, such as intensity and texture, and often neglecting the intrinsic structural properties of vessels. This can cause suboptimal segmentation accuracy and robustness, particularly when handling low-contrast, noisy, or pathological images.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Purpose</h3>\\n \\n <p>This study aims to integrate structural priors into a segmentation framework. Prior embeddings are used to guide the segmentation process, which encode the typical morphology and topological structure of blood vessels. Incorporating these embeddings can improve the accuracy of retinal vessel segmentation, particularly in challenging areas such as small vessels and regions with ambiguous boundaries. This approach could help to preserve the integrity and continuity of the vascular structure, resulting in more reliable and precise segmentation.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>This study adopts a generative image segmentation framework. A structured representation in a latent embedding space is presented to explore retinal vessel priors. On this basis, a prior-driven retinal vessel segmentation network is introduced. First, the retinal vessel priors from ground truth data are learned, which are encoded as embedding tokens through a residual quantization reconstruction network. The learned priors are stored in a codebook. In our network, a raw OCTA image is transformed into semantic features using an encoder. Each semantic feature is subsequently represented by a set of embedding tokens from the codebook. Finally, the retinal vessels are reconstructed, preserving the integrity and continuity of the vascular structures using the learned structural priors.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>The performance of our proposed network was assessed across three OCTA datasets: publically available ROSE-1 and ROSE-2, and the private dataset OCTA-Z. Both quantitative and qualitative evaluations revealed that our network outperformed current state-of-the-art methods. In particular, our approach achieved the average Dice scores of 77.63, 71.01, and 81.11% for ROSE-1, ROSE-2, and OCTA-Z, respectively.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusions</h3>\\n \\n <p>The experimental results demonstrated that the proposed network effectively learned and utilized implicit vessel priors for OCTA vessel segmentation. Extensive evaluations indicated that our network surpassed the current state-of-the-art methods. Specifically, our reconstruction approach of applying latent prior tokens offered a promising solution for the pattern representation of retinal vessels. In the future, we will extend this method to support more comprehensive retinal structure segmentation and eye-related disease classification tasks. Additionally, we plan to integrate diverse and detailed prior knowledge, such as anatomical and pathological information, to enhance the performance and versatility of our approach.</p>\\n </section>\\n </div>\",\"PeriodicalId\":18384,\"journal\":{\"name\":\"Medical physics\",\"volume\":\"52 8\",\"pages\":\"\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical physics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://aapm.onlinelibrary.wiley.com/doi/10.1002/mp.18018\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"3","ListUrlMain":"https://aapm.onlinelibrary.wiley.com/doi/10.1002/mp.18018","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
Retinal vessel segmentation driven by structure prior tokens
Background
Accurate retinal vessel segmentation from Optical Coherence Tomography Angiography (OCTA) images is vital in ophthalmic medicine, particularly for the early diagnosis and monitoring of diseases, such as diabetic retinopathy and hypertensive retinopathy. The retinal vascular system exhibits complex characteristics, including branching, crossing, and continuity, which are crucial for precise segmentation and subsequent medical analysis. However, traditional pixel-wise vessel segmentation methods focus on learning how to effectively divide each pixel into different categories, relying mainly on local features, such as intensity and texture, and often neglecting the intrinsic structural properties of vessels. This can cause suboptimal segmentation accuracy and robustness, particularly when handling low-contrast, noisy, or pathological images.
Purpose
This study aims to integrate structural priors into a segmentation framework. Prior embeddings are used to guide the segmentation process, which encode the typical morphology and topological structure of blood vessels. Incorporating these embeddings can improve the accuracy of retinal vessel segmentation, particularly in challenging areas such as small vessels and regions with ambiguous boundaries. This approach could help to preserve the integrity and continuity of the vascular structure, resulting in more reliable and precise segmentation.
Methods
This study adopts a generative image segmentation framework. A structured representation in a latent embedding space is presented to explore retinal vessel priors. On this basis, a prior-driven retinal vessel segmentation network is introduced. First, the retinal vessel priors from ground truth data are learned, which are encoded as embedding tokens through a residual quantization reconstruction network. The learned priors are stored in a codebook. In our network, a raw OCTA image is transformed into semantic features using an encoder. Each semantic feature is subsequently represented by a set of embedding tokens from the codebook. Finally, the retinal vessels are reconstructed, preserving the integrity and continuity of the vascular structures using the learned structural priors.
Results
The performance of our proposed network was assessed across three OCTA datasets: publically available ROSE-1 and ROSE-2, and the private dataset OCTA-Z. Both quantitative and qualitative evaluations revealed that our network outperformed current state-of-the-art methods. In particular, our approach achieved the average Dice scores of 77.63, 71.01, and 81.11% for ROSE-1, ROSE-2, and OCTA-Z, respectively.
Conclusions
The experimental results demonstrated that the proposed network effectively learned and utilized implicit vessel priors for OCTA vessel segmentation. Extensive evaluations indicated that our network surpassed the current state-of-the-art methods. Specifically, our reconstruction approach of applying latent prior tokens offered a promising solution for the pattern representation of retinal vessels. In the future, we will extend this method to support more comprehensive retinal structure segmentation and eye-related disease classification tasks. Additionally, we plan to integrate diverse and detailed prior knowledge, such as anatomical and pathological information, to enhance the performance and versatility of our approach.
期刊介绍:
Medical Physics publishes original, high impact physics, imaging science, and engineering research that advances patient diagnosis and therapy through contributions in 1) Basic science developments with high potential for clinical translation 2) Clinical applications of cutting edge engineering and physics innovations 3) Broadly applicable and innovative clinical physics developments
Medical Physics is a journal of global scope and reach. By publishing in Medical Physics your research will reach an international, multidisciplinary audience including practicing medical physicists as well as physics- and engineering based translational scientists. We work closely with authors of promising articles to improve their quality.