黄斑OCT改变的三维量化提高了人工智能模型的诊断性能。

IF 2.6 3区医学 Q2 OPHTHALMOLOGY

Translational Vision Science & Technology Pub Date : 2025-07-01 DOI:10.1167/tvst.14.7.8

Lukas Heine, Anna Vahldiek, Benja Vahldiek, Fabian Hörst, Constantin Seibold, Mael Lever, Laurenz Pauleikhoff, Nikolaos Bechrakis, Daniel Pauleikhoff, Jens Kleesiek

{"title":"黄斑OCT改变的三维量化提高了人工智能模型的诊断性能。","authors":"Lukas Heine, Anna Vahldiek, Benja Vahldiek, Fabian Hörst, Constantin Seibold, Mael Lever, Laurenz Pauleikhoff, Nikolaos Bechrakis, Daniel Pauleikhoff, Jens Kleesiek","doi":"10.1167/tvst.14.7.8","DOIUrl":null,"url":null,"abstract":"Purpose: To evaluate the performance of state-of-the-art semantic segmentation methods on OCT data for age-related macular degeneration (AMD). We measured variability between annotators to quantify differences in ground truth arising from personal bias.Methods: From 94 patients suffering from exudative neovascular AMD (nAMD), 24 volume scans (49 slices each) were selected. Trained members of a reading center for AMD created pixel-wise masks for 12 retinal layers and two pathological labels (fluid, hyperreflective material) to benchmark two-dimensional (2D) and three-dimensional (3D) segmentation models on clinical data. Models were evaluated using fivefold cross-validation, and the best model was used to quantify errors between ground truth and predictions.Results: The nnU-Net (3D) achieves the best segmentation performance (mean Dice similarity coefficient [DSC] of 0.907), leaving a theoretical gap of 0.036 DSC to the mean interrater agreement, which is the upper bound of model performances. Comparing the volumes calculated for each structure using the model masks with the ground truth produced an average error of 0.065 mm3.Conclusions: Models like nnU-Net can produce high-quality 3D masks, challenging the conventional reliance on 2D slices for optimal performance. Both DSC and low average errors indicate that such a model is fit for the large-scale analysis of cohorts.Translational relevance: The presented approach can streamline clinical workflows by reducing the time and effort required for manual annotations, ultimately supporting more efficient and accurate monitoring of AMD progression and treatment response. We provide open-source access to the model weights, annotation instructions and sample data.","PeriodicalId":23322,"journal":{"name":"Translational Vision Science & Technology","volume":"14 7","pages":"8"},"PeriodicalIF":2.6000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12279069/pdf/","citationCount":"0","resultStr":"{\"title\":\"Three-Dimensional Quantification of Macular OCT Alterations Improves the Diagnostic Performance of Artificial Intelligence Models.\",\"authors\":\"Lukas Heine, Anna Vahldiek, Benja Vahldiek, Fabian Hörst, Constantin Seibold, Mael Lever, Laurenz Pauleikhoff, Nikolaos Bechrakis, Daniel Pauleikhoff, Jens Kleesiek\",\"doi\":\"10.1167/tvst.14.7.8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Purpose: To evaluate the performance of state-of-the-art semantic segmentation methods on OCT data for age-related macular degeneration (AMD). We measured variability between annotators to quantify differences in ground truth arising from personal bias.Methods: From 94 patients suffering from exudative neovascular AMD (nAMD), 24 volume scans (49 slices each) were selected. Trained members of a reading center for AMD created pixel-wise masks for 12 retinal layers and two pathological labels (fluid, hyperreflective material) to benchmark two-dimensional (2D) and three-dimensional (3D) segmentation models on clinical data. Models were evaluated using fivefold cross-validation, and the best model was used to quantify errors between ground truth and predictions.Results: The nnU-Net (3D) achieves the best segmentation performance (mean Dice similarity coefficient [DSC] of 0.907), leaving a theoretical gap of 0.036 DSC to the mean interrater agreement, which is the upper bound of model performances. Comparing the volumes calculated for each structure using the model masks with the ground truth produced an average error of 0.065 mm3.Conclusions: Models like nnU-Net can produce high-quality 3D masks, challenging the conventional reliance on 2D slices for optimal performance. Both DSC and low average errors indicate that such a model is fit for the large-scale analysis of cohorts.Translational relevance: The presented approach can streamline clinical workflows by reducing the time and effort required for manual annotations, ultimately supporting more efficient and accurate monitoring of AMD progression and treatment response. We provide open-source access to the model weights, annotation instructions and sample data.\",\"PeriodicalId\":23322,\"journal\":{\"name\":\"Translational Vision Science & Technology\",\"volume\":\"14 7\",\"pages\":\"8\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12279069/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Translational Vision Science & Technology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1167/tvst.14.7.8\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"OPHTHALMOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational Vision Science & Technology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1167/tvst.14.7.8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

目的：评价最先进的语义分割方法对年龄相关性黄斑变性（AMD） OCT数据的性能。我们测量了注释者之间的可变性，以量化由个人偏见引起的基本事实的差异。方法：选取94例渗出性新生血管性AMD （nAMD）患者24张容积扫描（每张49片）。AMD阅读中心训练有素的成员为12个视网膜层和两个病理标签（液体，高反射材料）创建了像素级掩模，以临床数据为基准进行二维（2D）和三维（3D）分割模型。使用五倍交叉验证来评估模型，并使用最佳模型来量化基本事实与预测之间的误差。结果：nnU-Net （3D）实现了最佳分割性能（平均Dice相似系数[DSC]为0.907），与平均interterater一致性的理论差距为0.036 DSC，这是模型性能的上界。将使用模型掩模计算的每个结构的体积与地面真实值进行比较，平均误差为0.065 mm3。结论：像nnU-Net这样的模型可以产生高质量的3D掩模，挑战了传统的依赖2D切片的最佳性能。DSC和较低的平均误差表明，该模型适合大规模的队列分析。翻译相关性：本文提出的方法可以通过减少人工注释所需的时间和精力来简化临床工作流程，最终支持更有效和准确地监测AMD的进展和治疗反应。我们提供对模型权重、注释指令和样本数据的开源访问。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Three-Dimensional Quantification of Macular OCT Alterations Improves the Diagnostic Performance of Artificial Intelligence Models.

查看原文本刊更多论文

Three-Dimensional Quantification of Macular OCT Alterations Improves the Diagnostic Performance of Artificial Intelligence Models.

Purpose: To evaluate the performance of state-of-the-art semantic segmentation methods on OCT data for age-related macular degeneration (AMD). We measured variability between annotators to quantify differences in ground truth arising from personal bias.

Methods: From 94 patients suffering from exudative neovascular AMD (nAMD), 24 volume scans (49 slices each) were selected. Trained members of a reading center for AMD created pixel-wise masks for 12 retinal layers and two pathological labels (fluid, hyperreflective material) to benchmark two-dimensional (2D) and three-dimensional (3D) segmentation models on clinical data. Models were evaluated using fivefold cross-validation, and the best model was used to quantify errors between ground truth and predictions.

Results: The nnU-Net (3D) achieves the best segmentation performance (mean Dice similarity coefficient [DSC] of 0.907), leaving a theoretical gap of 0.036 DSC to the mean interrater agreement, which is the upper bound of model performances. Comparing the volumes calculated for each structure using the model masks with the ground truth produced an average error of 0.065 mm3.

Conclusions: Models like nnU-Net can produce high-quality 3D masks, challenging the conventional reliance on 2D slices for optimal performance. Both DSC and low average errors indicate that such a model is fit for the large-scale analysis of cohorts.

Translational relevance: The presented approach can streamline clinical workflows by reducing the time and effort required for manual annotations, ultimately supporting more efficient and accurate monitoring of AMD progression and treatment response. We provide open-source access to the model weights, annotation instructions and sample data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Translational Vision Science & Technology Engineering-Biomedical Engineering

CiteScore

5.70

自引率

3.30%

发文量

346

审稿时长

25 weeks

期刊介绍： Translational Vision Science & Technology (TVST), an official journal of the Association for Research in Vision and Ophthalmology (ARVO), an international organization whose purpose is to advance research worldwide into understanding the visual system and preventing, treating and curing its disorders, is an online, open access, peer-reviewed journal emphasizing multidisciplinary research that bridges the gap between basic research and clinical care. A highly qualified and diverse group of Associate Editors and Editorial Board Members is led by Editor-in-Chief Marco Zarbin, MD, PhD, FARVO. The journal covers a broad spectrum of work, including but not limited to: Applications of stem cell technology for regenerative medicine, Development of new animal models of human diseases, Tissue bioengineering, Chemical engineering to improve virus-based gene delivery, Nanotechnology for drug delivery, Design and synthesis of artificial extracellular matrices, Development of a true microsurgical operating environment, Refining data analysis algorithms to improve in vivo imaging technology, Results of Phase 1 clinical trials, Reverse translational ("bedside to bench") research. TVST seeks manuscripts from scientists and clinicians with diverse backgrounds ranging from basic chemistry to ophthalmic surgery that will advance or change the way we understand and/or treat vision-threatening diseases. TVST encourages the use of color, multimedia, hyperlinks, program code and other digital enhancements.