Lukas Heine, Anna Vahldiek, Benja Vahldiek, Fabian Hörst, Constantin Seibold, Mael Lever, Laurenz Pauleikhoff, Nikolaos Bechrakis, Daniel Pauleikhoff, Jens Kleesiek
{"title":"黄斑OCT改变的三维量化提高了人工智能模型的诊断性能。","authors":"Lukas Heine, Anna Vahldiek, Benja Vahldiek, Fabian Hörst, Constantin Seibold, Mael Lever, Laurenz Pauleikhoff, Nikolaos Bechrakis, Daniel Pauleikhoff, Jens Kleesiek","doi":"10.1167/tvst.14.7.8","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>To evaluate the performance of state-of-the-art semantic segmentation methods on OCT data for age-related macular degeneration (AMD). We measured variability between annotators to quantify differences in ground truth arising from personal bias.</p><p><strong>Methods: </strong>From 94 patients suffering from exudative neovascular AMD (nAMD), 24 volume scans (49 slices each) were selected. Trained members of a reading center for AMD created pixel-wise masks for 12 retinal layers and two pathological labels (fluid, hyperreflective material) to benchmark two-dimensional (2D) and three-dimensional (3D) segmentation models on clinical data. Models were evaluated using fivefold cross-validation, and the best model was used to quantify errors between ground truth and predictions.</p><p><strong>Results: </strong>The nnU-Net (3D) achieves the best segmentation performance (mean Dice similarity coefficient [DSC] of 0.907), leaving a theoretical gap of 0.036 DSC to the mean interrater agreement, which is the upper bound of model performances. Comparing the volumes calculated for each structure using the model masks with the ground truth produced an average error of 0.065 mm3.</p><p><strong>Conclusions: </strong>Models like nnU-Net can produce high-quality 3D masks, challenging the conventional reliance on 2D slices for optimal performance. Both DSC and low average errors indicate that such a model is fit for the large-scale analysis of cohorts.</p><p><strong>Translational relevance: </strong>The presented approach can streamline clinical workflows by reducing the time and effort required for manual annotations, ultimately supporting more efficient and accurate monitoring of AMD progression and treatment response. We provide open-source access to the model weights, annotation instructions and sample data.</p>","PeriodicalId":23322,"journal":{"name":"Translational Vision Science & Technology","volume":"14 7","pages":"8"},"PeriodicalIF":2.6000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12279069/pdf/","citationCount":"0","resultStr":"{\"title\":\"Three-Dimensional Quantification of Macular OCT Alterations Improves the Diagnostic Performance of Artificial Intelligence Models.\",\"authors\":\"Lukas Heine, Anna Vahldiek, Benja Vahldiek, Fabian Hörst, Constantin Seibold, Mael Lever, Laurenz Pauleikhoff, Nikolaos Bechrakis, Daniel Pauleikhoff, Jens Kleesiek\",\"doi\":\"10.1167/tvst.14.7.8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>To evaluate the performance of state-of-the-art semantic segmentation methods on OCT data for age-related macular degeneration (AMD). We measured variability between annotators to quantify differences in ground truth arising from personal bias.</p><p><strong>Methods: </strong>From 94 patients suffering from exudative neovascular AMD (nAMD), 24 volume scans (49 slices each) were selected. Trained members of a reading center for AMD created pixel-wise masks for 12 retinal layers and two pathological labels (fluid, hyperreflective material) to benchmark two-dimensional (2D) and three-dimensional (3D) segmentation models on clinical data. Models were evaluated using fivefold cross-validation, and the best model was used to quantify errors between ground truth and predictions.</p><p><strong>Results: </strong>The nnU-Net (3D) achieves the best segmentation performance (mean Dice similarity coefficient [DSC] of 0.907), leaving a theoretical gap of 0.036 DSC to the mean interrater agreement, which is the upper bound of model performances. Comparing the volumes calculated for each structure using the model masks with the ground truth produced an average error of 0.065 mm3.</p><p><strong>Conclusions: </strong>Models like nnU-Net can produce high-quality 3D masks, challenging the conventional reliance on 2D slices for optimal performance. Both DSC and low average errors indicate that such a model is fit for the large-scale analysis of cohorts.</p><p><strong>Translational relevance: </strong>The presented approach can streamline clinical workflows by reducing the time and effort required for manual annotations, ultimately supporting more efficient and accurate monitoring of AMD progression and treatment response. We provide open-source access to the model weights, annotation instructions and sample data.</p>\",\"PeriodicalId\":23322,\"journal\":{\"name\":\"Translational Vision Science & Technology\",\"volume\":\"14 7\",\"pages\":\"8\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12279069/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Translational Vision Science & Technology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1167/tvst.14.7.8\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"OPHTHALMOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational Vision Science & Technology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1167/tvst.14.7.8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
Three-Dimensional Quantification of Macular OCT Alterations Improves the Diagnostic Performance of Artificial Intelligence Models.
Purpose: To evaluate the performance of state-of-the-art semantic segmentation methods on OCT data for age-related macular degeneration (AMD). We measured variability between annotators to quantify differences in ground truth arising from personal bias.
Methods: From 94 patients suffering from exudative neovascular AMD (nAMD), 24 volume scans (49 slices each) were selected. Trained members of a reading center for AMD created pixel-wise masks for 12 retinal layers and two pathological labels (fluid, hyperreflective material) to benchmark two-dimensional (2D) and three-dimensional (3D) segmentation models on clinical data. Models were evaluated using fivefold cross-validation, and the best model was used to quantify errors between ground truth and predictions.
Results: The nnU-Net (3D) achieves the best segmentation performance (mean Dice similarity coefficient [DSC] of 0.907), leaving a theoretical gap of 0.036 DSC to the mean interrater agreement, which is the upper bound of model performances. Comparing the volumes calculated for each structure using the model masks with the ground truth produced an average error of 0.065 mm3.
Conclusions: Models like nnU-Net can produce high-quality 3D masks, challenging the conventional reliance on 2D slices for optimal performance. Both DSC and low average errors indicate that such a model is fit for the large-scale analysis of cohorts.
Translational relevance: The presented approach can streamline clinical workflows by reducing the time and effort required for manual annotations, ultimately supporting more efficient and accurate monitoring of AMD progression and treatment response. We provide open-source access to the model weights, annotation instructions and sample data.
期刊介绍:
Translational Vision Science & Technology (TVST), an official journal of the Association for Research in Vision and Ophthalmology (ARVO), an international organization whose purpose is to advance research worldwide into understanding the visual system and preventing, treating and curing its disorders, is an online, open access, peer-reviewed journal emphasizing multidisciplinary research that bridges the gap between basic research and clinical care. A highly qualified and diverse group of Associate Editors and Editorial Board Members is led by Editor-in-Chief Marco Zarbin, MD, PhD, FARVO.
The journal covers a broad spectrum of work, including but not limited to:
Applications of stem cell technology for regenerative medicine,
Development of new animal models of human diseases,
Tissue bioengineering,
Chemical engineering to improve virus-based gene delivery,
Nanotechnology for drug delivery,
Design and synthesis of artificial extracellular matrices,
Development of a true microsurgical operating environment,
Refining data analysis algorithms to improve in vivo imaging technology,
Results of Phase 1 clinical trials,
Reverse translational ("bedside to bench") research.
TVST seeks manuscripts from scientists and clinicians with diverse backgrounds ranging from basic chemistry to ophthalmic surgery that will advance or change the way we understand and/or treat vision-threatening diseases. TVST encourages the use of color, multimedia, hyperlinks, program code and other digital enhancements.