Mak B Djulbegovic, Henry Bair, David J Taylor Gonzalez, Hiroshi Ishikawa, Gadi Wollstein, Joel S Schuman
{"title":"Artificial Intelligence for Optical Coherence Tomography in Glaucoma.","authors":"Mak B Djulbegovic, Henry Bair, David J Taylor Gonzalez, Hiroshi Ishikawa, Gadi Wollstein, Joel S Schuman","doi":"10.1167/tvst.14.1.27","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>The integration of artificial intelligence (AI), particularly deep learning (DL), with optical coherence tomography (OCT) offers significant opportunities in the diagnosis and management of glaucoma. This article explores the application of various DL models in enhancing OCT capabilities and addresses the challenges associated with their clinical implementation.</p><p><strong>Methods: </strong>A review of articles utilizing DL models was conducted, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), generative adversarial networks (GANs), autoencoders, and large language models (LLMs). Key developments and practical applications of these models in OCT image analysis were emphasized, particularly in the context of enhancing image quality, glaucoma diagnosis, and monitoring progression.</p><p><strong>Results: </strong>CNNs excel in segmenting retinal layers and detecting glaucomatous damage, whereas RNNs are effective in analyzing sequential OCT scans for disease progression. GANs enhance image quality and data augmentation, and autoencoders facilitate advanced feature extraction. LLMs show promise in integrating textual and visual data for comprehensive diagnostic assessments. Despite these advancements, challenges such as data availability, variability, potential biases, and the need for extensive validation persist.</p><p><strong>Conclusions: </strong>DL models are reshaping glaucoma management by enhancing OCT's diagnostic capabilities. However, the successful translation into clinical practice requires addressing major challenges related to data variability, biases, fairness, and model validation to ensure accurate and reliable patient care.</p><p><strong>Translational relevance: </strong>This review bridges the gap between basic research and clinical care by demonstrating how AI, particularly DL models, can markedly enhance OCT's clinical utility in diagnosis, monitoring, and prediction, moving toward more individualized, personalized, and precise treatment strategies.</p>","PeriodicalId":23322,"journal":{"name":"Translational Vision Science & Technology","volume":"14 1","pages":"27"},"PeriodicalIF":2.6000,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11760759/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational Vision Science & Technology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1167/tvst.14.1.27","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: The integration of artificial intelligence (AI), particularly deep learning (DL), with optical coherence tomography (OCT) offers significant opportunities in the diagnosis and management of glaucoma. This article explores the application of various DL models in enhancing OCT capabilities and addresses the challenges associated with their clinical implementation.
Methods: A review of articles utilizing DL models was conducted, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), generative adversarial networks (GANs), autoencoders, and large language models (LLMs). Key developments and practical applications of these models in OCT image analysis were emphasized, particularly in the context of enhancing image quality, glaucoma diagnosis, and monitoring progression.
Results: CNNs excel in segmenting retinal layers and detecting glaucomatous damage, whereas RNNs are effective in analyzing sequential OCT scans for disease progression. GANs enhance image quality and data augmentation, and autoencoders facilitate advanced feature extraction. LLMs show promise in integrating textual and visual data for comprehensive diagnostic assessments. Despite these advancements, challenges such as data availability, variability, potential biases, and the need for extensive validation persist.
Conclusions: DL models are reshaping glaucoma management by enhancing OCT's diagnostic capabilities. However, the successful translation into clinical practice requires addressing major challenges related to data variability, biases, fairness, and model validation to ensure accurate and reliable patient care.
Translational relevance: This review bridges the gap between basic research and clinical care by demonstrating how AI, particularly DL models, can markedly enhance OCT's clinical utility in diagnosis, monitoring, and prediction, moving toward more individualized, personalized, and precise treatment strategies.
目的:人工智能(AI),尤其是深度学习(DL)与光学相干断层扫描(OCT)的整合为青光眼的诊断和管理提供了重大机遇。本文探讨了各种深度学习模型在增强 OCT 功能方面的应用,并探讨了与临床应用相关的挑战:方法:对使用 DL 模型的文章进行了综述,包括卷积神经网络 (CNN)、递归神经网络 (RNN)、生成对抗网络 (GAN)、自动编码器和大型语言模型 (LLM)。会议强调了这些模型在 OCT 图像分析中的主要发展和实际应用,特别是在提高图像质量、青光眼诊断和监测进展方面:结果:CNNs 在分割视网膜层和检测青光眼损伤方面表现出色,而 RNNs 在分析连续 OCT 扫描以了解疾病进展方面也很有效。GANs 可提高图像质量和数据增量,而自动编码器可促进高级特征提取。LLM 在整合文本和视觉数据以进行综合诊断评估方面大有可为。尽管取得了这些进步,但数据可用性、可变性、潜在偏差以及需要广泛验证等挑战依然存在:结论:DL 模型通过增强 OCT 的诊断能力,正在重塑青光眼管理。然而,要成功转化为临床实践,需要解决与数据可变性、偏差、公平性和模型验证有关的主要挑战,以确保准确可靠的患者护理:本综述通过展示人工智能(尤其是 DL 模型)如何显著提高 OCT 在诊断、监测和预测方面的临床实用性,从而实现更加个体化、个性化和精确的治疗策略,在基础研究与临床治疗之间架起了一座桥梁。
期刊介绍:
Translational Vision Science & Technology (TVST), an official journal of the Association for Research in Vision and Ophthalmology (ARVO), an international organization whose purpose is to advance research worldwide into understanding the visual system and preventing, treating and curing its disorders, is an online, open access, peer-reviewed journal emphasizing multidisciplinary research that bridges the gap between basic research and clinical care. A highly qualified and diverse group of Associate Editors and Editorial Board Members is led by Editor-in-Chief Marco Zarbin, MD, PhD, FARVO.
The journal covers a broad spectrum of work, including but not limited to:
Applications of stem cell technology for regenerative medicine,
Development of new animal models of human diseases,
Tissue bioengineering,
Chemical engineering to improve virus-based gene delivery,
Nanotechnology for drug delivery,
Design and synthesis of artificial extracellular matrices,
Development of a true microsurgical operating environment,
Refining data analysis algorithms to improve in vivo imaging technology,
Results of Phase 1 clinical trials,
Reverse translational ("bedside to bench") research.
TVST seeks manuscripts from scientists and clinicians with diverse backgrounds ranging from basic chemistry to ophthalmic surgery that will advance or change the way we understand and/or treat vision-threatening diseases. TVST encourages the use of color, multimedia, hyperlinks, program code and other digital enhancements.