Foundation models in ophthalmology: opportunities and challenges.

IF 2.6 2区医学 Q1 OPHTHALMOLOGY

Current Opinion in Ophthalmology Pub Date : 2025-01-01 Epub Date: 2024-11-04 DOI:10.1097/ICU.0000000000001091

Mertcan Sevgi, Eden Ruffell, Fares Antaki, Mark A Chia, Pearse A Keane

{"title":"Foundation models in ophthalmology: opportunities and challenges.","authors":"Mertcan Sevgi, Eden Ruffell, Fares Antaki, Mark A Chia, Pearse A Keane","doi":"10.1097/ICU.0000000000001091","DOIUrl":null,"url":null,"abstract":"Purpose of review: Last year marked the development of the first foundation model in ophthalmology, RETFound, setting the stage for generalizable medical artificial intelligence (GMAI) that can adapt to novel tasks. Additionally, rapid advancements in large language model (LLM) technology, including models such as GPT-4 and Gemini, have been tailored for medical specialization and evaluated on clinical scenarios with promising results. This review explores the opportunities and challenges for further advancements in these technologies.Recent findings: RETFound outperforms traditional deep learning models in specific tasks, even when only fine-tuned on small datasets. Additionally, LMMs like Med-Gemini and Medprompt GPT-4 perform better than out-of-the-box models for ophthalmology tasks. However, there is still a significant deficiency in ophthalmology-specific multimodal models. This gap is primarily due to the substantial computational resources required to train these models and the limitations of high-quality ophthalmology datasets.Summary: Overall, foundation models in ophthalmology present promising opportunities but face challenges, particularly the need for high-quality, standardized datasets for training and specialization. Although development has primarily focused on large language and vision models, the greatest opportunities lie in advancing large multimodal models, which can more closely mimic the capabilities of clinicians.","PeriodicalId":50604,"journal":{"name":"Current Opinion in Ophthalmology","volume":" ","pages":"90-98"},"PeriodicalIF":2.6000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11620320/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Opinion in Ophthalmology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/ICU.0000000000001091","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose of review: Last year marked the development of the first foundation model in ophthalmology, RETFound, setting the stage for generalizable medical artificial intelligence (GMAI) that can adapt to novel tasks. Additionally, rapid advancements in large language model (LLM) technology, including models such as GPT-4 and Gemini, have been tailored for medical specialization and evaluated on clinical scenarios with promising results. This review explores the opportunities and challenges for further advancements in these technologies.

Recent findings: RETFound outperforms traditional deep learning models in specific tasks, even when only fine-tuned on small datasets. Additionally, LMMs like Med-Gemini and Medprompt GPT-4 perform better than out-of-the-box models for ophthalmology tasks. However, there is still a significant deficiency in ophthalmology-specific multimodal models. This gap is primarily due to the substantial computational resources required to train these models and the limitations of high-quality ophthalmology datasets.

Summary: Overall, foundation models in ophthalmology present promising opportunities but face challenges, particularly the need for high-quality, standardized datasets for training and specialization. Although development has primarily focused on large language and vision models, the greatest opportunities lie in advancing large multimodal models, which can more closely mimic the capabilities of clinicians.

Abstract Image

查看原文本刊更多论文

眼科基础模式：机遇与挑战。

回顾的目的：去年，眼科领域开发出首个基础模型 RETFound，为可适应新任务的通用医学人工智能（GMAI）奠定了基础。此外，包括 GPT-4 和 Gemini 等模型在内的大型语言模型（LLM）技术也取得了突飞猛进的发展，这些模型已针对医学专业进行了定制，并在临床场景中进行了评估，取得了可喜的成果。本综述探讨了这些技术进一步发展的机遇和挑战：RETFound 在特定任务中的表现优于传统的深度学习模型，即使仅在小型数据集上进行微调也是如此。此外，Med-Gemini 和 Medprompt GPT-4 等 LMM 在眼科任务中的表现优于开箱即用的模型。然而，眼科专用的多模态模型仍然存在很大的不足。总结：总体而言，眼科基础模型带来了良好的机遇，但也面临着挑战，特别是需要高质量、标准化的数据集来进行训练和专业化。虽然开发工作主要集中在大型语言和视觉模型上，但最大的机遇在于推进大型多模态模型的开发，因为这些模型能更接近临床医生的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Current Opinion in Ophthalmology 医学-眼科学

CiteScore

6.80

自引率

5.40%

发文量

120

审稿时长

6-12 weeks

期刊介绍： Current Opinion in Ophthalmology is an indispensable resource featuring key up-to-date and important advances in the field from around the world. With renowned guest editors for each section, every bimonthly issue of Current Opinion in Ophthalmology delivers a fresh insight into topics such as glaucoma, refractive surgery and corneal and external disorders. With ten sections in total, the journal provides a convenient and thorough review of the field and will be of interest to researchers, clinicians and other healthcare professionals alike.