Retinal Imaging Analysis Performed By ChatGPT-4o And Gemini Advanced: The Turning Point Of The Revolution?

IF 2.3 2区 医学 Q2 OPHTHALMOLOGY
Matteo Mario Carlà, Emanuele Crincoli, Stanislao Rizzo
{"title":"Retinal Imaging Analysis Performed By ChatGPT-4o And Gemini Advanced: The Turning Point Of The Revolution?","authors":"Matteo Mario Carlà, Emanuele Crincoli, Stanislao Rizzo","doi":"10.1097/IAE.0000000000004351","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>To assess the diagnostic capabilities of the most recent chatbots releases, GPT-4o and Gemini Advanced, facing different retinal diseases.</p><p><strong>Methods: </strong>Exploratory analysis on 50 cases with different surgical (n=27) and medical (n=23) retinal pathologies, whose optical coherence tomography/angiography (OCT/OCTA) scans were dragged into ChatGPT and Gemini's interfaces. Then, we asked \"Please describe this image\" and classified the diagnosis as: 1) Correct; 2) Partially correct; 3) Wrong; 4) Unable to assess exam type and 5) Diagnosis not given.</p><p><strong>Results: </strong>ChatGPT indicated the correct diagnosis in 31/50 cases (62%), significantly higher than Gemini Advanced 16/50 cases (p=0.0048). In 24% of cases, Gemini Advanced was not able to produce any answer, stating \"That's not something I'm able to do yet\". For both, primary misdiagnosis was macular edema, given erroneously in 16% and 14% of cases, respectively. ChatGPT-4o showed higher rates of correct diagnoses either in surgical (52% vs 30%) and medical retina (78% vs 43%). Notably, when presented without the corresponding structural image, in any case Gemini was able to recognize OCTA scans, confusing images for artworks.</p><p><strong>Conclusion: </strong>ChatGPT-4o outperformed Gemini Advanced in terms of diagnostic accuracy facing OCT/OCTA images, even if the range of diagnoses is still limited.</p>","PeriodicalId":54486,"journal":{"name":"Retina-The Journal of Retinal and Vitreous Diseases","volume":" ","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2024-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Retina-The Journal of Retinal and Vitreous Diseases","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/IAE.0000000000004351","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: To assess the diagnostic capabilities of the most recent chatbots releases, GPT-4o and Gemini Advanced, facing different retinal diseases.

Methods: Exploratory analysis on 50 cases with different surgical (n=27) and medical (n=23) retinal pathologies, whose optical coherence tomography/angiography (OCT/OCTA) scans were dragged into ChatGPT and Gemini's interfaces. Then, we asked "Please describe this image" and classified the diagnosis as: 1) Correct; 2) Partially correct; 3) Wrong; 4) Unable to assess exam type and 5) Diagnosis not given.

Results: ChatGPT indicated the correct diagnosis in 31/50 cases (62%), significantly higher than Gemini Advanced 16/50 cases (p=0.0048). In 24% of cases, Gemini Advanced was not able to produce any answer, stating "That's not something I'm able to do yet". For both, primary misdiagnosis was macular edema, given erroneously in 16% and 14% of cases, respectively. ChatGPT-4o showed higher rates of correct diagnoses either in surgical (52% vs 30%) and medical retina (78% vs 43%). Notably, when presented without the corresponding structural image, in any case Gemini was able to recognize OCTA scans, confusing images for artworks.

Conclusion: ChatGPT-4o outperformed Gemini Advanced in terms of diagnostic accuracy facing OCT/OCTA images, even if the range of diagnoses is still limited.

chatgpt - 40和Gemini Advanced视网膜成像分析:革命的转折点?
目的:评估最新发布的聊天机器人gpt - 40和Gemini Advanced在面对不同视网膜疾病时的诊断能力。方法:对50例不同手术和内科病理的视网膜病变患者(n=27)进行探索性分析,这些患者的光学相干断层扫描/血管造影(OCT/OCTA)扫描被拖入ChatGPT和Gemini的界面。然后,我们问“请描述这个图像”,并将诊断分类为:1)正确;2)部分正确;3)错了;4)无法评估检查类型;5)未给出诊断。结果:ChatGPT诊断正确率为31/50例(62%),显著高于Gemini Advanced 16/50例(p=0.0048)。在24%的情况下,Gemini Advanced无法给出任何答案,并表示“这不是我能做的事情”。对于这两种情况,主要的误诊是黄斑水肿,分别有16%和14%的病例被误诊。chatgpt - 40在外科(52%对30%)和医学视网膜(78%对43%)中显示出更高的正确诊断率。值得注意的是,当没有相应的结构图像呈现时,双子座在任何情况下都能够识别OCTA扫描,将图像混淆为艺术品。结论:chatgpt - 40在面对OCT/OCTA图像的诊断准确性方面优于Gemini Advanced,即使诊断范围仍然有限。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.70
自引率
9.10%
发文量
554
审稿时长
3-6 weeks
期刊介绍: ​RETINA® focuses exclusively on the growing specialty of vitreoretinal disorders. The Journal provides current information on diagnostic and therapeutic techniques. Its highly specialized and informative, peer-reviewed articles are easily applicable to clinical practice. In addition to regular reports from clinical and basic science investigators, RETINA® publishes special features including periodic review articles on pertinent topics, special articles dealing with surgical and other therapeutic techniques, and abstract cards. Issues are abundantly illustrated in vivid full color. Published 12 times per year, RETINA® is truly a “must have” publication for anyone connected to this field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信