Retinal Imaging Analysis Performed By ChatGPT-4o And Gemini Advanced: The Turning Point Of The Revolution?

IF 2.3 | Zone 2, Medicine | JCR Q2, Ophthalmology

Retina-The Journal of Retinal and Vitreous Diseases | Pub Date: 2024-12-11 | DOI: 10.1097/IAE.0000000000004351

Matteo Mario Carlà, Emanuele Crincoli, Stanislao Rizzo

Citations: 0

Abstract

Purpose: To assess the diagnostic capabilities of the most recent chatbot releases, GPT-4o and Gemini Advanced, when presented with different retinal diseases.

Methods: Exploratory analysis of 50 cases with different surgical (n=27) and medical (n=23) retinal pathologies, whose optical coherence tomography/angiography (OCT/OCTA) scans were dragged into the ChatGPT and Gemini interfaces. We then asked "Please describe this image" and classified the diagnosis as: 1) Correct; 2) Partially correct; 3) Wrong; 4) Unable to assess exam type; and 5) Diagnosis not given.

Results: ChatGPT indicated the correct diagnosis in 31/50 cases (62%), significantly more than Gemini Advanced's 16/50 cases (32%; p=0.0048). In 24% of cases, Gemini Advanced was unable to produce any answer, stating "That's not something I'm able to do yet". For both chatbots, the most common misdiagnosis was macular edema, given erroneously in 16% and 14% of cases, respectively. ChatGPT-4o showed higher rates of correct diagnoses in both surgical retina (52% vs 30%) and medical retina (78% vs 43%). Notably, when OCTA scans were presented without the corresponding structural image, in no case was Gemini able to recognize them, mistaking the images for artworks.

Conclusion: ChatGPT-4o outperformed Gemini Advanced in diagnostic accuracy on OCT/OCTA images, although the range of diagnoses it can provide is still limited.
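The headline comparison in the Results (31/50 vs 16/50 correct diagnoses) can be checked with a short calculation. The abstract does not say which statistical test produced p=0.0048, so the sketch below assumes a two-sided Fisher's exact test on the 2x2 table of correct vs not-correct answers. The function name `fisher_exact_two_sided` is hypothetical, and the value it returns may differ slightly from the published one if the authors used a different test.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    ASSUMPTION: the abstract does not name the test behind p=0.0048;
    Fisher's exact test is used here only as a plausible choice.
    """
    row1, row2 = a + b, c + d      # cases per chatbot
    col1 = a + c                   # total correct diagnoses across both
    n = row1 + row2
    denom = comb(n, col1)

    def prob(k):
        # Hypergeometric probability that row 1 holds k of the col1 correct cases
        return comb(row1, k) * comb(row2, col1 - k) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum the probabilities of all tables at least as extreme as the observed one
    return sum(p for p in (prob(k) for k in range(lo, hi + 1))
               if p <= p_obs * (1 + 1e-9))


# Counts taken from the abstract: correct diagnoses out of 50 cases each
n_cases = 50
chatgpt_correct, gemini_correct = 31, 16

chatgpt_acc = chatgpt_correct / n_cases
gemini_acc = gemini_correct / n_cases
p_value = fisher_exact_two_sided(
    chatgpt_correct, n_cases - chatgpt_correct,
    gemini_correct, n_cases - gemini_correct,
)

print(f"ChatGPT-4o accuracy:      {chatgpt_acc:.0%}")
print(f"Gemini Advanced accuracy: {gemini_acc:.0%}")
print(f"Two-sided Fisher exact p: {p_value:.4f}")
```

Only the "Correct" category is counted as a success here; the other four categories (partially correct, wrong, unable to assess exam type, diagnosis not given) are pooled as "not correct", which is also an assumption about how the comparison was made.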

Source journal

Retina-The Journal of Retinal and Vitreous Diseases (Medicine: Ophthalmology)

CiteScore: 5.70

Self-citation rate: 9.10%

Articles published: 554

Review time: 3-6 weeks

Journal description: RETINA® focuses exclusively on the growing specialty of vitreoretinal disorders. The Journal provides current information on diagnostic and therapeutic techniques. Its highly specialized and informative, peer-reviewed articles are easily applicable to clinical practice. In addition to regular reports from clinical and basic science investigators, RETINA® publishes special features including periodic review articles on pertinent topics, special articles dealing with surgical and other therapeutic techniques, and abstract cards. Issues are abundantly illustrated in vivid full color. Published 12 times per year, RETINA® is truly a “must have” publication for anyone connected to this field.