"多模态生成式人工智能在真实世界视网膜诊所环境中的应用"

Retina Pub Date : 2024-07-03 DOI:10.1097/iae.0000000000004204

Seyyedehfatemeh Ghalibafan, David J. Taylor Gonzalez, Louis Z Cai, Brandon Graham Chou, Sugi Panneerselvam, Spencer Conrad Barrett, Mak B. Djulbegovic, Nicolas A. Yannuzzi

{"title":"\"多模态生成式人工智能在真实世界视网膜诊所环境中的应用\"","authors":"Seyyedehfatemeh Ghalibafan, David J. Taylor Gonzalez, Louis Z Cai, Brandon Graham Chou, Sugi Panneerselvam, Spencer Conrad Barrett, Mak B. Djulbegovic, Nicolas A. Yannuzzi","doi":"10.1097/iae.0000000000004204","DOIUrl":null,"url":null,"abstract":"\n \n Evaluate a large language model, GPT4 with vision (GPT-4V), for diagnosing vitreoretinal diseases in real-world ophthalmology settings.\n \n \n \n A retrospective cross-sectional study at Bascom Palmer Eye Clinic, analyzing patient data from January 2010 to March 2023, assesses GPT-4V’s performance on retinal image analysis and ICD-10 coding across two patient groups: simpler cases (Group A) and complex cases (Group B) requiring more in-depth analysis. Diagnostic accuracy was assessed through open-ended (OEQ) and multiple-choice questions (MCQs) independently verified by three retina specialists.\n \n \n \n In 256 eyes from 143 patients, GPT4-V demonstrated a 13.7% accuracy for OEQs and 31.3% for MCQs, with ICD-10 code accuracies at 5.5% and 31.3% respectively. Accurately diagnosed posterior vitreous detachment, non-exudative age-related macular degeneration, and retinal detachment. ICD-10 coding was most accurate for non-exudative age-related macular degeneration, central retinal vein occlusion, and macular hole in EOQs, and for posterior vitreous detachment, non-exudative age-related macular degeneration, and retinal detachment in MCQs. No significant difference in diagnostic or coding accuracy was found in Groups A and B.\n \n \n \n GPT-4V has potential in clinical care and record-keeping, particularly with standardized questions. Its effectiveness in open-ended scenarios is limited, indicating a significant limitation in providing complex medical advice.\n","PeriodicalId":21178,"journal":{"name":"Retina","volume":"69 1‐2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"“Applications of Multimodal Generative AI in a Real-World Retina Clinic Setting”\",\"authors\":\"Seyyedehfatemeh Ghalibafan, David J. Taylor Gonzalez, Louis Z Cai, Brandon Graham Chou, Sugi Panneerselvam, Spencer Conrad Barrett, Mak B. Djulbegovic, Nicolas A. Yannuzzi\",\"doi\":\"10.1097/iae.0000000000004204\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n \\n Evaluate a large language model, GPT4 with vision (GPT-4V), for diagnosing vitreoretinal diseases in real-world ophthalmology settings.\\n \\n \\n \\n A retrospective cross-sectional study at Bascom Palmer Eye Clinic, analyzing patient data from January 2010 to March 2023, assesses GPT-4V’s performance on retinal image analysis and ICD-10 coding across two patient groups: simpler cases (Group A) and complex cases (Group B) requiring more in-depth analysis. Diagnostic accuracy was assessed through open-ended (OEQ) and multiple-choice questions (MCQs) independently verified by three retina specialists.\\n \\n \\n \\n In 256 eyes from 143 patients, GPT4-V demonstrated a 13.7% accuracy for OEQs and 31.3% for MCQs, with ICD-10 code accuracies at 5.5% and 31.3% respectively. Accurately diagnosed posterior vitreous detachment, non-exudative age-related macular degeneration, and retinal detachment. ICD-10 coding was most accurate for non-exudative age-related macular degeneration, central retinal vein occlusion, and macular hole in EOQs, and for posterior vitreous detachment, non-exudative age-related macular degeneration, and retinal detachment in MCQs. No significant difference in diagnostic or coding accuracy was found in Groups A and B.\\n \\n \\n \\n GPT-4V has potential in clinical care and record-keeping, particularly with standardized questions. Its effectiveness in open-ended scenarios is limited, indicating a significant limitation in providing complex medical advice.\\n\",\"PeriodicalId\":21178,\"journal\":{\"name\":\"Retina\",\"volume\":\"69 1‐2\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Retina\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1097/iae.0000000000004204\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Retina","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1097/iae.0000000000004204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

评估大型语言模型 GPT4 with vision（GPT-4V）在实际眼科环境中诊断玻璃体视网膜疾病的效果。巴斯康帕尔默眼科诊所进行了一项回顾性横断面研究，分析了 2010 年 1 月至 2023 年 3 月期间的患者数据，评估了 GPT-4V 在两组患者中的视网膜图像分析和 ICD-10 编码性能：较简单的病例（A 组）和需要更深入分析的复杂病例（B 组）。诊断准确性通过开放式问题（OEQ）和多项选择题（MCQ）进行评估，由三位视网膜专家独立验证。在 143 名患者的 256 只眼睛中，GPT4-V 的开放式问答准确率为 13.7%，多选题准确率为 31.3%，ICD-10 编码准确率分别为 5.5% 和 31.3%。准确诊断出玻璃体后脱离、非渗出性老年性黄斑变性和视网膜脱离。在 EOQs 中，非渗出性老年性黄斑变性、视网膜中央静脉闭塞和黄斑孔的 ICD-10 编码最为准确；在 MCQs 中，玻璃体后脱离、非渗出性老年性黄斑变性和视网膜脱离的 ICD-10 编码最为准确。GPT-4V 在临床护理和记录保存方面具有潜力，尤其是在标准化问题方面。它在开放式场景中的效果有限，这表明它在提供复杂的医疗建议方面存在很大局限性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

“Applications of Multimodal Generative AI in a Real-World Retina Clinic Setting”

Evaluate a large language model, GPT4 with vision (GPT-4V), for diagnosing vitreoretinal diseases in real-world ophthalmology settings. A retrospective cross-sectional study at Bascom Palmer Eye Clinic, analyzing patient data from January 2010 to March 2023, assesses GPT-4V’s performance on retinal image analysis and ICD-10 coding across two patient groups: simpler cases (Group A) and complex cases (Group B) requiring more in-depth analysis. Diagnostic accuracy was assessed through open-ended (OEQ) and multiple-choice questions (MCQs) independently verified by three retina specialists. In 256 eyes from 143 patients, GPT4-V demonstrated a 13.7% accuracy for OEQs and 31.3% for MCQs, with ICD-10 code accuracies at 5.5% and 31.3% respectively. Accurately diagnosed posterior vitreous detachment, non-exudative age-related macular degeneration, and retinal detachment. ICD-10 coding was most accurate for non-exudative age-related macular degeneration, central retinal vein occlusion, and macular hole in EOQs, and for posterior vitreous detachment, non-exudative age-related macular degeneration, and retinal detachment in MCQs. No significant difference in diagnostic or coding accuracy was found in Groups A and B. GPT-4V has potential in clinical care and record-keeping, particularly with standardized questions. Its effectiveness in open-ended scenarios is limited, indicating a significant limitation in providing complex medical advice.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Retina

自引率

0.00%

发文量