Seyyedehfatemeh Ghalibafan, David J. Taylor Gonzalez, Louis Z Cai, Brandon Graham Chou, Sugi Panneerselvam, Spencer Conrad Barrett, Mak B. Djulbegovic, Nicolas A. Yannuzzi
{"title":"\"多模态生成式人工智能在真实世界视网膜诊所环境中的应用\"","authors":"Seyyedehfatemeh Ghalibafan, David J. Taylor Gonzalez, Louis Z Cai, Brandon Graham Chou, Sugi Panneerselvam, Spencer Conrad Barrett, Mak B. Djulbegovic, Nicolas A. Yannuzzi","doi":"10.1097/iae.0000000000004204","DOIUrl":null,"url":null,"abstract":"\n \n Evaluate a large language model, GPT4 with vision (GPT-4V), for diagnosing vitreoretinal diseases in real-world ophthalmology settings.\n \n \n \n A retrospective cross-sectional study at Bascom Palmer Eye Clinic, analyzing patient data from January 2010 to March 2023, assesses GPT-4V’s performance on retinal image analysis and ICD-10 coding across two patient groups: simpler cases (Group A) and complex cases (Group B) requiring more in-depth analysis. Diagnostic accuracy was assessed through open-ended (OEQ) and multiple-choice questions (MCQs) independently verified by three retina specialists.\n \n \n \n In 256 eyes from 143 patients, GPT4-V demonstrated a 13.7% accuracy for OEQs and 31.3% for MCQs, with ICD-10 code accuracies at 5.5% and 31.3% respectively. Accurately diagnosed posterior vitreous detachment, non-exudative age-related macular degeneration, and retinal detachment. ICD-10 coding was most accurate for non-exudative age-related macular degeneration, central retinal vein occlusion, and macular hole in EOQs, and for posterior vitreous detachment, non-exudative age-related macular degeneration, and retinal detachment in MCQs. No significant difference in diagnostic or coding accuracy was found in Groups A and B.\n \n \n \n GPT-4V has potential in clinical care and record-keeping, particularly with standardized questions. Its effectiveness in open-ended scenarios is limited, indicating a significant limitation in providing complex medical advice.\n","PeriodicalId":21178,"journal":{"name":"Retina","volume":"69 1‐2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"“Applications of Multimodal Generative AI in a Real-World Retina Clinic Setting”\",\"authors\":\"Seyyedehfatemeh Ghalibafan, David J. Taylor Gonzalez, Louis Z Cai, Brandon Graham Chou, Sugi Panneerselvam, Spencer Conrad Barrett, Mak B. Djulbegovic, Nicolas A. Yannuzzi\",\"doi\":\"10.1097/iae.0000000000004204\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n \\n Evaluate a large language model, GPT4 with vision (GPT-4V), for diagnosing vitreoretinal diseases in real-world ophthalmology settings.\\n \\n \\n \\n A retrospective cross-sectional study at Bascom Palmer Eye Clinic, analyzing patient data from January 2010 to March 2023, assesses GPT-4V’s performance on retinal image analysis and ICD-10 coding across two patient groups: simpler cases (Group A) and complex cases (Group B) requiring more in-depth analysis. Diagnostic accuracy was assessed through open-ended (OEQ) and multiple-choice questions (MCQs) independently verified by three retina specialists.\\n \\n \\n \\n In 256 eyes from 143 patients, GPT4-V demonstrated a 13.7% accuracy for OEQs and 31.3% for MCQs, with ICD-10 code accuracies at 5.5% and 31.3% respectively. Accurately diagnosed posterior vitreous detachment, non-exudative age-related macular degeneration, and retinal detachment. ICD-10 coding was most accurate for non-exudative age-related macular degeneration, central retinal vein occlusion, and macular hole in EOQs, and for posterior vitreous detachment, non-exudative age-related macular degeneration, and retinal detachment in MCQs. No significant difference in diagnostic or coding accuracy was found in Groups A and B.\\n \\n \\n \\n GPT-4V has potential in clinical care and record-keeping, particularly with standardized questions. Its effectiveness in open-ended scenarios is limited, indicating a significant limitation in providing complex medical advice.\\n\",\"PeriodicalId\":21178,\"journal\":{\"name\":\"Retina\",\"volume\":\"69 1‐2\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Retina\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1097/iae.0000000000004204\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Retina","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1097/iae.0000000000004204","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
“Applications of Multimodal Generative AI in a Real-World Retina Clinic Setting”
Evaluate a large language model, GPT4 with vision (GPT-4V), for diagnosing vitreoretinal diseases in real-world ophthalmology settings.
A retrospective cross-sectional study at Bascom Palmer Eye Clinic, analyzing patient data from January 2010 to March 2023, assesses GPT-4V’s performance on retinal image analysis and ICD-10 coding across two patient groups: simpler cases (Group A) and complex cases (Group B) requiring more in-depth analysis. Diagnostic accuracy was assessed through open-ended (OEQ) and multiple-choice questions (MCQs) independently verified by three retina specialists.
In 256 eyes from 143 patients, GPT4-V demonstrated a 13.7% accuracy for OEQs and 31.3% for MCQs, with ICD-10 code accuracies at 5.5% and 31.3% respectively. Accurately diagnosed posterior vitreous detachment, non-exudative age-related macular degeneration, and retinal detachment. ICD-10 coding was most accurate for non-exudative age-related macular degeneration, central retinal vein occlusion, and macular hole in EOQs, and for posterior vitreous detachment, non-exudative age-related macular degeneration, and retinal detachment in MCQs. No significant difference in diagnostic or coding accuracy was found in Groups A and B.
GPT-4V has potential in clinical care and record-keeping, particularly with standardized questions. Its effectiveness in open-ended scenarios is limited, indicating a significant limitation in providing complex medical advice.