C.E. Monera Lucas, C. Mora Caballero, J. Escolano Serrano, A. Machan, G. Castilla Martínez, D. Romero Valero, J. Campello Lluch
Analysis of ChatGPT-4's performance on ophthalmology questions from the MIR exam
Archivos de la Sociedad Espanola de Oftalmologia, vol. 100, no. 6, pp. 314-319. Published 2025-06-01. DOI: 10.1016/j.oftale.2025.05.002
https://www.sciencedirect.com/science/article/pii/S2173579425000775
Abstract
Purpose
To evaluate the performance of ChatGPT in solving clinical scenarios in ophthalmology, specifically ophthalmology questions from the Spanish specialty training entrance exam for resident medical interns (Médico Interno Residente, MIR).
Design
Cross-sectional design for evaluating a diagnostic tool.
Method
Ophthalmology questions from the 2010–2023 sessions of the MIR exam were collected. The proportion of questions that ChatGPT answered correctly was calculated, and the results were compared with those obtained by ophthalmology professionals. Sensitivity, specificity, and positive and negative likelihood ratios were also calculated.
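As an illustration of the quantities named in the Method, the following minimal Python sketch computes sensitivity, specificity, and the positive and negative likelihood ratios (the "probability coefficients" mentioned above) from a 2x2 table of counts. The abstract does not describe how answers to multiple-choice questions were mapped onto true/false positives and negatives, so the function name `diagnostic_metrics` and the counts used here are hypothetical and are not the study's data.

```python
def diagnostic_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Standard diagnostic-accuracy metrics from 2x2 counts.

    tp/fn: positive cases classified correctly/incorrectly.
    tn/fp: negative cases classified correctly/incorrectly.
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")

    sensitivity = tp / (tp + fn)  # true positive rate
    specificity = tn / (tn + fp)  # true negative rate

    # LR+ = sensitivity / (1 - specificity); undefined (infinite) if specificity is 1.
    lr_positive = sensitivity / (1 - specificity) if specificity < 1 else float("inf")
    # LR- = (1 - sensitivity) / specificity; undefined (infinite) if specificity is 0.
    lr_negative = (1 - sensitivity) / specificity if specificity > 0 else float("inf")

    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "lr_positive": lr_positive,
        "lr_negative": lr_negative,
    }


# Hypothetical counts chosen only to illustrate the arithmetic.
metrics = diagnostic_metrics(tp=45, fp=5, fn=5, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

A likelihood ratio well above 1 for a positive result, or well below 1 for a negative result, indicates that the result shifts the probability of the condition substantially.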
Results
A total of 54 questions were collected; the "Retina" subspecialty was the most frequently represented. ChatGPT's overall score was 90.2%, with a sensitivity of 92.59% and a specificity of 96.8%. The average concordance between ChatGPT's answers and the evaluators' answers was 86.41%. The agreement between the evaluators was 79.62%.
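The concordance and agreement figures above can be read as simple percent agreement: the share of questions on which two respondents chose the same option. The abstract does not state the exact definition the authors used, so the following Python sketch is an assumption; the helper `percent_agreement` and the answer lists are hypothetical.

```python
from typing import Sequence


def percent_agreement(answers_a: Sequence[str], answers_b: Sequence[str]) -> float:
    """Percentage of questions on which two respondents chose the same option."""
    if len(answers_a) != len(answers_b):
        raise ValueError("both respondents must answer the same questions")
    if not answers_a:
        raise ValueError("at least one question is required")

    matches = sum(a == b for a, b in zip(answers_a, answers_b))
    return 100.0 * matches / len(answers_a)


# Hypothetical answers to five multiple-choice questions (not study data).
model_answers = ["A", "C", "B", "D", "A"]
evaluator_answers = ["A", "C", "D", "D", "A"]

agreement = percent_agreement(model_answers, evaluator_answers)
print(f"Agreement: {agreement:.2f}%")
```

Percent agreement does not correct for matches expected by chance; a chance-corrected statistic such as Cohen's kappa would be a stricter alternative, although the abstract does not say whether one was used.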
Conclusions
ChatGPT-4 is a useful tool for solving clinical scenarios and theoretical questions in ophthalmology. Proper use of the tool, supervised by professionals, can help optimize the care processes for ophthalmology patients.