Dalia Camacho-García-Formentí, Gabriela Baylón-Vázquez, Karen Arriozola-Rodríguez, Enrique Avalos-Ramirez, Curt Hartleben-Matkin, Hugo Valdez-Flores, Damaris Hodelin-Fuentes, Alejandro Noriega
{"title":"Synergistic AI-resident approach achieves superior diagnostic accuracy in tertiary ophthalmic care for glaucoma and retinal disease.","authors":"Dalia Camacho-García-Formentí, Gabriela Baylón-Vázquez, Karen Arriozola-Rodríguez, Enrique Avalos-Ramirez, Curt Hartleben-Matkin, Hugo Valdez-Flores, Damaris Hodelin-Fuentes, Alejandro Noriega","doi":"10.3389/fopht.2025.1581212","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Artificial intelligence (AI) shows promise in ophthalmology, but its potential in tertiary care settings in Latin America remains understudied. We present a Mexican AI-powered screening tool and evaluate it against first-year ophthalmology residents in a tertiary care setting in Mexico City.</p><p><strong>Methods: </strong>We analyzed data from 435 adult patients undergoing their first ophthalmic evaluation using an AI-based platform and first-year ophthalmology residents. The platform employs an Inception V3-based multi-output classification model with 512 × 512 input resolution to capture small lesions when detecting retinal disease. To evaluate glaucoma suspects, the system uses U-Net models that segment the optic disc and cup to calculate cup-to-disc ratio (CDR) from their vertical heights. The AI and resident evaluations were compared with expert annotations for retinal disease, CDR measurements, and glaucoma suspect classification. In addition, we evaluated a synergistic approach combining AI and resident assessments.</p><p><strong>Results: </strong>For glaucoma suspect classification, AI outperformed residents in accuracy (88.6% vs. 82.9%, <i>p</i> = 0.016), sensitivity (63.0% vs. 50.0%, <i>p</i> = 0.116), and specificity (94.5% vs. 90.5%, <i>p</i> = 0.062). The synergistic approach achieved a higher sensitivity (80.4%) than ophthalmic residents alone or AI alone (<i>p <</i> 0.001). AI's CDR estimates showed lower mean absolute error (0.056 vs. 0.105, <i>p <</i> 0.001) and higher correlation with expert measurements (<i>r</i> = 0.728 vs. <i>r</i> = 0.538). In the retinal disease assessment, AI demonstrated higher sensitivity (90.1% vs. 63.0% for medium/high risk, <i>p <</i> 0.001) and specificity (95.8% vs. 90.4%, <i>p <</i> 0.001). Furthermore, differences between AI and residents were statistically significant across all metrics. The synergistic approach achieved the highest sensitivity for retinal disease (92.6% for medium/high risk, 100% for high risk).</p><p><strong>Discussion: </strong>AI outperformed first-year residents in key ophthalmic assessments. The synergistic use of AI and resident assessments showed potential for optimizing diagnostic accuracy, highlighting the value of AI as a supportive tool in ophthalmic practice, especially for early career clinicians.</p>","PeriodicalId":73096,"journal":{"name":"Frontiers in ophthalmology","volume":"5 ","pages":"1581212"},"PeriodicalIF":0.9000,"publicationDate":"2025-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12127187/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in ophthalmology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fopht.2025.1581212","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Artificial intelligence (AI) shows promise in ophthalmology, but its potential in tertiary care settings in Latin America remains understudied. We present a Mexican AI-powered screening tool and evaluate it against first-year ophthalmology residents in a tertiary care setting in Mexico City.
Methods: We analyzed data from 435 adult patients undergoing their first ophthalmic evaluation using an AI-based platform and first-year ophthalmology residents. The platform employs an Inception V3-based multi-output classification model with 512 × 512 input resolution to capture small lesions when detecting retinal disease. To evaluate glaucoma suspects, the system uses U-Net models that segment the optic disc and cup to calculate cup-to-disc ratio (CDR) from their vertical heights. The AI and resident evaluations were compared with expert annotations for retinal disease, CDR measurements, and glaucoma suspect classification. In addition, we evaluated a synergistic approach combining AI and resident assessments.
Results: For glaucoma suspect classification, AI outperformed residents in accuracy (88.6% vs. 82.9%, p = 0.016), sensitivity (63.0% vs. 50.0%, p = 0.116), and specificity (94.5% vs. 90.5%, p = 0.062). The synergistic approach achieved a higher sensitivity (80.4%) than ophthalmic residents alone or AI alone (p < 0.001). AI's CDR estimates showed lower mean absolute error (0.056 vs. 0.105, p < 0.001) and higher correlation with expert measurements (r = 0.728 vs. r = 0.538). In the retinal disease assessment, AI demonstrated higher sensitivity (90.1% vs. 63.0% for medium/high risk, p < 0.001) and specificity (95.8% vs. 90.4%, p < 0.001). Furthermore, differences between AI and residents were statistically significant across all metrics. The synergistic approach achieved the highest sensitivity for retinal disease (92.6% for medium/high risk, 100% for high risk).
Discussion: AI outperformed first-year residents in key ophthalmic assessments. The synergistic use of AI and resident assessments showed potential for optimizing diagnostic accuracy, highlighting the value of AI as a supportive tool in ophthalmic practice, especially for early career clinicians.