T.Y.Alvin Liu MD , Yuxuan Liu MS , Madeleine S. Gastonguay BS , Dan Midgett PhD , Nathanael Kuo PhD , Yujie Zhao , Kareef Ullah , Gwyneth Alexander , Todd Hartman , Neslihan D. Koseoglu MD , Craig Jones PhD
{"title":"Predicting Imminent Conversion to Exudative Age-Related Macular Degeneration Using Multimodal Data and Ensemble Machine Learning","authors":"T.Y.Alvin Liu MD , Yuxuan Liu MS , Madeleine S. Gastonguay BS , Dan Midgett PhD , Nathanael Kuo PhD , Yujie Zhao , Kareef Ullah , Gwyneth Alexander , Todd Hartman , Neslihan D. Koseoglu MD , Craig Jones PhD","doi":"10.1016/j.xops.2025.100785","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>Exudative age-related macular degeneration (eAMD) is a major cause of central vision loss. Identifying patients at high risk of imminent eAMD could enable timely treatment and improve outcomes. Our goal was to develop and compare classical machine learning (ML) and deep learning (DL) models to predict imminent eAMD conversion within 6 months and integrate OCT with clinical data into a single predictive model.</div></div><div><h3>Design</h3><div>Retrospective cohort study.</div></div><div><h3>Participants</h3><div>Patients seen at the Wilmer Eye Institute between 2013 and 2021 with eAMD in ≥1 eye.</div></div><div><h3>Methods</h3><div>Spectral domain OCT volumes prior to conversion and the corresponding clinical data (age, best-corrected visual acuity, sex, and fellow-eye status) were collected and used for model training or testing. ResNet-50 and classical ML (Random Forest and XGBoost) models were trained to predict imminent conversion to eAMD within 6 months on an eye level. For the multilayer perceptron (MLP) framework, the trained ResNet-50 model was used as the feature encoder, and a downsampled feature vector concatenated with corresponding clinical tabular data was passed through the MLP (prediction head). Data were partitioned at the patient level (75% training, 15% validation, and 10% testing). Model performance was evaluated using the area under the operating characteristic curve (AUC) and 95% confidence interval (CI) for the model AUC was calculated using the percentile method after bootstrapping the test set 10 000 times. Model comparisons were made using modified paired <em>t</em> test. <em>P</em> < 0.05 was considered statistically significant.</div></div><div><h3>Main Outcome Measures</h3><div>Area under the operating characteristic curve.</div></div><div><h3>Results</h3><div>Thirty-three thousand one hundred eighty-nine OCT volumes from 2084 patients (63% female; 89.1% White, 4.8% Black, and 2.3% Asian) were included. The mean age at the time of first-eye conversion was 78.9 (± 9.3) years. Our best-performing models, “MLP multimodal” (trained with both OCT and clinical data; AUC: 0.76, 95% CI: 0.71–0.80) and “CNN OCT” (trained with only OCT data; AUC: 0.75, 95% CI: 0.70–0.79), had a DL (ResNet-50) architecture; “MLP multimodal” outperformed “CNN OCT” in predicting both all-eye (<em>P</em> < 0.05) and first-eye conversion (<em>P</em> < 0.001).</div></div><div><h3>Conclusions</h3><div>The 3-dimensional DL models, trained with OCT volumes, are capable of predicting both first-eye and fellow-eye imminent conversion to eAMD. The addition of clinical data further improved the model performance. These models, if validated prospectively, could serve as screening tools and allow retinal specialists to prioritize patients with more acute retinal issues.</div></div><div><h3>Financial Disclosure(s)</h3><div>Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.</div></div>","PeriodicalId":74363,"journal":{"name":"Ophthalmology science","volume":"5 5","pages":"Article 100785"},"PeriodicalIF":3.2000,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ophthalmology science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666914525000831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
Exudative age-related macular degeneration (eAMD) is a major cause of central vision loss. Identifying patients at high risk of imminent eAMD could enable timely treatment and improve outcomes. Our goal was to develop and compare classical machine learning (ML) and deep learning (DL) models to predict imminent eAMD conversion within 6 months and integrate OCT with clinical data into a single predictive model.
Design
Retrospective cohort study.
Participants
Patients seen at the Wilmer Eye Institute between 2013 and 2021 with eAMD in ≥1 eye.
Methods
Spectral domain OCT volumes prior to conversion and the corresponding clinical data (age, best-corrected visual acuity, sex, and fellow-eye status) were collected and used for model training or testing. ResNet-50 and classical ML (Random Forest and XGBoost) models were trained to predict imminent conversion to eAMD within 6 months on an eye level. For the multilayer perceptron (MLP) framework, the trained ResNet-50 model was used as the feature encoder, and a downsampled feature vector concatenated with corresponding clinical tabular data was passed through the MLP (prediction head). Data were partitioned at the patient level (75% training, 15% validation, and 10% testing). Model performance was evaluated using the area under the operating characteristic curve (AUC) and 95% confidence interval (CI) for the model AUC was calculated using the percentile method after bootstrapping the test set 10 000 times. Model comparisons were made using modified paired t test. P < 0.05 was considered statistically significant.
Main Outcome Measures
Area under the operating characteristic curve.
Results
Thirty-three thousand one hundred eighty-nine OCT volumes from 2084 patients (63% female; 89.1% White, 4.8% Black, and 2.3% Asian) were included. The mean age at the time of first-eye conversion was 78.9 (± 9.3) years. Our best-performing models, “MLP multimodal” (trained with both OCT and clinical data; AUC: 0.76, 95% CI: 0.71–0.80) and “CNN OCT” (trained with only OCT data; AUC: 0.75, 95% CI: 0.70–0.79), had a DL (ResNet-50) architecture; “MLP multimodal” outperformed “CNN OCT” in predicting both all-eye (P < 0.05) and first-eye conversion (P < 0.001).
Conclusions
The 3-dimensional DL models, trained with OCT volumes, are capable of predicting both first-eye and fellow-eye imminent conversion to eAMD. The addition of clinical data further improved the model performance. These models, if validated prospectively, could serve as screening tools and allow retinal specialists to prioritize patients with more acute retinal issues.
Financial Disclosure(s)
Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.