Opportunities for Improving Glaucoma Clinical Trials via Deep Learning-Based Identification of Patients with Low Visual Field Variability

IF 2.8 Q1 OPHTHALMOLOGY

Ophthalmology. Glaucoma Pub Date : 2024-05-01 DOI:10.1016/j.ogla.2024.01.005

Ruolin Wang BA , Chris Bradley PhD , Patrick Herbert , Kaihua Hou BA , Gregory D. Hager PhD , Katharina Breininger PhD , Mathias Unberath PhD , Pradeep Ramulu MD, PhD , Jithin Yohannan MD, MPH

{"title":"Opportunities for Improving Glaucoma Clinical Trials via Deep Learning-Based Identification of Patients with Low Visual Field Variability","authors":"Ruolin Wang BA , Chris Bradley PhD , Patrick Herbert , Kaihua Hou BA , Gregory D. Hager PhD , Katharina Breininger PhD , Mathias Unberath PhD , Pradeep Ramulu MD, PhD , Jithin Yohannan MD, MPH","doi":"10.1016/j.ogla.2024.01.005","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><p>Develop and evaluate the performance of a deep learning model (DLM) that forecasts eyes with low future visual field (VF) variability, and study the impact of using this DLM on sample size requirements for neuroprotective trials.</p></div><div><h3>Design</h3><p>Retrospective cohort and simulation study.</p></div><div><h3>Methods</h3><p>We included 1 eye per patient with baseline reliable VFs, OCT, clinical measures (demographics, intraocular pressure, and visual acuity), and 5 subsequent reliable VFs to forecast VF variability using DLMs and perform sample size estimates. We estimated sample size for 3 groups of eyes: all eyes (AE), low variability eyes (LVE: the subset of AE with a standard deviation of mean deviation [MD] slope residuals in the bottom 25th percentile), and DLM-predicted low variability eyes (DLPE: the subset of AE predicted to be low variability by the DLM). Deep learning models using only baseline VF/OCT/clinical data as input (DLM1), or also using a second VF (DLM2) were constructed to predict low VF variability (DLPE1 and DLPE2, respectively). Data were split 60/10/30 into train/val/test. Clinical trial simulations were performed only on the test set. We estimated the sample size necessary to detect treatment effects of 20% to 50% in MD slope with 80% power. Power was defined as the percentage of simulated clinical trials where the MD slope was significantly worse from the control. Clinical trials were simulated with visits every 3 months with a total of 10 visits.</p></div><div><h3>Results</h3><p>A total of 2817 eyes were included in the analysis. Deep learning models 1 and 2 achieved an area under the receiver operating characteristic curve of 0.73 (95% confidence interval [CI]: 0.68, 0.76) and 0.82 (95% CI: 0.78, 0.85) in forecasting low VF variability. When compared with including AE, using DLPE1 and DLPE2 reduced sample size to achieve 80% power by 30% and 38% for 30% treatment effect, and 31% and 38% for 50% treatment effect.</p></div><div><h3>Conclusions</h3><p>Deep learning models can forecast eyes with low VF variability using data from a single baseline clinical visit. This can reduce sample size requirements, and potentially reduce the burden of future glaucoma clinical trials.</p></div><div><h3>Financial Disclosure(s)</h3><p>Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.</p></div>","PeriodicalId":19519,"journal":{"name":"Ophthalmology. Glaucoma","volume":"7 3","pages":"Pages 222-231"},"PeriodicalIF":2.8000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ophthalmology. Glaucoma","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589419624000139","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose

Develop and evaluate the performance of a deep learning model (DLM) that forecasts eyes with low future visual field (VF) variability, and study the impact of using this DLM on sample size requirements for neuroprotective trials.

Design

Retrospective cohort and simulation study.

Methods

We included 1 eye per patient with baseline reliable VFs, OCT, clinical measures (demographics, intraocular pressure, and visual acuity), and 5 subsequent reliable VFs to forecast VF variability using DLMs and perform sample size estimates. We estimated sample size for 3 groups of eyes: all eyes (AE), low variability eyes (LVE: the subset of AE with a standard deviation of mean deviation [MD] slope residuals in the bottom 25th percentile), and DLM-predicted low variability eyes (DLPE: the subset of AE predicted to be low variability by the DLM). Deep learning models using only baseline VF/OCT/clinical data as input (DLM1), or also using a second VF (DLM2) were constructed to predict low VF variability (DLPE1 and DLPE2, respectively). Data were split 60/10/30 into train/val/test. Clinical trial simulations were performed only on the test set. We estimated the sample size necessary to detect treatment effects of 20% to 50% in MD slope with 80% power. Power was defined as the percentage of simulated clinical trials where the MD slope was significantly worse from the control. Clinical trials were simulated with visits every 3 months with a total of 10 visits.

Results

A total of 2817 eyes were included in the analysis. Deep learning models 1 and 2 achieved an area under the receiver operating characteristic curve of 0.73 (95% confidence interval [CI]: 0.68, 0.76) and 0.82 (95% CI: 0.78, 0.85) in forecasting low VF variability. When compared with including AE, using DLPE1 and DLPE2 reduced sample size to achieve 80% power by 30% and 38% for 30% treatment effect, and 31% and 38% for 50% treatment effect.

Conclusions

Deep learning models can forecast eyes with low VF variability using data from a single baseline clinical visit. This can reduce sample size requirements, and potentially reduce the burden of future glaucoma clinical trials.

Financial Disclosure(s)

Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.

查看原文本刊更多论文

通过基于深度学习的低视野变异性患者识别改进青光眼临床试验的机会。

目的：开发并评估可预测未来视野（VF）变异性较低的眼睛的深度学习模型（DLM）的性能，并研究使用该 DLM 对神经保护试验样本量要求的影响设计：回顾性队列和模拟研究方法：我们为每位患者纳入一只具有基线可靠 VF、光学相干断层扫描（OCT）、临床测量（人口统计学、眼压、视力）和 5 次后续可靠 VF 的眼睛，以便使用 DLM 预测 VF 变异性并进行样本量估算。我们估算了三组眼睛的样本量：所有眼睛（AE）、低变异性眼睛（LVE：平均差 [MD] 斜率残差的标准偏差在第 25 百分位数以下的 AE 子集）和 DLM 预测的低变异性眼睛（DLPE：DLM 预测为低变异性的 AE 子集）。仅使用基线 VF/OCT/临床数据作为输入（DLM1）或同时使用第二个 VF（DLM2）构建 DLM，以预测低 VF 变异性（分别为 DLPE1 和 DLPE2）。数据以 60/10/30 的比例分成训练/评估/测试。临床试验模拟仅在测试集中进行。我们估算了以 80% 的功率检测 MD 斜坡 20% 至 50% 的治疗效果所需的样本量。功率定义为MD斜率明显低于对照组的模拟临床试验的百分比。模拟临床试验每 3 个月访问一次，共访问 10 次：共有 2,817 只眼睛被纳入分析。DLM1 和 DLM2 预测低 VF 变异的接收器操作特征曲线下面积分别为 0.73（95% CI：0.68, 0.76）和 0.82（95% CI：0.78, 0.85）。与包括AE相比，使用DLPE1和DLPE2可使样本量减少30%和38%（30%治疗效果）和31%和38%（50%治疗效果），以达到80%的功率：结论：DLM 可以利用单次基线临床访问的数据预测 VF 变异性较低的眼睛。结论：DLMs 可以利用单次基线临床访问的数据预测 VF 变异性低的眼，从而降低样本量要求，并有可能减轻未来青光眼临床试验的负担。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊