Zufar Mulyukov PhD , Pearse A. Keane FRCOphth, MD , Jayashree Sahni FRCOphth, MD , Sandra Liakopoulos MD , Katja Hatz MD , Daniel Shu Wei Ting MD, PhD , Roberto Gallego-Pinazo MD, PhD , Tariq Aslam PhD, DM(Oxon) , Chui Ming Gemmy Cheung FRCOphth, MD , Gabriella De Salvo FRCOphth, MD , Oudy Semoun MD , Gábor Márk Somfai MD, PhD , Andreas Stahl MD , Brandon J. Lujan MD , Daniel Lorand MSc
{"title":"基于人工智能的疾病活动监测,个性化治疗新生血管性老年黄斑变性:可行性研究","authors":"Zufar Mulyukov PhD , Pearse A. Keane FRCOphth, MD , Jayashree Sahni FRCOphth, MD , Sandra Liakopoulos MD , Katja Hatz MD , Daniel Shu Wei Ting MD, PhD , Roberto Gallego-Pinazo MD, PhD , Tariq Aslam PhD, DM(Oxon) , Chui Ming Gemmy Cheung FRCOphth, MD , Gabriella De Salvo FRCOphth, MD , Oudy Semoun MD , Gábor Márk Somfai MD, PhD , Andreas Stahl MD , Brandon J. Lujan MD , Daniel Lorand MSc","doi":"10.1016/j.xops.2024.100565","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><p>To evaluate the performance of a disease activity (DA) model developed to detect DA in participants with neovascular age-related macular degeneration (nAMD).</p></div><div><h3>Design</h3><p>Post hoc analysis.</p></div><div><h3>Participants</h3><p>Patient dataset from the phase III HAWK and HARRIER (H&H) studies.</p></div><div><h3>Methods</h3><p>An artificial intelligence (AI)-based DA model was developed to generate a DA score based on measurements of OCT images and other parameters collected from H&H study participants. Disease activity assessments were classified into 3 categories based on the extent of agreement between the DA model’s scores and the H&H investigators’ decisions: agreement (“easy”), disagreement (“noisy”), and close to the decision boundary (“difficult”). Then, a panel of 10 international retina specialists (“panelists”) reviewed a sample of DA assessments of these 3 categories that contributed to the training of the final DA model. A panelists’ majority vote on the reviewed cases was used to evaluate the accuracy, sensitivity, and specificity of the DA model.</p></div><div><h3>Main Outcome Measures</h3><p>The DA model’s performance in detecting DA compared with the DA assessments made by the investigators and panelists’ majority vote.</p></div><div><h3>Results</h3><p>A total of 4472 OCT DA assessments were used to develop the model; of these, panelists reviewed 425, categorized as “easy” (17.2%), “noisy” (20.5%), and “difficult” (62.4%). False-positive and false negative rates of the DA model’s assessments decreased after changing the assessment in some cases reviewed by the panelists and retraining the DA model. Overall, the DA model achieved 80% accuracy. For “easy” cases, the DA model reached 96% accuracy and performed as well as the investigators (96% accuracy) and panelists (90% accuracy). For “noisy” cases, the DA model performed similarly to panelists and outperformed the investigators (84%, 86%, and 16% accuracies, respectively). The DA model also outperformed the investigators for “difficult” cases (74% and 53% accuracies, respectively) but underperformed the panelists (86% accuracy) owing to lower specificity. Subretinal and intraretinal fluids were the main clinical parameters driving the DA assessments made by the panelists.</p></div><div><h3>Conclusions</h3><p>These results demonstrate the potential of using an AI-based DA model to optimize treatment decisions in the clinical setting and in detecting and monitoring DA in patients with nAMD.</p></div><div><h3>Financial Disclosure(s)</h3><p>Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.</p></div>","PeriodicalId":74363,"journal":{"name":"Ophthalmology science","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666914524001015/pdfft?md5=65ab512316012a4b8940a3501fab3963&pid=1-s2.0-S2666914524001015-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Artificial Intelligence-Based Disease Activity Monitoring to Personalized Neovascular Age-Related Macular Degeneration Treatment: A Feasibility Study\",\"authors\":\"Zufar Mulyukov PhD , Pearse A. Keane FRCOphth, MD , Jayashree Sahni FRCOphth, MD , Sandra Liakopoulos MD , Katja Hatz MD , Daniel Shu Wei Ting MD, PhD , Roberto Gallego-Pinazo MD, PhD , Tariq Aslam PhD, DM(Oxon) , Chui Ming Gemmy Cheung FRCOphth, MD , Gabriella De Salvo FRCOphth, MD , Oudy Semoun MD , Gábor Márk Somfai MD, PhD , Andreas Stahl MD , Brandon J. Lujan MD , Daniel Lorand MSc\",\"doi\":\"10.1016/j.xops.2024.100565\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Purpose</h3><p>To evaluate the performance of a disease activity (DA) model developed to detect DA in participants with neovascular age-related macular degeneration (nAMD).</p></div><div><h3>Design</h3><p>Post hoc analysis.</p></div><div><h3>Participants</h3><p>Patient dataset from the phase III HAWK and HARRIER (H&H) studies.</p></div><div><h3>Methods</h3><p>An artificial intelligence (AI)-based DA model was developed to generate a DA score based on measurements of OCT images and other parameters collected from H&H study participants. Disease activity assessments were classified into 3 categories based on the extent of agreement between the DA model’s scores and the H&H investigators’ decisions: agreement (“easy”), disagreement (“noisy”), and close to the decision boundary (“difficult”). Then, a panel of 10 international retina specialists (“panelists”) reviewed a sample of DA assessments of these 3 categories that contributed to the training of the final DA model. A panelists’ majority vote on the reviewed cases was used to evaluate the accuracy, sensitivity, and specificity of the DA model.</p></div><div><h3>Main Outcome Measures</h3><p>The DA model’s performance in detecting DA compared with the DA assessments made by the investigators and panelists’ majority vote.</p></div><div><h3>Results</h3><p>A total of 4472 OCT DA assessments were used to develop the model; of these, panelists reviewed 425, categorized as “easy” (17.2%), “noisy” (20.5%), and “difficult” (62.4%). False-positive and false negative rates of the DA model’s assessments decreased after changing the assessment in some cases reviewed by the panelists and retraining the DA model. Overall, the DA model achieved 80% accuracy. For “easy” cases, the DA model reached 96% accuracy and performed as well as the investigators (96% accuracy) and panelists (90% accuracy). For “noisy” cases, the DA model performed similarly to panelists and outperformed the investigators (84%, 86%, and 16% accuracies, respectively). The DA model also outperformed the investigators for “difficult” cases (74% and 53% accuracies, respectively) but underperformed the panelists (86% accuracy) owing to lower specificity. Subretinal and intraretinal fluids were the main clinical parameters driving the DA assessments made by the panelists.</p></div><div><h3>Conclusions</h3><p>These results demonstrate the potential of using an AI-based DA model to optimize treatment decisions in the clinical setting and in detecting and monitoring DA in patients with nAMD.</p></div><div><h3>Financial Disclosure(s)</h3><p>Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.</p></div>\",\"PeriodicalId\":74363,\"journal\":{\"name\":\"Ophthalmology science\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-06-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666914524001015/pdfft?md5=65ab512316012a4b8940a3501fab3963&pid=1-s2.0-S2666914524001015-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ophthalmology science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666914524001015\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OPHTHALMOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ophthalmology science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666914524001015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
Artificial Intelligence-Based Disease Activity Monitoring to Personalized Neovascular Age-Related Macular Degeneration Treatment: A Feasibility Study
Purpose
To evaluate the performance of a disease activity (DA) model developed to detect DA in participants with neovascular age-related macular degeneration (nAMD).
Design
Post hoc analysis.
Participants
Patient dataset from the phase III HAWK and HARRIER (H&H) studies.
Methods
An artificial intelligence (AI)-based DA model was developed to generate a DA score based on measurements of OCT images and other parameters collected from H&H study participants. Disease activity assessments were classified into 3 categories based on the extent of agreement between the DA model’s scores and the H&H investigators’ decisions: agreement (“easy”), disagreement (“noisy”), and close to the decision boundary (“difficult”). Then, a panel of 10 international retina specialists (“panelists”) reviewed a sample of DA assessments of these 3 categories that contributed to the training of the final DA model. A panelists’ majority vote on the reviewed cases was used to evaluate the accuracy, sensitivity, and specificity of the DA model.
Main Outcome Measures
The DA model’s performance in detecting DA compared with the DA assessments made by the investigators and panelists’ majority vote.
Results
A total of 4472 OCT DA assessments were used to develop the model; of these, panelists reviewed 425, categorized as “easy” (17.2%), “noisy” (20.5%), and “difficult” (62.4%). False-positive and false negative rates of the DA model’s assessments decreased after changing the assessment in some cases reviewed by the panelists and retraining the DA model. Overall, the DA model achieved 80% accuracy. For “easy” cases, the DA model reached 96% accuracy and performed as well as the investigators (96% accuracy) and panelists (90% accuracy). For “noisy” cases, the DA model performed similarly to panelists and outperformed the investigators (84%, 86%, and 16% accuracies, respectively). The DA model also outperformed the investigators for “difficult” cases (74% and 53% accuracies, respectively) but underperformed the panelists (86% accuracy) owing to lower specificity. Subretinal and intraretinal fluids were the main clinical parameters driving the DA assessments made by the panelists.
Conclusions
These results demonstrate the potential of using an AI-based DA model to optimize treatment decisions in the clinical setting and in detecting and monitoring DA in patients with nAMD.
Financial Disclosure(s)
Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.