Vision Impairment prediction for patients diagnosed with Multiple Sclerosis: Cosmos based model training and evaluation

Brandon Buxton, Amr Hassan, Nevin Shalaby, John W Lindsey, John Lincoln, Elmer Bernstam, Wagida Anwar, Degui Zhi, Laila Rasmy
{"title":"Vision Impairment prediction for patients diagnosed with Multiple Sclerosis: Cosmos based model training and evaluation","authors":"Brandon Buxton, Amr Hassan, Nevin Shalaby, John W Lindsey, John Lincoln, Elmer Bernstam, Wagida Anwar, Degui Zhi, Laila Rasmy","doi":"10.1101/2023.11.10.23298366","DOIUrl":null,"url":null,"abstract":"Objectives Multiple sclerosis (MS) is a complex autoimmune neurological disorder that frequently impacts vision. One of the most frequent initial presentations of MS is acute vision loss due to optic neuritis, an acute disorder caused by MS involvement with the optic nerve. While vision impairment is often the first sign of MS, it can occur or recur at any time during the patient's course. In this study, we aim to develop and evaluate machine learning models to predict vision impairment in patients with MS, both at the time of first MS diagnosis and throughout their course of care. Early awareness and intervention in patients likely to have vision loss can help preserve patient quality of life. Materials and Methods Using the Epic Cosmos de-identified electronic health record (EHR) dataset, we queried 213+ million patients to extract our MS cohort. Cases were defined as MS patients with vision impairment or optic neuritis (VI) following their first MS diagnosis, while controls were MS patients without VI. We trained logistic regression (LR), light gradient boosting machine (LGBM), and recurrent neural network (RNN) models to predict future VI in MS patients. The models were evaluated for two distinct clinical tasks: prediction of VI at the time of the first MS diagnosis and prediction of VI at the most recent visit. Similarly, we trained the models on different segments of the patient trajectory including up until the first MS diagnosis (MS-First Diagnosis), or until the most recent visit before developing the outcome (MS-Progress) as well as the combination of both (MS-General). Finally, we trained a survival model with the goal of predicting patient likelihood of vision loss over time. We compared the models' performance using AUROC, AUPRC, and Brier scores. Results We extracted a cohort of 377,097 patients with MS, including 42,281 VI cases. Our trained models achieved ~80% AUROC, with RNN-based models outperforming LGBM and LR (79.6% vs 72.8% and 68.6%, respectively) when considering the full patient trajectory. The MS-General RNN model had the highest AUROC (64.4%) for predicting VI at the first MS diagnosis. The MS-Progress survival model achieved a 75% concordance index on the full trajectory, while the more clinically relevant MS-First Diagnosis model achieved 63.1% at initial diagnosis. Discussion and Conclusion The MS-Progress and MS-General RNN models performed best in both prediction scenarios. While MS-General achieved the best performance at the time of first MS diagnosis with around 1% AUROC increase compared to the MS-First Diagnosis model, it showed around 1% AUROC decrease on the MS progress scenario. All RNN survival models performed the best when they were trained on data corresponding to the evaluation use-case scenarios. RNN based models showed promising performance that demonstrates that they can be useful clinical tools to predict risk of future VI events in patients with MS. Further development of these models will focus on expanding to predict other comorbidities associated with MS relapse or progression.","PeriodicalId":478577,"journal":{"name":"medRxiv (Cold Spring Harbor Laboratory)","volume":"20 12","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv (Cold Spring Harbor Laboratory)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2023.11.10.23298366","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives Multiple sclerosis (MS) is a complex autoimmune neurological disorder that frequently impacts vision. One of the most frequent initial presentations of MS is acute vision loss due to optic neuritis, an acute disorder caused by MS involvement with the optic nerve. While vision impairment is often the first sign of MS, it can occur or recur at any time during the patient's course. In this study, we aim to develop and evaluate machine learning models to predict vision impairment in patients with MS, both at the time of first MS diagnosis and throughout their course of care. Early awareness and intervention in patients likely to have vision loss can help preserve patient quality of life. Materials and Methods Using the Epic Cosmos de-identified electronic health record (EHR) dataset, we queried 213+ million patients to extract our MS cohort. Cases were defined as MS patients with vision impairment or optic neuritis (VI) following their first MS diagnosis, while controls were MS patients without VI. We trained logistic regression (LR), light gradient boosting machine (LGBM), and recurrent neural network (RNN) models to predict future VI in MS patients. The models were evaluated for two distinct clinical tasks: prediction of VI at the time of the first MS diagnosis and prediction of VI at the most recent visit. Similarly, we trained the models on different segments of the patient trajectory including up until the first MS diagnosis (MS-First Diagnosis), or until the most recent visit before developing the outcome (MS-Progress) as well as the combination of both (MS-General). Finally, we trained a survival model with the goal of predicting patient likelihood of vision loss over time. We compared the models' performance using AUROC, AUPRC, and Brier scores. Results We extracted a cohort of 377,097 patients with MS, including 42,281 VI cases. Our trained models achieved ~80% AUROC, with RNN-based models outperforming LGBM and LR (79.6% vs 72.8% and 68.6%, respectively) when considering the full patient trajectory. The MS-General RNN model had the highest AUROC (64.4%) for predicting VI at the first MS diagnosis. The MS-Progress survival model achieved a 75% concordance index on the full trajectory, while the more clinically relevant MS-First Diagnosis model achieved 63.1% at initial diagnosis. Discussion and Conclusion The MS-Progress and MS-General RNN models performed best in both prediction scenarios. While MS-General achieved the best performance at the time of first MS diagnosis with around 1% AUROC increase compared to the MS-First Diagnosis model, it showed around 1% AUROC decrease on the MS progress scenario. All RNN survival models performed the best when they were trained on data corresponding to the evaluation use-case scenarios. RNN based models showed promising performance that demonstrates that they can be useful clinical tools to predict risk of future VI events in patients with MS. Further development of these models will focus on expanding to predict other comorbidities associated with MS relapse or progression.
多发性硬化症患者的视力损害预测:基于Cosmos的模型训练和评估
多发性硬化症(MS)是一种复杂的自身免疫性神经系统疾病,经常影响视力。多发性硬化症最常见的最初表现之一是视神经炎引起的急性视力丧失,这是一种由多发性硬化症累及视神经引起的急性疾病。虽然视力障碍通常是多发性硬化症的第一个症状,但它可以在患者病程中的任何时间发生或复发。在这项研究中,我们的目标是开发和评估机器学习模型,以预测多发性硬化症患者的视力障碍,无论是在首次诊断多发性硬化症时还是在整个治疗过程中。早期认识和干预可能有视力丧失的患者可以帮助保持患者的生活质量。材料和方法使用Epic Cosmos去识别电子健康记录(EHR)数据集,我们查询了2.13亿多名患者来提取我们的MS队列。病例定义为首次诊断为MS后伴有视力障碍或视神经炎(VI)的MS患者,对照组为无VI的MS患者。我们训练了逻辑回归(LR)、光梯度增强机(LGBM)和递归神经网络(RNN)模型来预测MS患者未来的VI。对这些模型进行了两项不同的临床任务评估:在第一次MS诊断时预测VI和在最近一次就诊时预测VI。同样,我们在患者轨迹的不同阶段训练模型,包括直到第一次多发性硬化症诊断(MS- first diagnosis),或直到最近一次就诊才得出结果(MS- progress),以及两者的结合(MS- general)。最后,我们训练了一个生存模型,目标是预测患者视力丧失的可能性。我们使用AUROC、AUPRC和Brier评分来比较模型的性能。结果我们提取了377,097例MS患者,包括42,281例VI病例。我们训练的模型达到了约80%的AUROC,在考虑整个患者轨迹时,基于rnn的模型优于LGBM和LR(分别为79.6%比72.8%和68.6%)。MS- general RNN模型在首次MS诊断时预测VI的AUROC最高(64.4%)。MS-Progress生存模型在全轨迹上的一致性指数为75%,而更具临床相关性的MS-First诊断模型在初始诊断时的一致性指数为63.1%。讨论与结论MS-Progress和MS-General RNN模型在两种预测情景下均表现最佳。与MS- first诊断模型相比,MS- general在首次诊断MS时的AUROC提高了约1%,而在MS进展情况下,其AUROC降低了约1%。当所有RNN生存模型在与评估用例场景相对应的数据上进行训练时,它们表现最好。基于RNN的模型显示出良好的性能,这表明它们可以成为预测MS患者未来VI事件风险的有用临床工具,这些模型的进一步发展将集中于扩展到预测与MS复发或进展相关的其他合并症。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信