探索应用深度学习进行关节炎早期预测的可行性

Jiaxuan Chen, Xiangxuan Kong
{"title":"探索应用深度学习进行关节炎早期预测的可行性","authors":"Jiaxuan Chen, Xiangxuan Kong","doi":"10.26685/urncst.562","DOIUrl":null,"url":null,"abstract":"Introduction: Arthritis is one of the most common chronic diseases. Early detection of arthritis and its progression can facilitate early intervention measures, lowering disease severity in patients. As electronic health records (EHR) become more accessible, this study assesses whether general health information and arthritis-related questionnaires can be used in arthritis diagnosis, without the involvement of costly imaging methods. Therefore, we created deep learning (DL) and machine learning (ML) models to explore the feasibility of combining EHR and modern computational tools to diagnose arthritis. Methods: A total of 782 arthritis patients and 4014 control patients were identified from the Osteoarthritis Initiative (OAI) – a ten-year-long observational study that included patient EHR in five time points. Six hundred variables were filtered by random forest classifier followed by manual filtering. Data were split properly to training, testing and validation set, and the training set was balanced. Sequential, nonsequential DL models, and five independent DL models for each time points were used. The accuracy, positive prevalence value (PPV), negative prevalence value (NPV), and area under curve (AUC), were assessed and compared with four classical ML models. SHAP (SHapley Additive exPlanations) summary analysis was also conducted. Results: Sequential and non-sequential deep learning models showed accuracies of ~ 0.97, and the four classical machine learning approaches showed accuracies of above 0.9. High positive and negative predicted values (> 0.90) for all of the models suggested the potential clinical applicability of the model, while the SHAP analysis demonstrated its interpretability. Discussion: We tested various models and showed the ability to use machine learning methods for early diagnosis of arthritis with EHR. The models can be used as a screening tool to select susceptible patients for confirmatory tests such as X-ray and MRI. Identification of early disease states could facilitate protective measures that slow disease progression.","PeriodicalId":245521,"journal":{"name":"Undergraduate Research in Natural and Clinical Science and Technology (URNCST) Journal","volume":"58 9","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring the Feasibility of Applying Deep Learning for the Early Prediction of Arthritis\",\"authors\":\"Jiaxuan Chen, Xiangxuan Kong\",\"doi\":\"10.26685/urncst.562\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: Arthritis is one of the most common chronic diseases. Early detection of arthritis and its progression can facilitate early intervention measures, lowering disease severity in patients. As electronic health records (EHR) become more accessible, this study assesses whether general health information and arthritis-related questionnaires can be used in arthritis diagnosis, without the involvement of costly imaging methods. Therefore, we created deep learning (DL) and machine learning (ML) models to explore the feasibility of combining EHR and modern computational tools to diagnose arthritis. Methods: A total of 782 arthritis patients and 4014 control patients were identified from the Osteoarthritis Initiative (OAI) – a ten-year-long observational study that included patient EHR in five time points. Six hundred variables were filtered by random forest classifier followed by manual filtering. Data were split properly to training, testing and validation set, and the training set was balanced. Sequential, nonsequential DL models, and five independent DL models for each time points were used. The accuracy, positive prevalence value (PPV), negative prevalence value (NPV), and area under curve (AUC), were assessed and compared with four classical ML models. SHAP (SHapley Additive exPlanations) summary analysis was also conducted. Results: Sequential and non-sequential deep learning models showed accuracies of ~ 0.97, and the four classical machine learning approaches showed accuracies of above 0.9. High positive and negative predicted values (> 0.90) for all of the models suggested the potential clinical applicability of the model, while the SHAP analysis demonstrated its interpretability. Discussion: We tested various models and showed the ability to use machine learning methods for early diagnosis of arthritis with EHR. The models can be used as a screening tool to select susceptible patients for confirmatory tests such as X-ray and MRI. Identification of early disease states could facilitate protective measures that slow disease progression.\",\"PeriodicalId\":245521,\"journal\":{\"name\":\"Undergraduate Research in Natural and Clinical Science and Technology (URNCST) Journal\",\"volume\":\"58 9\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Undergraduate Research in Natural and Clinical Science and Technology (URNCST) Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.26685/urncst.562\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Undergraduate Research in Natural and Clinical Science and Technology (URNCST) Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26685/urncst.562","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

导言关节炎是最常见的慢性疾病之一。早期发现关节炎及其进展可促进早期干预措施,降低患者的疾病严重程度。随着电子健康记录(EHR)的普及,本研究评估了一般健康信息和关节炎相关问卷是否可用于关节炎诊断,而无需使用昂贵的成像方法。因此,我们创建了深度学习(DL)和机器学习(ML)模型,以探索结合电子病历和现代计算工具诊断关节炎的可行性。方法骨关节炎倡议(OAI)是一项长达十年的观察性研究,其中包括五个时间点的患者电子病历,我们从这项研究中确定了 782 名关节炎患者和 4014 名对照组患者。通过随机森林分类器过滤了 600 个变量,然后进行了人工过滤。数据被适当分割为训练集、测试集和验证集,训练集是平衡的。使用了序列、非序列 DL 模型以及每个时间点的五个独立 DL 模型。评估了准确率、阳性流行值(PPV)、阴性流行值(NPV)和曲线下面积(AUC),并与四种经典的 ML 模型进行了比较。此外,还进行了 SHAP(SHapley Additive exPlanations)汇总分析。结果序列和非序列深度学习模型的准确度约为 0.97,而四种经典机器学习方法的准确度高于 0.9。所有模型的正负预测值都很高(> 0.90),这表明该模型具有潜在的临床适用性,而 SHAP 分析则证明了其可解释性。讨论:我们对各种模型进行了测试,结果表明,利用机器学习方法可以通过电子病历对关节炎进行早期诊断。这些模型可用作筛选工具,选择易感患者进行 X 光和核磁共振成像等确诊测试。早期疾病状态的识别有助于采取保护措施,减缓疾病的进展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Exploring the Feasibility of Applying Deep Learning for the Early Prediction of Arthritis
Introduction: Arthritis is one of the most common chronic diseases. Early detection of arthritis and its progression can facilitate early intervention measures, lowering disease severity in patients. As electronic health records (EHR) become more accessible, this study assesses whether general health information and arthritis-related questionnaires can be used in arthritis diagnosis, without the involvement of costly imaging methods. Therefore, we created deep learning (DL) and machine learning (ML) models to explore the feasibility of combining EHR and modern computational tools to diagnose arthritis. Methods: A total of 782 arthritis patients and 4014 control patients were identified from the Osteoarthritis Initiative (OAI) – a ten-year-long observational study that included patient EHR in five time points. Six hundred variables were filtered by random forest classifier followed by manual filtering. Data were split properly to training, testing and validation set, and the training set was balanced. Sequential, nonsequential DL models, and five independent DL models for each time points were used. The accuracy, positive prevalence value (PPV), negative prevalence value (NPV), and area under curve (AUC), were assessed and compared with four classical ML models. SHAP (SHapley Additive exPlanations) summary analysis was also conducted. Results: Sequential and non-sequential deep learning models showed accuracies of ~ 0.97, and the four classical machine learning approaches showed accuracies of above 0.9. High positive and negative predicted values (> 0.90) for all of the models suggested the potential clinical applicability of the model, while the SHAP analysis demonstrated its interpretability. Discussion: We tested various models and showed the ability to use machine learning methods for early diagnosis of arthritis with EHR. The models can be used as a screening tool to select susceptible patients for confirmatory tests such as X-ray and MRI. Identification of early disease states could facilitate protective measures that slow disease progression.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
0.10
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信