Improving the Diagnosis of Systemic Lupus Erythematosus with Machine Learning Algorithms Based on Real-World Data

IF 2.3 | Zone 3, Mathematics | JCR Q1 MATHEMATICS
Mathematics | Pub Date: 2024-09-13 | DOI: 10.3390/math12182849
Meeyoung Park
Citations: 0

Abstract

Improving the Diagnosis of Systemic Lupus Erythematosus with Machine Learning Algorithms Based on Real-World Data
This study addresses the diagnostic challenges of Systemic Lupus Erythematosus (SLE), an autoimmune disease with a complex etiology and varied symptoms. The ANA (antinuclear antibody) test, currently the primary diagnostic tool for SLE, exhibits high sensitivity but low specificity, often leading to inaccurate diagnoses. To enhance diagnostic precision, we propose integrating machine learning algorithms with existing clinical classification guidelines, potentially reducing diagnostic errors and healthcare costs. We analyzed real-world data from a cohort of 24,990 patients over a 10-year period at the hospitals, excluding those previously diagnosed with SLE. Patients were categorized into three groups: negative ANA, positive ANA with non-SLE, and positive ANA with SLE. Feature selection was conducted to identify key factors influencing SLE diagnosis, and machine learning algorithms were employed to develop a clinical decision support system (CDSS). Performance analysis of three machine learning algorithms (decision tree, random forest, and gradient boosting) based on feature sets of 10, 20, and all available features revealed accuracy rates of 70%, 88%, and 87%, respectively, for the 20-feature set. The proposed system, utilizing real-world medical data, demonstrated modest performance in SLE diagnosis, highlighting the potential of machine learning-based CDSS in real clinical settings.
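
The abstract's remark that a test with high sensitivity but low specificity often leads to inaccurate diagnoses can be made concrete with Bayes' rule. The sketch below is illustrative only: the class name `ScreeningTest` and the sensitivity, specificity, and prevalence values are assumptions chosen for the example, not figures reported in the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScreeningTest:
    """A binary diagnostic test described by its two error characteristics."""

    sensitivity: float  # P(test positive | disease present)
    specificity: float  # P(test negative | disease absent)

    def positive_predictive_value(self, prevalence: float) -> float:
        """Return P(disease present | test positive) using Bayes' rule."""
        true_pos = self.sensitivity * prevalence
        false_pos = (1.0 - self.specificity) * (1.0 - prevalence)
        return true_pos / (true_pos + false_pos)


# Hypothetical values for illustration; they are NOT taken from the paper.
ana_like = ScreeningTest(sensitivity=0.95, specificity=0.60)
ppv = ana_like.positive_predictive_value(prevalence=0.01)
print(f"Share of positive results that are true cases: {ppv:.1%}")
```

Under these assumed numbers, only a small fraction of positive results correspond to true cases, which is the kind of gap a second-stage classifier trained on additional clinical features might help narrow.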
Source journal
Mathematics
Subject area: Mathematics (General Mathematics)
CiteScore: 4.00
Self-citation rate: 16.70%
Articles published: 4032
Review time: 21.9 days
About the journal: Mathematics (ISSN 2227-7390) is an international, open access journal which provides an advanced forum for studies related to mathematical sciences. It is devoted exclusively to the publication of high-quality reviews, regular research papers and short communications in all areas of pure and applied mathematics. Mathematics also publishes timely and thorough survey articles on current trends, new theoretical techniques, novel ideas and new mathematical tools in different branches of mathematics.