{"title":"利用基于真实世界数据的机器学习算法改进系统性红斑狼疮的诊断","authors":"Meeyoung Park","doi":"10.3390/math12182849","DOIUrl":null,"url":null,"abstract":"This study addresses the diagnostic challenges of Systemic Lupus Erythematosus (SLE), an autoimmune disease with a complex etiology and varied symptoms. The ANA (antinuclear antibody) test, currently the primary diagnostic tool for SLE, exhibits high sensitivity but low specificity, often leading to inaccurate diagnoses. To enhance diagnostic precision, we propose integrating machine learning algorithms with existing clinical classification guidelines to improve SLE diagnosis accuracy, potentially reducing diagnostic errors and healthcare costs. We analyzed real-world data from a cohort of 24,990 patients over a 10-year period at the hospitals, excluding those previously diagnosed with SLE. Patients were categorized into three groups: negative ANA, positive ANA with non-SLE, and positive ANA with SLE. Feature selection was conducted to identify key factors influencing SLE diagnosis, and machine learning algorithms were employed to develop the CDSS. Performance analysis of three machine learning algorithms—decision tree, random forest, and gradient boosting—based on feature sets of 10, 20, and all available features revealed accuracy rates of 70%, 88%, and 87%, respectively, for the 20-feature set. The proposed system, utilizing real-world medical data, demonstrated modest performance in SLE diagnosis, highlighting the potential of machine learning-based CDSS in real clinical settings.","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving the Diagnosis of Systemic Lupus Erythematosus with Machine Learning Algorithms Based on Real-World Data\",\"authors\":\"Meeyoung Park\",\"doi\":\"10.3390/math12182849\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study addresses the diagnostic challenges of Systemic Lupus Erythematosus (SLE), an autoimmune disease with a complex etiology and varied symptoms. The ANA (antinuclear antibody) test, currently the primary diagnostic tool for SLE, exhibits high sensitivity but low specificity, often leading to inaccurate diagnoses. To enhance diagnostic precision, we propose integrating machine learning algorithms with existing clinical classification guidelines to improve SLE diagnosis accuracy, potentially reducing diagnostic errors and healthcare costs. We analyzed real-world data from a cohort of 24,990 patients over a 10-year period at the hospitals, excluding those previously diagnosed with SLE. Patients were categorized into three groups: negative ANA, positive ANA with non-SLE, and positive ANA with SLE. Feature selection was conducted to identify key factors influencing SLE diagnosis, and machine learning algorithms were employed to develop the CDSS. Performance analysis of three machine learning algorithms—decision tree, random forest, and gradient boosting—based on feature sets of 10, 20, and all available features revealed accuracy rates of 70%, 88%, and 87%, respectively, for the 20-feature set. The proposed system, utilizing real-world medical data, demonstrated modest performance in SLE diagnosis, highlighting the potential of machine learning-based CDSS in real clinical settings.\",\"PeriodicalId\":2,\"journal\":{\"name\":\"ACS Applied Bio Materials\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2024-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Applied Bio Materials\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.3390/math12182849\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, BIOMATERIALS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.3390/math12182849","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0
摘要
系统性红斑狼疮(SLE)是一种病因复杂、症状多样的自身免疫性疾病。ANA(抗核抗体)检测是目前系统性红斑狼疮的主要诊断工具,但其灵敏度高而特异性低,常常导致诊断不准确。为了提高诊断的准确性,我们建议将机器学习算法与现有的临床分类指南相结合,以提高系统性红斑狼疮诊断的准确性,从而减少诊断错误和医疗成本。我们分析了各家医院 10 年间 24990 名患者的真实世界数据,其中不包括之前被诊断为系统性红斑狼疮的患者。患者被分为三组:ANA 阴性、非系统性红斑狼疮 ANA 阳性和系统性红斑狼疮 ANA 阳性。通过特征选择来确定影响系统性红斑狼疮诊断的关键因素,并采用机器学习算法来开发 CDSS。对基于 10、20 和所有可用特征集的三种机器学习算法(决策树、随机森林和梯度提升)进行的性能分析表明,20 个特征集的准确率分别为 70%、88% 和 87%。所提出的系统利用真实世界的医疗数据,在系统性红斑狼疮诊断中表现出了适度的性能,凸显了基于机器学习的 CDSS 在实际临床环境中的潜力。
Improving the Diagnosis of Systemic Lupus Erythematosus with Machine Learning Algorithms Based on Real-World Data
This study addresses the diagnostic challenges of Systemic Lupus Erythematosus (SLE), an autoimmune disease with a complex etiology and varied symptoms. The ANA (antinuclear antibody) test, currently the primary diagnostic tool for SLE, exhibits high sensitivity but low specificity, often leading to inaccurate diagnoses. To enhance diagnostic precision, we propose integrating machine learning algorithms with existing clinical classification guidelines to improve SLE diagnosis accuracy, potentially reducing diagnostic errors and healthcare costs. We analyzed real-world data from a cohort of 24,990 patients over a 10-year period at the hospitals, excluding those previously diagnosed with SLE. Patients were categorized into three groups: negative ANA, positive ANA with non-SLE, and positive ANA with SLE. Feature selection was conducted to identify key factors influencing SLE diagnosis, and machine learning algorithms were employed to develop the CDSS. Performance analysis of three machine learning algorithms—decision tree, random forest, and gradient boosting—based on feature sets of 10, 20, and all available features revealed accuracy rates of 70%, 88%, and 87%, respectively, for the 20-feature set. The proposed system, utilizing real-world medical data, demonstrated modest performance in SLE diagnosis, highlighting the potential of machine learning-based CDSS in real clinical settings.