Fuzzy evaluation and explainable machine learning for diagnosis of rheumatic and autoimmune diseases.

IF 2.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science Pub Date : 2025-08-11 eCollection Date: 2025-01-01 DOI:10.7717/peerj-cs.3096

Mohammed Fadhil Mahdi, Arezoo Jahani, Dhafar Hamed Abd

{"title":"Fuzzy evaluation and explainable machine learning for diagnosis of rheumatic and autoimmune diseases.","authors":"Mohammed Fadhil Mahdi, Arezoo Jahani, Dhafar Hamed Abd","doi":"10.7717/peerj-cs.3096","DOIUrl":null,"url":null,"abstract":"<p><p>In this article, a new combination of an explainable machine learning approach with a fuzzy evaluation framework is proposed to improve the diagnostic performance and interpretation of rheumatic and autoimmune diseases. This work addresses three major challenges: (i) overlapping symptoms and complex clinical presentations, (ii) the lack of interpretability in traditional machine learning models, and (iii) the difficulty of selecting the best diagnosis model. To overcome these challenges, a new dataset was collected from Iraq's hospitals and health centers between 2019 and 2024. The size of dataset is 12,085 patients and includes 14 features in seven classes (rheumatoid arthritis, reactive arthritis, ankylosing spondylitis, Sjogren syndrome, systemic lupus erythematosus, psoriatic arthritis, and normal). The dataset is subjected to extensive preprocessing with attribute imputation (mean and mode), encoding categorical features, and balancing the data to pass it to 12 different machine learning models. Performance is evaluated based on precision, recall, F-score, kappa, Hamming loss, Matthews correlation coefficient, and accuracy to identify the best model. To select the optimal model, we apply fuzzy decision by opinion score method (FDOSM). The FDOSM process involves assessments from three domain experts to ensure a robust and well-rounded evaluation. Furthermore, the explainable artificial intelligence (XAI) technique provides global and local explanations for model predictions. Local interpretable model explanations (LIME) were used as explanations and significantly increased the transparency and reliability of the clinical decision-making process. The results show that the FDOSM yields gradient boosting with a 0.1333 score and a rank of 1, is the best model with an accuracy of 86.89%, precision of 87.35%, and kappa of 84.51%. The best model using XAI to increase confidence and trustworthiness in clinical decision-making and healthcare applications.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e3096"},"PeriodicalIF":2.5000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12453786/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PeerJ Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.7717/peerj-cs.3096","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

In this article, a new combination of an explainable machine learning approach with a fuzzy evaluation framework is proposed to improve the diagnostic performance and interpretation of rheumatic and autoimmune diseases. This work addresses three major challenges: (i) overlapping symptoms and complex clinical presentations, (ii) the lack of interpretability in traditional machine learning models, and (iii) the difficulty of selecting the best diagnosis model. To overcome these challenges, a new dataset was collected from Iraq's hospitals and health centers between 2019 and 2024. The size of dataset is 12,085 patients and includes 14 features in seven classes (rheumatoid arthritis, reactive arthritis, ankylosing spondylitis, Sjogren syndrome, systemic lupus erythematosus, psoriatic arthritis, and normal). The dataset is subjected to extensive preprocessing with attribute imputation (mean and mode), encoding categorical features, and balancing the data to pass it to 12 different machine learning models. Performance is evaluated based on precision, recall, F-score, kappa, Hamming loss, Matthews correlation coefficient, and accuracy to identify the best model. To select the optimal model, we apply fuzzy decision by opinion score method (FDOSM). The FDOSM process involves assessments from three domain experts to ensure a robust and well-rounded evaluation. Furthermore, the explainable artificial intelligence (XAI) technique provides global and local explanations for model predictions. Local interpretable model explanations (LIME) were used as explanations and significantly increased the transparency and reliability of the clinical decision-making process. The results show that the FDOSM yields gradient boosting with a 0.1333 score and a rank of 1, is the best model with an accuracy of 86.89%, precision of 87.35%, and kappa of 84.51%. The best model using XAI to increase confidence and trustworthiness in clinical decision-making and healthcare applications.

查看原文本刊更多论文

风湿病和自身免疫性疾病诊断的模糊评价和可解释的机器学习。

在本文中，提出了一种可解释的机器学习方法与模糊评估框架的新组合，以提高风湿病和自身免疫性疾病的诊断性能和解释。这项工作解决了三个主要挑战：(i)重叠的症状和复杂的临床表现，（ii）传统机器学习模型缺乏可解释性，以及（iii）选择最佳诊断模型的困难。为了克服这些挑战，研究人员在2019年至2024年间从伊拉克的医院和卫生中心收集了一个新的数据集。数据集的大小为12085例患者，包括7类14个特征（类风湿关节炎、反应性关节炎、强直性脊柱炎、干燥综合征、系统性红斑狼疮、银屑病关节炎和正常）。数据集经过属性输入（均值和模式）的广泛预处理，编码分类特征，并平衡数据，将其传递给12个不同的机器学习模型。性能评估基于精度，召回率，f分数，kappa，汉明损失，马修斯相关系数和准确性，以确定最佳模型。为了选择最优模型，我们采用了模糊决策的意见评分法（FDOSM）。FDOSM过程包括三位领域专家的评估，以确保可靠和全面的评估。此外，可解释人工智能（XAI）技术为模型预测提供了全局和局部解释。采用局部可解释模型解释（LIME）作为解释，显著提高了临床决策过程的透明度和可靠性。结果表明，FDOSM的梯度增益为0.1333分，秩为1，是最佳模型，准确率为86.89%，精密度为87.35%，kappa为84.51%。使用XAI提高临床决策和医疗保健应用的信心和可信度的最佳模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

PeerJ Computer Science Computer Science-General Computer Science

CiteScore

6.10

自引率

5.30%

发文量

332

审稿时长

10 weeks

期刊介绍： PeerJ Computer Science is the new open access journal covering all subject areas in computer science, with the backing of a prestigious advisory board and more than 300 academic editors.