Explainable AI for enhanced accuracy in malaria diagnosis using ensemble machine learning models.

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS
Olushina Olawale Awe, Peter Njoroge Mwangi, Samuel Kotva Goudoungou, Ruth Victoria Esho, Olanrewaju Samuel Oyejide
{"title":"Explainable AI for enhanced accuracy in malaria diagnosis using ensemble machine learning models.","authors":"Olushina Olawale Awe, Peter Njoroge Mwangi, Samuel Kotva Goudoungou, Ruth Victoria Esho, Olanrewaju Samuel Oyejide","doi":"10.1186/s12911-025-02874-3","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Malaria, an infectious disease caused by protozoan parasites belonging to the Plasmodium genus, remains a significant public health challenge, with African regions bearing the heaviest burden. Machine learning techniques have shown great promise in improving the diagnosis of infectious diseases, such as malaria.</p><p><strong>Objectives: </strong>This study aims to integrate ensemble machine learning models and Explainable Artificial Intelligence (XAI) frameworks to enhance the diagnosis accuracy of malaria.</p><p><strong>Methods: </strong>The study utilized a dataset from the Federal Polytechnic Ilaro Medical Centre, Ilaro, Ogun State, Nigeria, which includes information from 337 patients aged between 3 and 77 years (180 females and 157 males) over a 4-week period. Ensemble methods, namely Random Forest, AdaBoost, Gradient Boost, XGBoost, and CatBoost, were employed after addressing class imbalance through oversampling techniques. Explainable AI techniques, such as LIME, Shapley Additive Explanations (SHAP) and Permutation Feature Importance, were utilized to enhance transparency and interpretability.</p><p><strong>Results: </strong>Among the ensemble models, Random Forest demonstrated the highest performance with an ROC AUC score of 0.869, followed closely by CatBoost at 0.787. XGBoost, Gradient Boost, and AdaBoost achieved ROC AUC scores of 0.770, 0.747, and 0.633, respectively. These methods evaluated the influence of different characteristics on the probability of malaria diagnosis, revealing critical features that contribute to prediction outcomes.</p><p><strong>Conclusion: </strong>By integrating ensemble machine learning models with explainable AI frameworks, the study promoted transparency in decision-making processes, thereby empowering healthcare providers with actionable insights for improved treatment strategies and enhanced patient outcomes, particularly in malaria management.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"162"},"PeriodicalIF":3.3000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11987329/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-02874-3","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Malaria, an infectious disease caused by protozoan parasites belonging to the Plasmodium genus, remains a significant public health challenge, with African regions bearing the heaviest burden. Machine learning techniques have shown great promise in improving the diagnosis of infectious diseases, such as malaria.

Objectives: This study aims to integrate ensemble machine learning models and Explainable Artificial Intelligence (XAI) frameworks to enhance the diagnosis accuracy of malaria.

Methods: The study utilized a dataset from the Federal Polytechnic Ilaro Medical Centre, Ilaro, Ogun State, Nigeria, which includes information from 337 patients aged between 3 and 77 years (180 females and 157 males) over a 4-week period. Ensemble methods, namely Random Forest, AdaBoost, Gradient Boost, XGBoost, and CatBoost, were employed after addressing class imbalance through oversampling techniques. Explainable AI techniques, such as LIME, Shapley Additive Explanations (SHAP) and Permutation Feature Importance, were utilized to enhance transparency and interpretability.

Results: Among the ensemble models, Random Forest demonstrated the highest performance with an ROC AUC score of 0.869, followed closely by CatBoost at 0.787. XGBoost, Gradient Boost, and AdaBoost achieved ROC AUC scores of 0.770, 0.747, and 0.633, respectively. These methods evaluated the influence of different characteristics on the probability of malaria diagnosis, revealing critical features that contribute to prediction outcomes.

Conclusion: By integrating ensemble machine learning models with explainable AI frameworks, the study promoted transparency in decision-making processes, thereby empowering healthcare providers with actionable insights for improved treatment strategies and enhanced patient outcomes, particularly in malaria management.

使用集成机器学习模型提高疟疾诊断准确性的可解释人工智能。
背景:疟疾是一种由疟原虫属原生动物寄生虫引起的传染病,仍然是一项重大的公共卫生挑战,其中非洲地区负担最重。机器学习技术在改善传染病(如疟疾)的诊断方面显示出巨大的希望。目的:本研究旨在整合集成机器学习模型和可解释人工智能(XAI)框架,以提高疟疾诊断的准确性。方法:该研究利用了尼日利亚奥贡州伊拉罗联邦理工学院伊拉罗医学中心的数据集,其中包括337名年龄在3至77岁之间的患者(180名女性和157名男性)在4周内的信息。在通过过采样技术解决类不平衡问题后,采用集成方法,即Random Forest、AdaBoost、Gradient Boost、XGBoost和CatBoost。可解释的人工智能技术,如LIME, Shapley加性解释(SHAP)和排列特征重要性,被用来提高透明度和可解释性。结果:在集成模型中,Random Forest的ROC AUC得分最高,为0.869,其次是CatBoost,为0.787。XGBoost、Gradient Boost和AdaBoost的ROC AUC得分分别为0.770、0.747和0.633。这些方法评估了不同特征对疟疾诊断概率的影响,揭示了有助于预测结果的关键特征。结论:通过将集成机器学习模型与可解释的人工智能框架相结合,该研究提高了决策过程的透明度,从而使医疗保健提供者能够提供可操作的见解,以改进治疗策略并提高患者的治疗效果,特别是在疟疾管理方面。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信