Developing an Explainable Artificial Intelligence System for the Mobile-Based Diagnosis of Febrile Diseases Using Random Forest, LIME, and GPT.

IF 2.3 Q3 MEDICAL INFORMATICS
Healthcare Informatics Research Pub Date : 2025-04-01 Epub Date: 2025-04-30 DOI:10.4258/hir.2025.31.2.125
Kingsley F Attai, Constance Amannah, Moses Ekpenyong, Daniel E Asuquo, Oryina K Akputu, Okure U Obot, Peterben C Ajuga, Jeremiah C Obi, Omosivie Maduka, Christie Akwaowo, Faith-Michael Uzoka
{"title":"Developing an Explainable Artificial Intelligence System for the Mobile-Based Diagnosis of Febrile Diseases Using Random Forest, LIME, and GPT.","authors":"Kingsley F Attai, Constance Amannah, Moses Ekpenyong, Daniel E Asuquo, Oryina K Akputu, Okure U Obot, Peterben C Ajuga, Jeremiah C Obi, Omosivie Maduka, Christie Akwaowo, Faith-Michael Uzoka","doi":"10.4258/hir.2025.31.2.125","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>This study proposes a mobile-based explainable artificial intelligence (XAI) platform designed for diagnosing febrile illnesses.</p><p><strong>Methods: </strong>We integrated the interpretability offered by local interpretable model-agnostic explanations (LIME) and the explainability provided by generative pre-trained transformers (GPT) to bridge the gap in understanding and trust often created by machine learning models in critical healthcare decision-making. The developed system employed random forest for disease diagnosis, LIME for interpretation of the results, and GPT-3.5 for generating explanations in easy-to-understand language.</p><p><strong>Results: </strong>Our model demonstrated robust performance in detecting malaria, achieving precision, recall, and F1-scores of 85%, 91%, and 88%, respectively. It performed moderately well in detecting urinary tract and respiratory tract infections, with precision, recall, and F1-scores of 80%, 65%, and 72%, and 77%, 68%, and 72%, respectively, maintaining an effective balance between sensitivity and specificity. However, the model exhibited limitations in detecting typhoid fever and human immunodeficiency virus/acquired immune deficiency syndrome, achieving lower precision, recall, and F1-scores of 69%, 53%, and 60%, and 75%, 39%, and 51%, respectively. These results indicate missed true-positive cases, necessitating further model fine-tuning. LIME and GPT-3.5 were integrated to enhance transparency and provide natural language explanations, thereby aiding decision-making and improving user comprehension of the diagnoses.</p><p><strong>Conclusions: </strong>The LIME plots revealed key symptoms influencing the diagnoses, with bitter taste in the mouth and fever showing the highest negative influence on predictions, and GPT-3.5 provided natural language explanations that increased the reliability and trustworthiness of the system, promoting improved patient outcomes and reducing the healthcare burden.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"125-135"},"PeriodicalIF":2.3000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086442/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare Informatics Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4258/hir.2025.31.2.125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/30 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives: This study proposes a mobile-based explainable artificial intelligence (XAI) platform designed for diagnosing febrile illnesses.

Methods: We integrated the interpretability offered by local interpretable model-agnostic explanations (LIME) and the explainability provided by generative pre-trained transformers (GPT) to bridge the gap in understanding and trust often created by machine learning models in critical healthcare decision-making. The developed system employed random forest for disease diagnosis, LIME for interpretation of the results, and GPT-3.5 for generating explanations in easy-to-understand language.

Results: Our model demonstrated robust performance in detecting malaria, achieving precision, recall, and F1-scores of 85%, 91%, and 88%, respectively. It performed moderately well in detecting urinary tract and respiratory tract infections, with precision, recall, and F1-scores of 80%, 65%, and 72%, and 77%, 68%, and 72%, respectively, maintaining an effective balance between sensitivity and specificity. However, the model exhibited limitations in detecting typhoid fever and human immunodeficiency virus/acquired immune deficiency syndrome, achieving lower precision, recall, and F1-scores of 69%, 53%, and 60%, and 75%, 39%, and 51%, respectively. These results indicate missed true-positive cases, necessitating further model fine-tuning. LIME and GPT-3.5 were integrated to enhance transparency and provide natural language explanations, thereby aiding decision-making and improving user comprehension of the diagnoses.

Conclusions: The LIME plots revealed key symptoms influencing the diagnoses, with bitter taste in the mouth and fever showing the highest negative influence on predictions, and GPT-3.5 provided natural language explanations that increased the reliability and trustworthiness of the system, promoting improved patient outcomes and reducing the healthcare burden.

利用随机森林、LIME和GPT开发可解释的温病移动诊断人工智能系统。
目的:本研究提出了一个基于移动的可解释人工智能(XAI)平台,用于诊断发热性疾病。方法:我们整合了局部可解释模型不可知论解释(LIME)提供的可解释性和生成式预训练变形器(GPT)提供的可解释性,以弥合机器学习模型在关键医疗保健决策中经常产生的理解和信任差距。开发的系统采用随机森林进行疾病诊断,LIME用于解释结果,GPT-3.5用于生成易于理解的语言解释。结果:我们的模型在检测疟疾方面表现出稳健的性能,分别达到85%、91%和88%的准确率、召回率和f1得分。该方法在尿路和呼吸道感染的检测中表现较好,准确率、召回率和f1评分分别为80%、65%和72%,77%、68%和72%,保持了敏感性和特异性之间的有效平衡。然而,该模型在检测伤寒和人类免疫缺陷病毒/获得性免疫缺陷综合征方面存在局限性,准确率、召回率和f1评分分别较低,分别为69%、53%和60%,75%、39%和51%。这些结果表明遗漏了真阳性病例,需要进一步的模型微调。LIME和GPT-3.5被整合,以提高透明度和提供自然语言解释,从而帮助决策和提高用户对诊断的理解。结论:LIME图揭示了影响诊断的关键症状,其中口腔苦味和发烧对预测的负面影响最大,GPT-3.5提供的自然语言解释提高了系统的可靠性和可信度,促进了患者预后的改善,减轻了医疗负担。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Healthcare Informatics Research
Healthcare Informatics Research MEDICAL INFORMATICS-
CiteScore
4.90
自引率
6.90%
发文量
44
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信