Developing an Explainable Artificial Intelligence System for the Mobile-Based Diagnosis of Febrile Diseases Using Random Forest, LIME, and GPT.

IF 2.1 Q3 MEDICAL INFORMATICS

Healthcare Informatics Research Pub Date : 2025-04-01 Epub Date: 2025-04-30 DOI:10.4258/hir.2025.31.2.125

Kingsley F Attai, Constance Amannah, Moses Ekpenyong, Daniel E Asuquo, Oryina K Akputu, Okure U Obot, Peterben C Ajuga, Jeremiah C Obi, Omosivie Maduka, Christie Akwaowo, Faith-Michael Uzoka

{"title":"Developing an Explainable Artificial Intelligence System for the Mobile-Based Diagnosis of Febrile Diseases Using Random Forest, LIME, and GPT.","authors":"Kingsley F Attai, Constance Amannah, Moses Ekpenyong, Daniel E Asuquo, Oryina K Akputu, Okure U Obot, Peterben C Ajuga, Jeremiah C Obi, Omosivie Maduka, Christie Akwaowo, Faith-Michael Uzoka","doi":"10.4258/hir.2025.31.2.125","DOIUrl":null,"url":null,"abstract":"Objectives: This study proposes a mobile-based explainable artificial intelligence (XAI) platform designed for diagnosing febrile illnesses.Methods: We integrated the interpretability offered by local interpretable model-agnostic explanations (LIME) and the explainability provided by generative pre-trained transformers (GPT) to bridge the gap in understanding and trust often created by machine learning models in critical healthcare decision-making. The developed system employed random forest for disease diagnosis, LIME for interpretation of the results, and GPT-3.5 for generating explanations in easy-to-understand language.Results: Our model demonstrated robust performance in detecting malaria, achieving precision, recall, and F1-scores of 85%, 91%, and 88%, respectively. It performed moderately well in detecting urinary tract and respiratory tract infections, with precision, recall, and F1-scores of 80%, 65%, and 72%, and 77%, 68%, and 72%, respectively, maintaining an effective balance between sensitivity and specificity. However, the model exhibited limitations in detecting typhoid fever and human immunodeficiency virus/acquired immune deficiency syndrome, achieving lower precision, recall, and F1-scores of 69%, 53%, and 60%, and 75%, 39%, and 51%, respectively. These results indicate missed true-positive cases, necessitating further model fine-tuning. LIME and GPT-3.5 were integrated to enhance transparency and provide natural language explanations, thereby aiding decision-making and improving user comprehension of the diagnoses.Conclusions: The LIME plots revealed key symptoms influencing the diagnoses, with bitter taste in the mouth and fever showing the highest negative influence on predictions, and GPT-3.5 provided natural language explanations that increased the reliability and trustworthiness of the system, promoting improved patient outcomes and reducing the healthcare burden.","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":"31 2","pages":"125-135"},"PeriodicalIF":2.1000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086442/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare Informatics Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4258/hir.2025.31.2.125","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/30 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

Abstract

Objectives: This study proposes a mobile-based explainable artificial intelligence (XAI) platform designed for diagnosing febrile illnesses.

Methods: We integrated the interpretability offered by local interpretable model-agnostic explanations (LIME) and the explainability provided by generative pre-trained transformers (GPT) to bridge the gap in understanding and trust often created by machine learning models in critical healthcare decision-making. The developed system employed random forest for disease diagnosis, LIME for interpretation of the results, and GPT-3.5 for generating explanations in easy-to-understand language.

Results: Our model demonstrated robust performance in detecting malaria, achieving precision, recall, and F1-scores of 85%, 91%, and 88%, respectively. It performed moderately well in detecting urinary tract and respiratory tract infections, with precision, recall, and F1-scores of 80%, 65%, and 72%, and 77%, 68%, and 72%, respectively, maintaining an effective balance between sensitivity and specificity. However, the model exhibited limitations in detecting typhoid fever and human immunodeficiency virus/acquired immune deficiency syndrome, achieving lower precision, recall, and F1-scores of 69%, 53%, and 60%, and 75%, 39%, and 51%, respectively. These results indicate missed true-positive cases, necessitating further model fine-tuning. LIME and GPT-3.5 were integrated to enhance transparency and provide natural language explanations, thereby aiding decision-making and improving user comprehension of the diagnoses.

Conclusions: The LIME plots revealed key symptoms influencing the diagnoses, with bitter taste in the mouth and fever showing the highest negative influence on predictions, and GPT-3.5 provided natural language explanations that increased the reliability and trustworthiness of the system, promoting improved patient outcomes and reducing the healthcare burden.

Abstract Image

查看原文本刊更多论文

利用随机森林、LIME和GPT开发可解释的温病移动诊断人工智能系统。

目的：本研究提出了一个基于移动的可解释人工智能（XAI）平台，用于诊断发热性疾病。方法：我们整合了局部可解释模型不可知论解释（LIME）提供的可解释性和生成式预训练变形器（GPT）提供的可解释性，以弥合机器学习模型在关键医疗保健决策中经常产生的理解和信任差距。开发的系统采用随机森林进行疾病诊断，LIME用于解释结果，GPT-3.5用于生成易于理解的语言解释。结果：我们的模型在检测疟疾方面表现出稳健的性能，分别达到85%、91%和88%的准确率、召回率和f1得分。该方法在尿路和呼吸道感染的检测中表现较好，准确率、召回率和f1评分分别为80%、65%和72%，77%、68%和72%，保持了敏感性和特异性之间的有效平衡。然而，该模型在检测伤寒和人类免疫缺陷病毒/获得性免疫缺陷综合征方面存在局限性，准确率、召回率和f1评分分别较低，分别为69%、53%和60%，75%、39%和51%。这些结果表明遗漏了真阳性病例，需要进一步的模型微调。LIME和GPT-3.5被整合，以提高透明度和提供自然语言解释，从而帮助决策和提高用户对诊断的理解。结论：LIME图揭示了影响诊断的关键症状，其中口腔苦味和发烧对预测的负面影响最大，GPT-3.5提供的自然语言解释提高了系统的可靠性和可信度，促进了患者预后的改善，减轻了医疗负担。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Healthcare Informatics Research MEDICAL INFORMATICS-

CiteScore

4.90

自引率

6.90%

发文量