Machine learning and explainable artificial intelligence to predict and interpret lead toxicity in pregnant women and unborn baby
Priyanka Chaurasia, Pratheepan Yogarajah, Abbas Ali Mahdi, Sally McClean, Mohammad Kaleem Ahmad, Tabrez Jafar, Sanjay Kumar Singh
Frontiers in Digital Health, 7:1608949 (published 2025-05-30). DOI: 10.3389/fdgth.2025.1608949. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12162601/pdf/
Abstract
Introduction: Lead toxicity is a well-recognised environmental health issue, with prenatal exposure posing significant risks to infants. A major pathway of infant exposure is maternal lead transfer during pregnancy. Therefore, accurately characterising maternal lead levels is critical for enabling targeted and personalised healthcare interventions. Current detection methods for lead poisoning rely on laboratory blood tests, which are not feasible for wide-population screening due to cost, accessibility, and logistical constraints. To address this limitation, our previous research proposed a novel machine learning (ML)-based model that predicts lead exposure levels in pregnant women using sociodemographic data alone. However, for such predictive models to gain broader acceptance, especially in clinical and public health settings, transparency and interpretability are essential.
Methods: Understanding the reasoning behind the predictions of the model is crucial to building trust and facilitating informed decision-making. In this study, we present the first application of an explainable artificial intelligence (XAI) framework to interpret predictions made by our ML-based lead exposure model.
Results: A Random Forest classifier was trained on a dataset of 200 blood samples described by 12 sociodemographic features, achieving an accuracy of 84.52%.
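
As a rough illustration of this setup (not the authors' code), the sketch below trains a Random Forest on a synthetic stand-in for the dataset: 200 rows, 12 numeric features, and a binary exposure label, all placeholder values, evaluated with 5-fold cross-validation in scikit-learn. The actual sociodemographic features and labels used in the study are not reproduced here.

```python
# Minimal sketch, assuming a tabular binary-classification setup similar to the
# one described in the abstract; data values are synthetic placeholders.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(42)
X = rng.normal(size=(200, 12))        # 200 samples, 12 sociodemographic features (placeholders)
y = rng.integers(0, 2, size=200)      # binary label: low vs. high lead exposure (placeholder)

clf = RandomForestClassifier(n_estimators=200, random_state=42)
scores = cross_val_score(clf, X, y, cv=5, scoring="accuracy")
print(f"Mean 5-fold CV accuracy: {scores.mean():.4f}")

clf.fit(X, y)                         # refit on all data so it can be explained later
```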
Discussion: We applied two widely used XAI methods, SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-Agnostic Explanations), to provide insight into how each input feature contributed to the model's predictions.
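
The sketch below, continuing from the Random Forest example above (it reuses `clf` and `X`), shows one common way of applying SHAP and LIME to a fitted tree ensemble: SHAP for global per-feature contributions and LIME for a local explanation of a single prediction. The feature names and class labels are placeholders, the `shap` and `lime` packages must be installed separately, and this is only an assumed workflow, not the authors' implementation.

```python
# Minimal sketch of SHAP and LIME on a fitted RandomForestClassifier (clf)
# and feature matrix (X) from the previous example; names are placeholders.
import numpy as np
import shap
from lime.lime_tabular import LimeTabularExplainer

feature_names = [f"feature_{i}" for i in range(X.shape[1])]  # placeholder names

# SHAP: per-feature contributions across all samples (global view).
shap_explainer = shap.TreeExplainer(clf)
shap_values = shap_explainer.shap_values(X)
# Return shape differs across shap versions: a list of per-class arrays in
# older releases, a single (samples, features, classes) array in newer ones.
pos_class_vals = shap_values[1] if isinstance(shap_values, list) else shap_values[..., 1]
mean_abs_shap = np.abs(pos_class_vals).mean(axis=0)
for name, value in sorted(zip(feature_names, mean_abs_shap), key=lambda p: -p[1]):
    print(f"{name}: mean |SHAP| = {value:.4f}")

# LIME: local explanation for one individual prediction.
lime_explainer = LimeTabularExplainer(
    X,
    feature_names=feature_names,
    class_names=["low exposure", "high exposure"],  # placeholder class labels
    mode="classification",
)
explanation = lime_explainer.explain_instance(X[0], clf.predict_proba, num_features=12)
print(explanation.as_list())
```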