Machine learning approach in predicting early antenatal care initiation at first trimester among reproductive women in Somalia: an analysis with SHAP explanations

Jamilu Sani , Mohamed Mustaf Ahmed
{"title":"Machine learning approach in predicting early antenatal care initiation at first trimester among reproductive women in Somalia: an analysis with SHAP explanations","authors":"Jamilu Sani ,&nbsp;Mohamed Mustaf Ahmed","doi":"10.1016/j.ibmed.2025.100252","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><div>Timely antenatal care (ANC) initiation is essential for maternal and neonatal health, enabling the early detection of risks and ensuring optimal care. In Somalia, delayed initiation of ANC poses a significant health risk. This study applied machine learning (ML) models to predict early ANC initiation among Somali women and identify key predictors using SHapley Additive exPlanations (SHAP).</div></div><div><h3>Methods</h3><div>Data from the 2020 Somali Health and Demographic Survey were analyzed, focusing on ANC timing in 3138 women aged 15–49. Six ML models (Logistic Regression, Support Vector Machine, Decision Tree, Random Forest, K-Nearest Neighbors, and XGBoost) were assessed for accuracy, precision, recall, F1-score, and AUROC. Feature importance was evaluated using SHAP to interpret the influence of each predictor.</div></div><div><h3>Results</h3><div>Random Forest achieved the highest performance, with an accuracy of 70 %, precision of 0.69, recall of 0.71, and AUROC of 0.74, closely followed by XGBoost, which achieved an accuracy of 69 % and AUROC of 0.72. SHAP analysis identified the place of delivery, residence, and age group as the most influential predictors of early ANC initiation, with the number of births in the past five years showing a significant negative impact.</div></div><div><h3>Conclusion</h3><div>Machine learning models, particularly Random Forest and XGBoost, effectively predicted early ANC initiation, highlighting significant demographic and healthcare access-related predictors. These findings suggest targeted interventions focusing on delivery location preferences, residential factors, and age-specific approaches to improve early ANC attendance in Somalia.</div></div>","PeriodicalId":73399,"journal":{"name":"Intelligence-based medicine","volume":"11 ","pages":"Article 100252"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligence-based medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666521225000560","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction

Timely antenatal care (ANC) initiation is essential for maternal and neonatal health, enabling the early detection of risks and ensuring optimal care. In Somalia, delayed initiation of ANC poses a significant health risk. This study applied machine learning (ML) models to predict early ANC initiation among Somali women and identify key predictors using SHapley Additive exPlanations (SHAP).

Methods

Data from the 2020 Somali Health and Demographic Survey were analyzed, focusing on ANC timing in 3138 women aged 15–49. Six ML models (Logistic Regression, Support Vector Machine, Decision Tree, Random Forest, K-Nearest Neighbors, and XGBoost) were assessed for accuracy, precision, recall, F1-score, and AUROC. Feature importance was evaluated using SHAP to interpret the influence of each predictor.

Results

Random Forest achieved the highest performance, with an accuracy of 70 %, precision of 0.69, recall of 0.71, and AUROC of 0.74, closely followed by XGBoost, which achieved an accuracy of 69 % and AUROC of 0.72. SHAP analysis identified the place of delivery, residence, and age group as the most influential predictors of early ANC initiation, with the number of births in the past five years showing a significant negative impact.

Conclusion

Machine learning models, particularly Random Forest and XGBoost, effectively predicted early ANC initiation, highlighting significant demographic and healthcare access-related predictors. These findings suggest targeted interventions focusing on delivery location preferences, residential factors, and age-specific approaches to improve early ANC attendance in Somalia.
预测索马里育龄妇女妊娠早期产前护理的机器学习方法:基于SHAP解释的分析
及时开展产前保健(ANC)对孕产妇和新生儿健康至关重要,能够及早发现风险并确保最佳护理。在索马里,推迟启动非裔国民大会对健康构成重大风险。本研究应用机器学习(ML)模型预测索马里妇女早期ANC的发生,并使用SHapley加性解释(SHAP)确定关键预测因素。方法分析2020年索马里健康和人口调查的数据,重点分析3138名15-49岁妇女的ANC时间。对6个ML模型(逻辑回归、支持向量机、决策树、随机森林、k近邻和XGBoost)的准确性、精密度、召回率、f1评分和AUROC进行了评估。使用SHAP评估特征重要性,以解释每个预测因子的影响。结果random Forest的准确率为70%,精密度为0.69,召回率为0.71,AUROC为0.74,XGBoost紧随其后,准确率为69%,AUROC为0.72。SHAP分析确定,分娩地点、居住地和年龄组是早期ANC发生的最具影响力的预测因素,过去5年的出生数量显示出显著的负面影响。机器学习模型,特别是Random Forest和XGBoost,可以有效预测早期ANC的发生,突出了重要的人口统计学和医疗保健相关预测因子。这些发现表明,有针对性的干预措施侧重于递送地点偏好、居住因素和针对特定年龄的方法,以提高索马里ANC的早期出勤率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Intelligence-based medicine
Intelligence-based medicine Health Informatics
CiteScore
5.00
自引率
0.00%
发文量
0
审稿时长
187 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信