Causal machine learning models for predicting low birth weight in midwife-led continuity care intervention in North Shoa Zone, Ethiopia.

IF 3.3 3区医学 Q2 MEDICAL INFORMATICS

BMC Medical Informatics and Decision Making Pub Date : 2025-02-07 DOI:10.1186/s12911-025-02917-9

Wudneh Ketema Moges, Awoke Seyoum Tegegne, Aweke A Mitku, Esubalew Tesfahun, Solomon Hailemeskel

{"title":"Causal machine learning models for predicting low birth weight in midwife-led continuity care intervention in North Shoa Zone, Ethiopia.","authors":"Wudneh Ketema Moges, Awoke Seyoum Tegegne, Aweke A Mitku, Esubalew Tesfahun, Solomon Hailemeskel","doi":"10.1186/s12911-025-02917-9","DOIUrl":null,"url":null,"abstract":"Background: Low birth weight (LBW) is a critical global health issue that affects infants disproportionately, particularly in developing countries. This study adopted causal machine learning (CML) algorithms for predicting LBW in newborns, drawing from midwife-led continuity care (MLCC).Methods: A quasi-experimental study was carried out in the North Shoa Zone of Ethiopia from August 2019 to September 2020. A total of 1166 women were allocated into two groups. The first group, the MLCC group, received all their antenatal, labor, birth, and immediate post-natal care from a single midwife. The second group received care from various staff members at different times throughout their pregnancy and childbirth. In this study, CML was implemented to predict LBW. Data preprocessing, including data cleaning, was conducted. CML was then employed to identify the most suitable classifier for predicting LBW. Gradient boosting algorithms were used to estimate the causal effect of MLCC on LBW. Moreover, meta-learner algorithms were utilized to estimate the individual treatment effect (ITE), the average treatment effect (ATE), and performance. Moreover, meta-learner algorithms were utilized to estimate the individual treatment effect (ITE), the average treatment effect (ATE), and performance.Results: The study results revealed that Causal K-Nearest Neighbors (CKNN) was the most effective classifier based on accuracy and estimated LBW using a 94.52% accuracy, 90.25% precision, 92.57% recall, and an F1 score of 88.2%. Meconium aspiration, perinatal mortality, pregnancy-induced hypertension, vacuum babies in need of resuscitation, and previous surgeries on their reproductive organs were identified as the top five features affecting LBW. The estimated impact of MLCC versus other professional groups on LBW was analyzed using gradient boosting algorithms and was found to be 0.237. The estimated ATE for the S-learner was 0.284, which is lower than the true ATE of 0.216. Additionally, the estimated ITE for both the T-learner and X-learner was less than -0.5, indicating that mothers would not choose to participate in the MLCC program.Conclusions: Based on these findings, the CKNN classifier demonstrated a higher accuracy and effectiveness. The S-learner and R-learner models, utilizing the XGBoost Regressor and BaseSRegressor, provided accurate estimations of ITE for assessing the impact of the MLCC program. Promoting the MLCC program could help stabilize LBW outcomes.","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"64"},"PeriodicalIF":3.3000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11806756/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-02917-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Low birth weight (LBW) is a critical global health issue that affects infants disproportionately, particularly in developing countries. This study adopted causal machine learning (CML) algorithms for predicting LBW in newborns, drawing from midwife-led continuity care (MLCC).

Methods: A quasi-experimental study was carried out in the North Shoa Zone of Ethiopia from August 2019 to September 2020. A total of 1166 women were allocated into two groups. The first group, the MLCC group, received all their antenatal, labor, birth, and immediate post-natal care from a single midwife. The second group received care from various staff members at different times throughout their pregnancy and childbirth. In this study, CML was implemented to predict LBW. Data preprocessing, including data cleaning, was conducted. CML was then employed to identify the most suitable classifier for predicting LBW. Gradient boosting algorithms were used to estimate the causal effect of MLCC on LBW. Moreover, meta-learner algorithms were utilized to estimate the individual treatment effect (ITE), the average treatment effect (ATE), and performance. Moreover, meta-learner algorithms were utilized to estimate the individual treatment effect (ITE), the average treatment effect (ATE), and performance.

Results: The study results revealed that Causal K-Nearest Neighbors (CKNN) was the most effective classifier based on accuracy and estimated LBW using a 94.52% accuracy, 90.25% precision, 92.57% recall, and an F1 score of 88.2%. Meconium aspiration, perinatal mortality, pregnancy-induced hypertension, vacuum babies in need of resuscitation, and previous surgeries on their reproductive organs were identified as the top five features affecting LBW. The estimated impact of MLCC versus other professional groups on LBW was analyzed using gradient boosting algorithms and was found to be 0.237. The estimated ATE for the S-learner was 0.284, which is lower than the true ATE of 0.216. Additionally, the estimated ITE for both the T-learner and X-learner was less than -0.5, indicating that mothers would not choose to participate in the MLCC program.

Conclusions: Based on these findings, the CKNN classifier demonstrated a higher accuracy and effectiveness. The S-learner and R-learner models, utilizing the XGBoost Regressor and BaseSRegressor, provided accurate estimations of ITE for assessing the impact of the MLCC program. Promoting the MLCC program could help stabilize LBW outcomes.

查看原文本刊更多论文

因果机器学习模型预测低出生体重助产士主导的连续性护理干预在北Shoa区，埃塞俄比亚。

背景：低出生体重（LBW）是一个严重的全球健康问题，对婴儿的影响不成比例，特别是在发展中国家。本研究采用因果机器学习（CML）算法预测新生儿LBW，借鉴助产士主导的连续性护理（MLCC）。方法：2019年8月至2020年9月在埃塞俄比亚北部Shoa区进行准实验研究。共有1166名女性被分为两组。第一组，MLCC组，所有产前、分娩、分娩和产后护理都由一名助产士负责。第二组在怀孕和分娩的不同时期接受了不同工作人员的照顾。本研究采用CML预测LBW。进行数据预处理，包括数据清洗。然后使用CML识别最适合预测LBW的分类器。采用梯度增强算法估计MLCC对LBW的因果效应。此外，利用元学习算法估计个体治疗效果（ITE）、平均治疗效果（ATE）和绩效。此外，利用元学习算法估计个体治疗效果（ITE）、平均治疗效果（ATE）和绩效。结果：研究结果表明，因果k近邻（Causal K-Nearest Neighbors， CKNN）是基于准确率和估计LBW最有效的分类器，准确率为94.52%，精密度为90.25%，召回率为92.57%，F1得分为88.2%。胎粪吸入、围产期死亡、妊娠高血压、需要复苏的真空婴儿、生殖器官既往手术是影响LBW的前五大特征。MLCC与其他专业群体对LBW的估计影响使用梯度增强算法进行分析，发现为0.237。s型学习者的估计ATE为0.284，低于真实ATE的0.216。此外，t型学习者和x型学习者的估计ITE均小于-0.5，表明母亲不会选择参加MLCC计划。结论：基于这些发现，CKNN分类器显示出更高的准确性和有效性。S-learner和R-learner模型，利用XGBoost Regressor和BaseSRegressor，为评估MLCC计划的影响提供了准确的ITE估计。推广MLCC项目有助于稳定LBW的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Medical Informatics and Decision Making 医学-医学：信息

CiteScore

7.20

自引率

5.70%

发文量

297

审稿时长

1 months

期刊介绍： BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.