{"title":"Causal machine learning models for predicting low birth weight in midwife-led continuity care intervention in North Shoa Zone, Ethiopia.","authors":"Wudneh Ketema Moges, Awoke Seyoum Tegegne, Aweke A Mitku, Esubalew Tesfahun, Solomon Hailemeskel","doi":"10.1186/s12911-025-02917-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Low birth weight (LBW) is a critical global health issue that affects infants disproportionately, particularly in developing countries. This study adopted causal machine learning (CML) algorithms for predicting LBW in newborns, drawing from midwife-led continuity care (MLCC).</p><p><strong>Methods: </strong>A quasi-experimental study was carried out in the North Shoa Zone of Ethiopia from August 2019 to September 2020. A total of 1166 women were allocated into two groups. The first group, the MLCC group, received all their antenatal, labor, birth, and immediate post-natal care from a single midwife. The second group received care from various staff members at different times throughout their pregnancy and childbirth. In this study, CML was implemented to predict LBW. Data preprocessing, including data cleaning, was conducted. CML was then employed to identify the most suitable classifier for predicting LBW. Gradient boosting algorithms were used to estimate the causal effect of MLCC on LBW. Moreover, meta-learner algorithms were utilized to estimate the individual treatment effect (ITE), the average treatment effect (ATE), and performance. Moreover, meta-learner algorithms were utilized to estimate the individual treatment effect (ITE), the average treatment effect (ATE), and performance.</p><p><strong>Results: </strong>The study results revealed that Causal K-Nearest Neighbors (CKNN) was the most effective classifier based on accuracy and estimated LBW using a 94.52% accuracy, 90.25% precision, 92.57% recall, and an F1 score of 88.2%. Meconium aspiration, perinatal mortality, pregnancy-induced hypertension, vacuum babies in need of resuscitation, and previous surgeries on their reproductive organs were identified as the top five features affecting LBW. The estimated impact of MLCC versus other professional groups on LBW was analyzed using gradient boosting algorithms and was found to be 0.237. The estimated ATE for the S-learner was 0.284, which is lower than the true ATE of 0.216. Additionally, the estimated ITE for both the T-learner and X-learner was less than -0.5, indicating that mothers would not choose to participate in the MLCC program.</p><p><strong>Conclusions: </strong>Based on these findings, the CKNN classifier demonstrated a higher accuracy and effectiveness. The S-learner and R-learner models, utilizing the XGBoost Regressor and BaseSRegressor, provided accurate estimations of ITE for assessing the impact of the MLCC program. Promoting the MLCC program could help stabilize LBW outcomes.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"64"},"PeriodicalIF":3.3000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11806756/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-02917-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Low birth weight (LBW) is a critical global health issue that affects infants disproportionately, particularly in developing countries. This study adopted causal machine learning (CML) algorithms for predicting LBW in newborns, drawing from midwife-led continuity care (MLCC).
Methods: A quasi-experimental study was carried out in the North Shoa Zone of Ethiopia from August 2019 to September 2020. A total of 1166 women were allocated into two groups. The first group, the MLCC group, received all their antenatal, labor, birth, and immediate post-natal care from a single midwife. The second group received care from various staff members at different times throughout their pregnancy and childbirth. In this study, CML was implemented to predict LBW. Data preprocessing, including data cleaning, was conducted. CML was then employed to identify the most suitable classifier for predicting LBW. Gradient boosting algorithms were used to estimate the causal effect of MLCC on LBW. Moreover, meta-learner algorithms were utilized to estimate the individual treatment effect (ITE), the average treatment effect (ATE), and performance. Moreover, meta-learner algorithms were utilized to estimate the individual treatment effect (ITE), the average treatment effect (ATE), and performance.
Results: The study results revealed that Causal K-Nearest Neighbors (CKNN) was the most effective classifier based on accuracy and estimated LBW using a 94.52% accuracy, 90.25% precision, 92.57% recall, and an F1 score of 88.2%. Meconium aspiration, perinatal mortality, pregnancy-induced hypertension, vacuum babies in need of resuscitation, and previous surgeries on their reproductive organs were identified as the top five features affecting LBW. The estimated impact of MLCC versus other professional groups on LBW was analyzed using gradient boosting algorithms and was found to be 0.237. The estimated ATE for the S-learner was 0.284, which is lower than the true ATE of 0.216. Additionally, the estimated ITE for both the T-learner and X-learner was less than -0.5, indicating that mothers would not choose to participate in the MLCC program.
Conclusions: Based on these findings, the CKNN classifier demonstrated a higher accuracy and effectiveness. The S-learner and R-learner models, utilizing the XGBoost Regressor and BaseSRegressor, provided accurate estimations of ITE for assessing the impact of the MLCC program. Promoting the MLCC program could help stabilize LBW outcomes.
期刊介绍:
BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.