Performance evaluation and comparative analysis of different machine learning algorithms in predicting postnatal care utilization: Evidence from the ethiopian demographic and health survey 2016.

PLOS digital health Pub Date : 2025-01-09 eCollection Date: 2025-01-01 DOI:10.1371/journal.pdig.0000707
Daniel Niguse Mamo, Agmasie Damtew Walle, Eden Ketema Woldekidan, Jibril Bashir Adem, Yosef Haile Gebremariam, Meron Asmamaw Alemayehu, Ermias Bekele Enyew, Shimels Derso Kebede
{"title":"Performance evaluation and comparative analysis of different machine learning algorithms in predicting postnatal care utilization: Evidence from the ethiopian demographic and health survey 2016.","authors":"Daniel Niguse Mamo, Agmasie Damtew Walle, Eden Ketema Woldekidan, Jibril Bashir Adem, Yosef Haile Gebremariam, Meron Asmamaw Alemayehu, Ermias Bekele Enyew, Shimels Derso Kebede","doi":"10.1371/journal.pdig.0000707","DOIUrl":null,"url":null,"abstract":"<p><p>Postnatal care refers to the support provided to mothers and their newborns immediately after childbirth and during the first six weeks of life, a period when most maternal and neonatal deaths occur. In the 30 countries studied, nearly 40 percent of women did not receive a postpartum care check-up. This research aims to evaluate and compare the effectiveness of machine learning algorithms in predicting postnatal care utilization in Ethiopia and to identify the key factors involved. The study employs machine learning techniques to analyse secondary data from the 2016 Ethiopian Demographic and Health Survey. It aims to predict postnatal care utilization and identify key predictors via Python software, applying fifteen machine-learning algorithms to a sample of 7,193 women. Feature importance techniques were used to select the top predictors. The models' effectiveness was evaluated using sensitivity, specificity, F1 score, precision, accuracy, and area under the curve. Among the four experiments, tenfold cross-validation with balancing using Synthetic Minority Over-sampling Technique was outperformed. From fifteen models, the MLP Classifier (f1 score = 0.9548, AUC = 0.99), Random Forest Classifier (f1 score = 0.9543, AUC = 0.98), and Bagging Classifier (f1 score = 0.9498, AUC = 0.98) performed excellently, with a strong ability to differentiate between classes. The Region, residence, maternal education, religion, wealth index, health insurance status, and place of delivery are identified as contributing factors that predict postnatal care utilization. This study assessed machine learning models for forecasting postnatal care usage. Ten-fold cross-validation with Synthetic Minority Oversampling Technique produced the best results, emphasizing the significance of addressing class imbalance in healthcare datasets. This approach enhances the accuracy and dependability of predictive models. Key findings reveal regional and socioeconomic factors influencing PNC utilization, which can guide targeted initiatives to improve postnatal care utilization and ultimately enhance maternal and child health.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"4 1","pages":"e0000707"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11717314/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLOS digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1371/journal.pdig.0000707","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Postnatal care refers to the support provided to mothers and their newborns immediately after childbirth and during the first six weeks of life, a period when most maternal and neonatal deaths occur. In the 30 countries studied, nearly 40 percent of women did not receive a postpartum care check-up. This research aims to evaluate and compare the effectiveness of machine learning algorithms in predicting postnatal care utilization in Ethiopia and to identify the key factors involved. The study employs machine learning techniques to analyse secondary data from the 2016 Ethiopian Demographic and Health Survey. It aims to predict postnatal care utilization and identify key predictors via Python software, applying fifteen machine-learning algorithms to a sample of 7,193 women. Feature importance techniques were used to select the top predictors. The models' effectiveness was evaluated using sensitivity, specificity, F1 score, precision, accuracy, and area under the curve. Among the four experiments, tenfold cross-validation with balancing using Synthetic Minority Over-sampling Technique was outperformed. From fifteen models, the MLP Classifier (f1 score = 0.9548, AUC = 0.99), Random Forest Classifier (f1 score = 0.9543, AUC = 0.98), and Bagging Classifier (f1 score = 0.9498, AUC = 0.98) performed excellently, with a strong ability to differentiate between classes. The Region, residence, maternal education, religion, wealth index, health insurance status, and place of delivery are identified as contributing factors that predict postnatal care utilization. This study assessed machine learning models for forecasting postnatal care usage. Ten-fold cross-validation with Synthetic Minority Oversampling Technique produced the best results, emphasizing the significance of addressing class imbalance in healthcare datasets. This approach enhances the accuracy and dependability of predictive models. Key findings reveal regional and socioeconomic factors influencing PNC utilization, which can guide targeted initiatives to improve postnatal care utilization and ultimately enhance maternal and child health.

不同机器学习算法在预测产后护理利用方面的绩效评估和比较分析:来自2016年埃塞俄比亚人口与健康调查的证据。
产后护理指的是在分娩后立即和在生命的头六周内向母亲及其新生儿提供的支助,这是大多数孕产妇和新生儿死亡的时期。在研究的30个国家中,近40%的女性没有接受产后护理检查。本研究旨在评估和比较机器学习算法在预测埃塞俄比亚产后护理利用方面的有效性,并确定所涉及的关键因素。该研究采用机器学习技术分析2016年埃塞俄比亚人口与健康调查的二手数据。它旨在通过Python软件预测产后护理的利用情况,并确定关键预测因素,对7193名女性的样本应用15种机器学习算法。使用特征重要性技术来选择最重要的预测因子。采用敏感性、特异性、F1评分、精密度、准确度和曲线下面积评价模型的有效性。在四个实验中,使用合成少数派过采样技术进行平衡的十倍交叉验证优于其他实验。在15个模型中,MLP分类器(f1得分= 0.9548,AUC = 0.99)、随机森林分类器(f1得分= 0.9543,AUC = 0.98)和Bagging分类器(f1得分= 0.9498,AUC = 0.98)表现优异,具有较强的分类区分能力。地区、居住地、产妇教育程度、宗教信仰、财富指数、健康保险状况和分娩地点被确定为预测产后护理利用的影响因素。本研究评估了预测产后护理使用的机器学习模型。使用合成少数派过采样技术进行的十倍交叉验证产生了最佳结果,强调了解决医疗保健数据集中类别不平衡的重要性。该方法提高了预测模型的准确性和可靠性。主要研究结果揭示了影响PNC利用的区域和社会经济因素,这可以指导有针对性的举措,以提高产后护理利用,最终改善孕产妇和儿童健康。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信