{"title":"不同机器学习算法在预测产后护理利用方面的绩效评估和比较分析:来自2016年埃塞俄比亚人口与健康调查的证据。","authors":"Daniel Niguse Mamo, Agmasie Damtew Walle, Eden Ketema Woldekidan, Jibril Bashir Adem, Yosef Haile Gebremariam, Meron Asmamaw Alemayehu, Ermias Bekele Enyew, Shimels Derso Kebede","doi":"10.1371/journal.pdig.0000707","DOIUrl":null,"url":null,"abstract":"<p><p>Postnatal care refers to the support provided to mothers and their newborns immediately after childbirth and during the first six weeks of life, a period when most maternal and neonatal deaths occur. In the 30 countries studied, nearly 40 percent of women did not receive a postpartum care check-up. This research aims to evaluate and compare the effectiveness of machine learning algorithms in predicting postnatal care utilization in Ethiopia and to identify the key factors involved. The study employs machine learning techniques to analyse secondary data from the 2016 Ethiopian Demographic and Health Survey. It aims to predict postnatal care utilization and identify key predictors via Python software, applying fifteen machine-learning algorithms to a sample of 7,193 women. Feature importance techniques were used to select the top predictors. The models' effectiveness was evaluated using sensitivity, specificity, F1 score, precision, accuracy, and area under the curve. Among the four experiments, tenfold cross-validation with balancing using Synthetic Minority Over-sampling Technique was outperformed. From fifteen models, the MLP Classifier (f1 score = 0.9548, AUC = 0.99), Random Forest Classifier (f1 score = 0.9543, AUC = 0.98), and Bagging Classifier (f1 score = 0.9498, AUC = 0.98) performed excellently, with a strong ability to differentiate between classes. The Region, residence, maternal education, religion, wealth index, health insurance status, and place of delivery are identified as contributing factors that predict postnatal care utilization. This study assessed machine learning models for forecasting postnatal care usage. Ten-fold cross-validation with Synthetic Minority Oversampling Technique produced the best results, emphasizing the significance of addressing class imbalance in healthcare datasets. This approach enhances the accuracy and dependability of predictive models. Key findings reveal regional and socioeconomic factors influencing PNC utilization, which can guide targeted initiatives to improve postnatal care utilization and ultimately enhance maternal and child health.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"4 1","pages":"e0000707"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11717314/pdf/","citationCount":"0","resultStr":"{\"title\":\"Performance evaluation and comparative analysis of different machine learning algorithms in predicting postnatal care utilization: Evidence from the ethiopian demographic and health survey 2016.\",\"authors\":\"Daniel Niguse Mamo, Agmasie Damtew Walle, Eden Ketema Woldekidan, Jibril Bashir Adem, Yosef Haile Gebremariam, Meron Asmamaw Alemayehu, Ermias Bekele Enyew, Shimels Derso Kebede\",\"doi\":\"10.1371/journal.pdig.0000707\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Postnatal care refers to the support provided to mothers and their newborns immediately after childbirth and during the first six weeks of life, a period when most maternal and neonatal deaths occur. In the 30 countries studied, nearly 40 percent of women did not receive a postpartum care check-up. This research aims to evaluate and compare the effectiveness of machine learning algorithms in predicting postnatal care utilization in Ethiopia and to identify the key factors involved. The study employs machine learning techniques to analyse secondary data from the 2016 Ethiopian Demographic and Health Survey. It aims to predict postnatal care utilization and identify key predictors via Python software, applying fifteen machine-learning algorithms to a sample of 7,193 women. Feature importance techniques were used to select the top predictors. The models' effectiveness was evaluated using sensitivity, specificity, F1 score, precision, accuracy, and area under the curve. Among the four experiments, tenfold cross-validation with balancing using Synthetic Minority Over-sampling Technique was outperformed. From fifteen models, the MLP Classifier (f1 score = 0.9548, AUC = 0.99), Random Forest Classifier (f1 score = 0.9543, AUC = 0.98), and Bagging Classifier (f1 score = 0.9498, AUC = 0.98) performed excellently, with a strong ability to differentiate between classes. The Region, residence, maternal education, religion, wealth index, health insurance status, and place of delivery are identified as contributing factors that predict postnatal care utilization. This study assessed machine learning models for forecasting postnatal care usage. Ten-fold cross-validation with Synthetic Minority Oversampling Technique produced the best results, emphasizing the significance of addressing class imbalance in healthcare datasets. This approach enhances the accuracy and dependability of predictive models. Key findings reveal regional and socioeconomic factors influencing PNC utilization, which can guide targeted initiatives to improve postnatal care utilization and ultimately enhance maternal and child health.</p>\",\"PeriodicalId\":74465,\"journal\":{\"name\":\"PLOS digital health\",\"volume\":\"4 1\",\"pages\":\"e0000707\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11717314/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"PLOS digital health\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1371/journal.pdig.0000707\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLOS digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1371/journal.pdig.0000707","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
Performance evaluation and comparative analysis of different machine learning algorithms in predicting postnatal care utilization: Evidence from the ethiopian demographic and health survey 2016.
Postnatal care refers to the support provided to mothers and their newborns immediately after childbirth and during the first six weeks of life, a period when most maternal and neonatal deaths occur. In the 30 countries studied, nearly 40 percent of women did not receive a postpartum care check-up. This research aims to evaluate and compare the effectiveness of machine learning algorithms in predicting postnatal care utilization in Ethiopia and to identify the key factors involved. The study employs machine learning techniques to analyse secondary data from the 2016 Ethiopian Demographic and Health Survey. It aims to predict postnatal care utilization and identify key predictors via Python software, applying fifteen machine-learning algorithms to a sample of 7,193 women. Feature importance techniques were used to select the top predictors. The models' effectiveness was evaluated using sensitivity, specificity, F1 score, precision, accuracy, and area under the curve. Among the four experiments, tenfold cross-validation with balancing using Synthetic Minority Over-sampling Technique was outperformed. From fifteen models, the MLP Classifier (f1 score = 0.9548, AUC = 0.99), Random Forest Classifier (f1 score = 0.9543, AUC = 0.98), and Bagging Classifier (f1 score = 0.9498, AUC = 0.98) performed excellently, with a strong ability to differentiate between classes. The Region, residence, maternal education, religion, wealth index, health insurance status, and place of delivery are identified as contributing factors that predict postnatal care utilization. This study assessed machine learning models for forecasting postnatal care usage. Ten-fold cross-validation with Synthetic Minority Oversampling Technique produced the best results, emphasizing the significance of addressing class imbalance in healthcare datasets. This approach enhances the accuracy and dependability of predictive models. Key findings reveal regional and socioeconomic factors influencing PNC utilization, which can guide targeted initiatives to improve postnatal care utilization and ultimately enhance maternal and child health.