Frederick Osei Owusu, Helena Addai-Manu, Esther Serwah Agbedinu, Emmanuel Konadu, Lydia Asenso, Mercy Addae, Joseph Osarfo, Brenda Abena Ampah, Douglas Aninng Opoku
{"title":"使用机器学习算法预测加纳一家地区医院孕妇剖宫产。","authors":"Frederick Osei Owusu, Helena Addai-Manu, Esther Serwah Agbedinu, Emmanuel Konadu, Lydia Asenso, Mercy Addae, Joseph Osarfo, Brenda Abena Ampah, Douglas Aninng Opoku","doi":"10.1186/s12884-025-07716-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Machine learning algorithms may contribute to improving maternal and child health, including determining the suitability of caesarean section (CS) births in low-resource countries. Despite machine learning algorithms offering a more robust approach to predicting/diagnosing a health-related problem, research on their use in determining CS birth is scarce in sub-Saharan Africa. This study therefore aimed to compare the performance of five machine learning techniques in predicting CS birth among pregnant women in a district hospital in Ghana.</p><p><strong>Methods: </strong>This was a cross-sectional study that used retrospective data from medical records of pregnant women who delivered at a district hospital in Ghana. A clinical decision support system for predicting CS birth was developed using five machine learning techniques including logistic regression, Support Vector Machines, Naïve Bayes, Random Forest and Extreme Gradient Boosting. Measures such as accuracy, sensitivity, specificity, negative and positive predictive values and area under the receiver operating characteristics curve (AUC-ROC) were used for the model performance.</p><p><strong>Results: </strong>Of a total of 2310 deliveries, the prevalence of CS birth was 37.7% with previous CS being the most prevalent indication. The Random Forest model showed the best performance for predicting CS birth with an accuracy of 0.981, recall of 0.994, F1 score of 0.985 and an AUC-ROC of 0.988. The Naïve Bayes model followed with an accuracy of 0.965, recall of 0.967, F1 score of 0.972 and AUC-ROC of 0.986. The top five most important predictors proved to be diastolic (0.0906) and systolic (0.0848) blood pressures, maternal age (0.0756), previous CS (0.0641) and marital status (0.0400).</p><p><strong>Conclusion: </strong>This study demonstrated that although all five machine learning techniques had good performance in determining CS births, the Random Forest model was superior to all the others in predicting them. This finding suggests that machine learning could help identify at-risk pregnant women for CS births, potentially supporting early interventions and informing policies in maternal healthcare.</p>","PeriodicalId":9033,"journal":{"name":"BMC Pregnancy and Childbirth","volume":"25 1","pages":"690"},"PeriodicalIF":2.7000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12219967/pdf/","citationCount":"0","resultStr":"{\"title\":\"Prediction of caesarean section birth using machine learning algorithms among pregnant women in a district hospital in Ghana.\",\"authors\":\"Frederick Osei Owusu, Helena Addai-Manu, Esther Serwah Agbedinu, Emmanuel Konadu, Lydia Asenso, Mercy Addae, Joseph Osarfo, Brenda Abena Ampah, Douglas Aninng Opoku\",\"doi\":\"10.1186/s12884-025-07716-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Machine learning algorithms may contribute to improving maternal and child health, including determining the suitability of caesarean section (CS) births in low-resource countries. Despite machine learning algorithms offering a more robust approach to predicting/diagnosing a health-related problem, research on their use in determining CS birth is scarce in sub-Saharan Africa. This study therefore aimed to compare the performance of five machine learning techniques in predicting CS birth among pregnant women in a district hospital in Ghana.</p><p><strong>Methods: </strong>This was a cross-sectional study that used retrospective data from medical records of pregnant women who delivered at a district hospital in Ghana. A clinical decision support system for predicting CS birth was developed using five machine learning techniques including logistic regression, Support Vector Machines, Naïve Bayes, Random Forest and Extreme Gradient Boosting. Measures such as accuracy, sensitivity, specificity, negative and positive predictive values and area under the receiver operating characteristics curve (AUC-ROC) were used for the model performance.</p><p><strong>Results: </strong>Of a total of 2310 deliveries, the prevalence of CS birth was 37.7% with previous CS being the most prevalent indication. The Random Forest model showed the best performance for predicting CS birth with an accuracy of 0.981, recall of 0.994, F1 score of 0.985 and an AUC-ROC of 0.988. The Naïve Bayes model followed with an accuracy of 0.965, recall of 0.967, F1 score of 0.972 and AUC-ROC of 0.986. The top five most important predictors proved to be diastolic (0.0906) and systolic (0.0848) blood pressures, maternal age (0.0756), previous CS (0.0641) and marital status (0.0400).</p><p><strong>Conclusion: </strong>This study demonstrated that although all five machine learning techniques had good performance in determining CS births, the Random Forest model was superior to all the others in predicting them. This finding suggests that machine learning could help identify at-risk pregnant women for CS births, potentially supporting early interventions and informing policies in maternal healthcare.</p>\",\"PeriodicalId\":9033,\"journal\":{\"name\":\"BMC Pregnancy and Childbirth\",\"volume\":\"25 1\",\"pages\":\"690\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12219967/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Pregnancy and Childbirth\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12884-025-07716-8\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OBSTETRICS & GYNECOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Pregnancy and Childbirth","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12884-025-07716-8","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
Prediction of caesarean section birth using machine learning algorithms among pregnant women in a district hospital in Ghana.
Background: Machine learning algorithms may contribute to improving maternal and child health, including determining the suitability of caesarean section (CS) births in low-resource countries. Despite machine learning algorithms offering a more robust approach to predicting/diagnosing a health-related problem, research on their use in determining CS birth is scarce in sub-Saharan Africa. This study therefore aimed to compare the performance of five machine learning techniques in predicting CS birth among pregnant women in a district hospital in Ghana.
Methods: This was a cross-sectional study that used retrospective data from medical records of pregnant women who delivered at a district hospital in Ghana. A clinical decision support system for predicting CS birth was developed using five machine learning techniques including logistic regression, Support Vector Machines, Naïve Bayes, Random Forest and Extreme Gradient Boosting. Measures such as accuracy, sensitivity, specificity, negative and positive predictive values and area under the receiver operating characteristics curve (AUC-ROC) were used for the model performance.
Results: Of a total of 2310 deliveries, the prevalence of CS birth was 37.7% with previous CS being the most prevalent indication. The Random Forest model showed the best performance for predicting CS birth with an accuracy of 0.981, recall of 0.994, F1 score of 0.985 and an AUC-ROC of 0.988. The Naïve Bayes model followed with an accuracy of 0.965, recall of 0.967, F1 score of 0.972 and AUC-ROC of 0.986. The top five most important predictors proved to be diastolic (0.0906) and systolic (0.0848) blood pressures, maternal age (0.0756), previous CS (0.0641) and marital status (0.0400).
Conclusion: This study demonstrated that although all five machine learning techniques had good performance in determining CS births, the Random Forest model was superior to all the others in predicting them. This finding suggests that machine learning could help identify at-risk pregnant women for CS births, potentially supporting early interventions and informing policies in maternal healthcare.
期刊介绍:
BMC Pregnancy & Childbirth is an open access, peer-reviewed journal that considers articles on all aspects of pregnancy and childbirth. The journal welcomes submissions on the biomedical aspects of pregnancy, breastfeeding, labor, maternal health, maternity care, trends and sociological aspects of pregnancy and childbirth.