Development and Validation of An Interpretable Machine Learning-Based Prediction Model of Postpartum Hemorrhage in Placenta Previa Following Cesarean Section: A Multicenter Study.
Mianmian Li, Xinhui Su, Wenxin Liao, Li Huang, Yihong Yang, Xizi Wu, Yao Fan, Jing Liu, Xin Yang, Zhen Zeng, Wencheng Ding, Wanjiang Zeng, Xiaoyan Xu
{"title":"Development and Validation of An Interpretable Machine Learning-Based Prediction Model of Postpartum Hemorrhage in Placenta Previa Following Cesarean Section: A Multicenter Study.","authors":"Mianmian Li, Xinhui Su, Wenxin Liao, Li Huang, Yihong Yang, Xizi Wu, Yao Fan, Jing Liu, Xin Yang, Zhen Zeng, Wencheng Ding, Wanjiang Zeng, Xiaoyan Xu","doi":"10.1007/s43032-025-01937-0","DOIUrl":null,"url":null,"abstract":"<p><p>The objective of this study is to predict the occurrence of postpartum hemorrhage in women with placenta previa based on machine learning. This retrospective study enrolled 845 singleton pregnant patients with placenta previa from two hospitals. They were allocated into a training cohort (n = 403), a testing cohort (n = 174), and the external validation cohort (n = 268). Univariate and multivariate regression analyses were employed to select clinical variables (p < 0.05), which were subsequently utilized to develop 11 machine learning prediction models. The area under the receiver operating characteristic curve (AUC), decision curve analysis (DCA), accuracy (ACC), sensitivity (SEN), and specificity (SPE) were used to evaluate the performance of the models. Besides, SHapley Additive exPlanations (SHAP) was used to interpret the role and effectiveness of variables in the predictive model. Three machine learning models with the best predictive performance were combined into a Prediction Ensemble Classifier through voting. The Gradient Boosting Machine demonstrated the best predictive performance. In the validation cohort, AUC of the Gradient Boosting Machine model is 0.810(95% CI 0.754-0.865), ACC was 0.765(95% CI 0.716-0.813), SEN was 0.613(95% CI 0.513-0.723), while these values of the Prediction Ensemble Classifier were 0.813(0.756-0.871), 0.806(0.757-0.854), and 0.480(0.375-0.597), respectively. The importance of SHAP variables in the model, ranked from high to low, is as follows: d-dimer, ultrasound diagnosis of placenta accreta spectrum, neutrophils, prothrombin time, and platelets. The Gradient Boosting Machine model demonstrated excellent performance in predicting postpartum hemorrhage in cases of placenta previa. Furthermore, SHAP analysis enabled interpretation of the variables in the model.</p>","PeriodicalId":20920,"journal":{"name":"Reproductive Sciences","volume":" ","pages":"3062-3073"},"PeriodicalIF":2.5000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12443912/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reproductive Sciences","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s43032-025-01937-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/12 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The objective of this study is to predict the occurrence of postpartum hemorrhage in women with placenta previa based on machine learning. This retrospective study enrolled 845 singleton pregnant patients with placenta previa from two hospitals. They were allocated into a training cohort (n = 403), a testing cohort (n = 174), and the external validation cohort (n = 268). Univariate and multivariate regression analyses were employed to select clinical variables (p < 0.05), which were subsequently utilized to develop 11 machine learning prediction models. The area under the receiver operating characteristic curve (AUC), decision curve analysis (DCA), accuracy (ACC), sensitivity (SEN), and specificity (SPE) were used to evaluate the performance of the models. Besides, SHapley Additive exPlanations (SHAP) was used to interpret the role and effectiveness of variables in the predictive model. Three machine learning models with the best predictive performance were combined into a Prediction Ensemble Classifier through voting. The Gradient Boosting Machine demonstrated the best predictive performance. In the validation cohort, AUC of the Gradient Boosting Machine model is 0.810(95% CI 0.754-0.865), ACC was 0.765(95% CI 0.716-0.813), SEN was 0.613(95% CI 0.513-0.723), while these values of the Prediction Ensemble Classifier were 0.813(0.756-0.871), 0.806(0.757-0.854), and 0.480(0.375-0.597), respectively. The importance of SHAP variables in the model, ranked from high to low, is as follows: d-dimer, ultrasound diagnosis of placenta accreta spectrum, neutrophils, prothrombin time, and platelets. The Gradient Boosting Machine model demonstrated excellent performance in predicting postpartum hemorrhage in cases of placenta previa. Furthermore, SHAP analysis enabled interpretation of the variables in the model.
期刊介绍:
Reproductive Sciences (RS) is a peer-reviewed, monthly journal publishing original research and reviews in obstetrics and gynecology. RS is multi-disciplinary and includes research in basic reproductive biology and medicine, maternal-fetal medicine, obstetrics, gynecology, reproductive endocrinology, urogynecology, fertility/infertility, embryology, gynecologic/reproductive oncology, developmental biology, stem cell research, molecular/cellular biology and other related fields.