Development and validation of a spontaneous preterm birth risk prediction algorithm based on maternal bioinformatics: A single-center retrospective study.
{"title":"Development and validation of a spontaneous preterm birth risk prediction algorithm based on maternal bioinformatics: A single-center retrospective study.","authors":"Yu Chen, Xinyan Shi, Zhiyi Wang, Lin Zhang","doi":"10.1186/s12884-024-06933-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Spontaneous preterm birth (sPTB) is a primary cause of adverse neonatal outcomes. The objective of this study is to analyze the factors influencing the occurrence of sPTB in pregnant women and to construct and validate a predictive model for sPTB risk based on big data from clinical and laboratory assessments during pregnancy.</p><p><strong>Methods: </strong>A retrospective analysis was conducted on the clinical data of 3,082 pregnant women, categorizing those who delivered before 37 weeks of gestation as the sPTB group and those who delivered at or after 37 weeks as the full-term group. The performance of five machine learning models was compared using metrics such as the AUC, accuracy, sensitivity, specificity, and precision to identify the optimal predictive model. The top 10 predictive variables were selected based on their significance in disease prediction. The data were then divided into a training set (70%) and a validation set (30%) for validation. External data were also utilized to validate the model's predictive performance.</p><p><strong>Results: </strong>A total of 24 indicators with significant differences were identified. In terms of predicting the risk of preterm birth, the XGBoost algorithm demonstrated the most outstanding performance, with an AUC<sub>ROC</sub> of 0.89 (95% CI: 0.88-0.90). The top 10 critical indicators included ALP, AFP, ALB, HCT, TC, DBP, ALT, PLT, height, and SBP, which are essential for constructing an accurate predictive model. The model exhibited stable performance on both the training and validation sets, with AUC values of 0.93 and 0.87, respectively. Furthermore, the external testing set also showed superior performance, with an AUC of 0.79.</p><p><strong>Conclusions: </strong>At the time of delivery, ALP, AFP, ALB, HCT, TC, DBP, ALT, PLT, height, and SBP are influential factors for sPTB in pregnant women. The XGBoost algorithm, constructed based on these factors, demonstrated the most outstanding performance.</p>","PeriodicalId":9033,"journal":{"name":"BMC Pregnancy and Childbirth","volume":"24 1","pages":"763"},"PeriodicalIF":2.8000,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11571659/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Pregnancy and Childbirth","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12884-024-06933-x","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Spontaneous preterm birth (sPTB) is a primary cause of adverse neonatal outcomes. The objective of this study is to analyze the factors influencing the occurrence of sPTB in pregnant women and to construct and validate a predictive model for sPTB risk based on big data from clinical and laboratory assessments during pregnancy.
Methods: A retrospective analysis was conducted on the clinical data of 3,082 pregnant women, categorizing those who delivered before 37 weeks of gestation as the sPTB group and those who delivered at or after 37 weeks as the full-term group. The performance of five machine learning models was compared using metrics such as the AUC, accuracy, sensitivity, specificity, and precision to identify the optimal predictive model. The top 10 predictive variables were selected based on their significance in disease prediction. The data were then divided into a training set (70%) and a validation set (30%) for validation. External data were also utilized to validate the model's predictive performance.
Results: A total of 24 indicators with significant differences were identified. In terms of predicting the risk of preterm birth, the XGBoost algorithm demonstrated the most outstanding performance, with an AUCROC of 0.89 (95% CI: 0.88-0.90). The top 10 critical indicators included ALP, AFP, ALB, HCT, TC, DBP, ALT, PLT, height, and SBP, which are essential for constructing an accurate predictive model. The model exhibited stable performance on both the training and validation sets, with AUC values of 0.93 and 0.87, respectively. Furthermore, the external testing set also showed superior performance, with an AUC of 0.79.
Conclusions: At the time of delivery, ALP, AFP, ALB, HCT, TC, DBP, ALT, PLT, height, and SBP are influential factors for sPTB in pregnant women. The XGBoost algorithm, constructed based on these factors, demonstrated the most outstanding performance.
期刊介绍:
BMC Pregnancy & Childbirth is an open access, peer-reviewed journal that considers articles on all aspects of pregnancy and childbirth. The journal welcomes submissions on the biomedical aspects of pregnancy, breastfeeding, labor, maternal health, maternity care, trends and sociological aspects of pregnancy and childbirth.