Tetsuya Kawakita, Juliana G Martins, Yara H Diab, Lea Nehme, George Saade
{"title":"Derivation and Validation of Prediction of Preterm Preeclampsia Using Machine Learning Algorithms.","authors":"Tetsuya Kawakita, Juliana G Martins, Yara H Diab, Lea Nehme, George Saade","doi":"10.1055/a-2495-3600","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong> This study aimed to develop machine learning (ML) models for predicting preterm preeclampsia using the information available before 23 weeks gestation.</p><p><strong>Study design: </strong> This was a secondary analysis of the Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-Be (nuMoM2b) cohort. We considered 131 features available before 23 weeks including maternal demographics, obstetrics and family history, social determinants of health, physical activity, nutrition, and early second-trimester ultrasound. Our primary outcome was preterm preeclampsia before 37 weeks. The dataset was randomly split into a training set (70%) and a validation set (30%). ML models using glmnet, multilayer perceptron, random forest, XGBoost (extreme gradient boosting), and LightGBM models were developed. Using the ML approach that achieved the best area under the curve (AUC), we developed the final model. Further feature selection was conducted from the top 25 important features based on SHapley Additive exPlanations (SHAP) values. The performance of the final model was assessed using the validation dataset.</p><p><strong>Results: </strong> Of 9,467 individuals, 219 (2.3%) had preterm preeclampsia. The AUC of the XGBoost model was the highest (AUC = 0.749 [95% confidence interval (95% CI), 0.736-0.762]) compared with other models. Therefore, XGBoost was used to develop models using fewer variables. The XGBoost model with the eight features (in order of importance: mean uterine artery pulsatility index in the early second trimester, chronic hypertension, pregestational diabetes, uterine artery notch, systolic and diastolic blood pressure in the first trimester, body mass index, and maternal age) was chosen as the final model as it had an AUC of 0.741 (95% CI, 0.730-0.752) which was not inferior to the original model (<i>p</i> = 0.58). The final model in the validation dataset had an AUC of 0.779 (95% CI, 0.722-0.831). An online application of the final model was developed ( https://kawakita.shinyapps.io/Preterm_preeclampsia/ ).</p><p><strong>Conclusion: </strong> ML algorithms using information available before 23 weeks can accurately predict preterm preeclampsia before 37 weeks.</p><p><strong>Key points: </strong>· Prediction models using uterine artery Doppler have not been adopted in the US.. · We developed a model using an ML algorithm.. · An online application of the final model was developed.. · ML algorithms using information available before 23 weeks can accurately predict preterm preeclampsia before 37 weeks..</p>","PeriodicalId":7584,"journal":{"name":"American journal of perinatology","volume":" ","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of perinatology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1055/a-2495-3600","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: This study aimed to develop machine learning (ML) models for predicting preterm preeclampsia using the information available before 23 weeks gestation.
Study design: This was a secondary analysis of the Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-Be (nuMoM2b) cohort. We considered 131 features available before 23 weeks including maternal demographics, obstetrics and family history, social determinants of health, physical activity, nutrition, and early second-trimester ultrasound. Our primary outcome was preterm preeclampsia before 37 weeks. The dataset was randomly split into a training set (70%) and a validation set (30%). ML models using glmnet, multilayer perceptron, random forest, XGBoost (extreme gradient boosting), and LightGBM models were developed. Using the ML approach that achieved the best area under the curve (AUC), we developed the final model. Further feature selection was conducted from the top 25 important features based on SHapley Additive exPlanations (SHAP) values. The performance of the final model was assessed using the validation dataset.
Results: Of 9,467 individuals, 219 (2.3%) had preterm preeclampsia. The AUC of the XGBoost model was the highest (AUC = 0.749 [95% confidence interval (95% CI), 0.736-0.762]) compared with other models. Therefore, XGBoost was used to develop models using fewer variables. The XGBoost model with the eight features (in order of importance: mean uterine artery pulsatility index in the early second trimester, chronic hypertension, pregestational diabetes, uterine artery notch, systolic and diastolic blood pressure in the first trimester, body mass index, and maternal age) was chosen as the final model as it had an AUC of 0.741 (95% CI, 0.730-0.752) which was not inferior to the original model (p = 0.58). The final model in the validation dataset had an AUC of 0.779 (95% CI, 0.722-0.831). An online application of the final model was developed ( https://kawakita.shinyapps.io/Preterm_preeclampsia/ ).
Conclusion: ML algorithms using information available before 23 weeks can accurately predict preterm preeclampsia before 37 weeks.
Key points: · Prediction models using uterine artery Doppler have not been adopted in the US.. · We developed a model using an ML algorithm.. · An online application of the final model was developed.. · ML algorithms using information available before 23 weeks can accurately predict preterm preeclampsia before 37 weeks..
期刊介绍:
The American Journal of Perinatology is an international, peer-reviewed, and indexed journal publishing 14 issues a year dealing with original research and topical reviews. It is the definitive forum for specialists in obstetrics, neonatology, perinatology, and maternal/fetal medicine, with emphasis on bridging the different fields.
The focus is primarily on clinical and translational research, clinical and technical advances in diagnosis, monitoring, and treatment as well as evidence-based reviews. Topics of interest include epidemiology, diagnosis, prevention, and management of maternal, fetal, and neonatal diseases. Manuscripts on new technology, NICU set-ups, and nursing topics are published to provide a broad survey of important issues in this field.
All articles undergo rigorous peer review, with web-based submission, expedited turn-around, and availability of electronic publication.
The American Journal of Perinatology is accompanied by AJP Reports - an Open Access journal for case reports in neonatology and maternal/fetal medicine.