Derivation and Validation of Prediction of Preterm Preeclampsia Using Machine Learning Algorithms.

IF 1.5 4区 医学 Q3 OBSTETRICS & GYNECOLOGY
Tetsuya Kawakita, Juliana G Martins, Yara H Diab, Lea Nehme, George Saade
{"title":"Derivation and Validation of Prediction of Preterm Preeclampsia Using Machine Learning Algorithms.","authors":"Tetsuya Kawakita, Juliana G Martins, Yara H Diab, Lea Nehme, George Saade","doi":"10.1055/a-2495-3600","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong> This study aimed to develop machine learning (ML) models for predicting preterm preeclampsia using the information available before 23 weeks gestation.</p><p><strong>Study design: </strong> This was a secondary analysis of the Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-Be (nuMoM2b) cohort. We considered 131 features available before 23 weeks including maternal demographics, obstetrics and family history, social determinants of health, physical activity, nutrition, and early second-trimester ultrasound. Our primary outcome was preterm preeclampsia before 37 weeks. The dataset was randomly split into a training set (70%) and a validation set (30%). ML models using glmnet, multilayer perceptron, random forest, XGBoost (extreme gradient boosting), and LightGBM models were developed. Using the ML approach that achieved the best area under the curve (AUC), we developed the final model. Further feature selection was conducted from the top 25 important features based on SHapley Additive exPlanations (SHAP) values. The performance of the final model was assessed using the validation dataset.</p><p><strong>Results: </strong> Of 9,467 individuals, 219 (2.3%) had preterm preeclampsia. The AUC of the XGBoost model was the highest (AUC = 0.749 [95% confidence interval (95% CI), 0.736-0.762]) compared with other models. Therefore, XGBoost was used to develop models using fewer variables. The XGBoost model with the eight features (in order of importance: mean uterine artery pulsatility index in the early second trimester, chronic hypertension, pregestational diabetes, uterine artery notch, systolic and diastolic blood pressure in the first trimester, body mass index, and maternal age) was chosen as the final model as it had an AUC of 0.741 (95% CI, 0.730-0.752) which was not inferior to the original model (<i>p</i> = 0.58). The final model in the validation dataset had an AUC of 0.779 (95% CI, 0.722-0.831). An online application of the final model was developed ( https://kawakita.shinyapps.io/Preterm_preeclampsia/ ).</p><p><strong>Conclusion: </strong> ML algorithms using information available before 23 weeks can accurately predict preterm preeclampsia before 37 weeks.</p><p><strong>Key points: </strong>· Prediction models using uterine artery Doppler have not been adopted in the US.. · We developed a model using an ML algorithm.. · An online application of the final model was developed.. · ML algorithms using information available before 23 weeks can accurately predict preterm preeclampsia before 37 weeks..</p>","PeriodicalId":7584,"journal":{"name":"American journal of perinatology","volume":" ","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of perinatology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1055/a-2495-3600","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Objective:  This study aimed to develop machine learning (ML) models for predicting preterm preeclampsia using the information available before 23 weeks gestation.

Study design:  This was a secondary analysis of the Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-Be (nuMoM2b) cohort. We considered 131 features available before 23 weeks including maternal demographics, obstetrics and family history, social determinants of health, physical activity, nutrition, and early second-trimester ultrasound. Our primary outcome was preterm preeclampsia before 37 weeks. The dataset was randomly split into a training set (70%) and a validation set (30%). ML models using glmnet, multilayer perceptron, random forest, XGBoost (extreme gradient boosting), and LightGBM models were developed. Using the ML approach that achieved the best area under the curve (AUC), we developed the final model. Further feature selection was conducted from the top 25 important features based on SHapley Additive exPlanations (SHAP) values. The performance of the final model was assessed using the validation dataset.

Results:  Of 9,467 individuals, 219 (2.3%) had preterm preeclampsia. The AUC of the XGBoost model was the highest (AUC = 0.749 [95% confidence interval (95% CI), 0.736-0.762]) compared with other models. Therefore, XGBoost was used to develop models using fewer variables. The XGBoost model with the eight features (in order of importance: mean uterine artery pulsatility index in the early second trimester, chronic hypertension, pregestational diabetes, uterine artery notch, systolic and diastolic blood pressure in the first trimester, body mass index, and maternal age) was chosen as the final model as it had an AUC of 0.741 (95% CI, 0.730-0.752) which was not inferior to the original model (p = 0.58). The final model in the validation dataset had an AUC of 0.779 (95% CI, 0.722-0.831). An online application of the final model was developed ( https://kawakita.shinyapps.io/Preterm_preeclampsia/ ).

Conclusion:  ML algorithms using information available before 23 weeks can accurately predict preterm preeclampsia before 37 weeks.

Key points: · Prediction models using uterine artery Doppler have not been adopted in the US.. · We developed a model using an ML algorithm.. · An online application of the final model was developed.. · ML algorithms using information available before 23 weeks can accurately predict preterm preeclampsia before 37 weeks..

求助全文
约1分钟内获得全文 求助全文
来源期刊
American journal of perinatology
American journal of perinatology 医学-妇产科学
CiteScore
5.90
自引率
0.00%
发文量
302
审稿时长
4-8 weeks
期刊介绍: The American Journal of Perinatology is an international, peer-reviewed, and indexed journal publishing 14 issues a year dealing with original research and topical reviews. It is the definitive forum for specialists in obstetrics, neonatology, perinatology, and maternal/fetal medicine, with emphasis on bridging the different fields. The focus is primarily on clinical and translational research, clinical and technical advances in diagnosis, monitoring, and treatment as well as evidence-based reviews. Topics of interest include epidemiology, diagnosis, prevention, and management of maternal, fetal, and neonatal diseases. Manuscripts on new technology, NICU set-ups, and nursing topics are published to provide a broad survey of important issues in this field. All articles undergo rigorous peer review, with web-based submission, expedited turn-around, and availability of electronic publication. The American Journal of Perinatology is accompanied by AJP Reports - an Open Access journal for case reports in neonatology and maternal/fetal medicine.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信