A comprehensive and bias-free machine learning approach for risk prediction of preeclampsia with severe features in a nulliparous study cohort.

IF 2.8 2区 医学 Q1 OBSTETRICS & GYNECOLOGY
Yun C Lin, Daniel Mallia, Andrea O Clark-Sevilla, Adam Catto, Alisa Leshchenko, Qi Yan, David M Haas, Ronald Wapner, Itsik Pe'er, Anita Raja, Ansaf Salleb-Aouissi
{"title":"A comprehensive and bias-free machine learning approach for risk prediction of preeclampsia with severe features in a nulliparous study cohort.","authors":"Yun C Lin, Daniel Mallia, Andrea O Clark-Sevilla, Adam Catto, Alisa Leshchenko, Qi Yan, David M Haas, Ronald Wapner, Itsik Pe'er, Anita Raja, Ansaf Salleb-Aouissi","doi":"10.1186/s12884-024-06988-w","DOIUrl":null,"url":null,"abstract":"<p><p>Preeclampsia is one of the leading causes of maternal morbidity, with consequences during and after pregnancy. Because of its diverse clinical presentation, preeclampsia is an adverse pregnancy outcome that is uniquely challenging to predict and manage. In this paper, we developed racial bias-free machine learning models that predict the onset of preeclampsia with severe features or eclampsia at discrete time points in a nulliparous pregnant study cohort. To focus on those most at risk, we selected probands with severe PE (sPE). Those with mild preeclampsia, superimposed preeclampsia, and new onset hypertension were excluded.The prospective study cohort to which we applied machine learning is the Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-be (nuMoM2b) study, which contains information from eight clinical sites across the US. Maternal serum samples were collected for 1,857 individuals between the first and second trimesters. These patients with serum samples collected are selected as the final cohort.Our prediction models achieved an AUROC of 0.72 (95% CI, 0.69-0.76), 0.75 (95% CI, 0.71-0.79), and 0.77 (95% CI, 0.74-0.80), respectively, for the three visits. Our initial models were biased toward non-Hispanic black participants with a high predictive equality ratio of 1.31. We corrected this bias and reduced this ratio to 1.14. This lowers the rate of false positives in our predictive model for the non-Hispanic black participants. The exact cause of the bias is still under investigation, but previous studies have recognized PLGF as a potential bias-inducing factor. However, since our model includes various factors that exhibit a positive correlation with PLGF, such as blood pressure measurements and BMI, we have employed an algorithmic approach to disentangle this bias from the model.The top features of our built model stress the importance of using several tests, particularly for biomarkers (BMI and blood pressure measurements) and ultrasound measurements. Placental analytes (PLGF and Endoglin) were strong predictors for screening for the early onset of preeclampsia with severe features in the first two trimesters.</p>","PeriodicalId":9033,"journal":{"name":"BMC Pregnancy and Childbirth","volume":"24 1","pages":"853"},"PeriodicalIF":2.8000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11667971/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Pregnancy and Childbirth","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12884-024-06988-w","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Preeclampsia is one of the leading causes of maternal morbidity, with consequences during and after pregnancy. Because of its diverse clinical presentation, preeclampsia is an adverse pregnancy outcome that is uniquely challenging to predict and manage. In this paper, we developed racial bias-free machine learning models that predict the onset of preeclampsia with severe features or eclampsia at discrete time points in a nulliparous pregnant study cohort. To focus on those most at risk, we selected probands with severe PE (sPE). Those with mild preeclampsia, superimposed preeclampsia, and new onset hypertension were excluded.The prospective study cohort to which we applied machine learning is the Nulliparous Pregnancy Outcomes Study: Monitoring Mothers-to-be (nuMoM2b) study, which contains information from eight clinical sites across the US. Maternal serum samples were collected for 1,857 individuals between the first and second trimesters. These patients with serum samples collected are selected as the final cohort.Our prediction models achieved an AUROC of 0.72 (95% CI, 0.69-0.76), 0.75 (95% CI, 0.71-0.79), and 0.77 (95% CI, 0.74-0.80), respectively, for the three visits. Our initial models were biased toward non-Hispanic black participants with a high predictive equality ratio of 1.31. We corrected this bias and reduced this ratio to 1.14. This lowers the rate of false positives in our predictive model for the non-Hispanic black participants. The exact cause of the bias is still under investigation, but previous studies have recognized PLGF as a potential bias-inducing factor. However, since our model includes various factors that exhibit a positive correlation with PLGF, such as blood pressure measurements and BMI, we have employed an algorithmic approach to disentangle this bias from the model.The top features of our built model stress the importance of using several tests, particularly for biomarkers (BMI and blood pressure measurements) and ultrasound measurements. Placental analytes (PLGF and Endoglin) were strong predictors for screening for the early onset of preeclampsia with severe features in the first two trimesters.

一种全面、无偏倚的机器学习方法用于无产队列研究中具有严重特征的先兆子痫的风险预测。
先兆子痫是产妇发病的主要原因之一,在怀孕期间和之后都会产生后果。由于其不同的临床表现,子痫前期是一个不良妊娠结局,是独特的挑战,以预测和管理。在本文中,我们开发了无种族偏见的机器学习模型,用于预测具有严重特征的先兆子痫或子痫在未产妊娠研究队列中离散时间点的发作。为了关注那些最危险的人,我们选择了患有严重PE (sPE)的先证。排除轻度子痫前期、合并子痫前期和新发高血压的患者。我们应用机器学习的前瞻性研究队列是未产妊娠结局研究:监测准妈妈(nuMoM2b)研究,该研究包含来自美国八个临床站点的信息。在妊娠早期和中期收集了1857名孕妇的血清样本。收集到血清样本的患者被选为最终队列。我们的预测模型对三次就诊的AUROC分别为0.72 (95% CI, 0.69-0.76)、0.75 (95% CI, 0.71-0.79)和0.77 (95% CI, 0.74-0.80)。我们最初的模型偏向于非西班牙裔黑人参与者,其预测平等比为1.31。我们纠正了这一偏差,并将这一比率降至1.14。这降低了我们对非西班牙裔黑人参与者的预测模型中的假阳性率。这种偏倚的确切原因仍在调查中,但先前的研究已经认识到PLGF是一种潜在的偏倚诱发因素。然而,由于我们的模型包含了与PLGF呈正相关的各种因素,如血压测量和BMI,我们采用了一种算法方法来消除模型中的这种偏差。我们建立的模型的主要特征强调了使用几种测试的重要性,特别是生物标志物(BMI和血压测量)和超声波测量。胎盘分析物(PLGF和内啡肽)是筛查早发性子痫前期前两个月严重特征的有力预测因子。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Pregnancy and Childbirth
BMC Pregnancy and Childbirth OBSTETRICS & GYNECOLOGY-
CiteScore
4.90
自引率
6.50%
发文量
845
审稿时长
3-8 weeks
期刊介绍: BMC Pregnancy & Childbirth is an open access, peer-reviewed journal that considers articles on all aspects of pregnancy and childbirth. The journal welcomes submissions on the biomedical aspects of pregnancy, breastfeeding, labor, maternal health, maternity care, trends and sociological aspects of pregnancy and childbirth.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信