Machine learning model-based preterm birth prediction and clinical nomogram: A big retrospective cohort study.

IF 2.6 3区 医学 Q2 OBSTETRICS & GYNECOLOGY
Ya Liu, Jiangling Liu, Heqing Shen
{"title":"Machine learning model-based preterm birth prediction and clinical nomogram: A big retrospective cohort study.","authors":"Ya Liu, Jiangling Liu, Heqing Shen","doi":"10.1002/ijgo.16036","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This study sought to develop a multifactorial predictive model for preterm birth risk, with the goal of providing clinical practitioners with early prevention.</p><p><strong>Methods: </strong>This retrospective cohort study utilized 2022 and 2018 National Vital Statistics System (NVSS) birth data, with the 2022 cohort arbitrarily split into training (70%) and internal verification (30%) subsets, and the 2018 cohort for external validation. Four machine learning algorithms-logistic regression, adaptive lasso regression, bootstrap forest, and boosted trees-identified features associated with preterm birth. The study then integrated the consensus features identified across the four models to construct a logistic regression-based preterm birth prediction nomogram. To evaluate the model's efficacy, calibration, receiver operating characteristic (ROC), and decision curve analysis were applied to both the internal and external validation sets.</p><p><strong>Results: </strong>The study included 2 567 040 mother-infant pairs from the 2022 cohort and 2 688 568 mother-infant pairs from the 2018 cohort. All four machine learning models demonstrated high accuracy (area under the curve [AUC] >0.7) in predicting preterm birth, and the internal validation results indicated good model generalizability. Feature selection identified nine common risk factors associated with preterm birth. The prediction nomogram based on these nine common features achieved AUCs of 0.701, 0.702, and 0.704 in the training, internal validation, and external validation sets, respectively. The calibration curves showed good agreement, and the decision curve analysis confirmed the model's net clinical benefits.</p><p><strong>Conclusion: </strong>This study developed a reliable preterm birth prediction tool using large-scale birth cohort data, filling the gap of lacking external validation for existing preterm birth prediction models.</p>","PeriodicalId":14164,"journal":{"name":"International Journal of Gynecology & Obstetrics","volume":" ","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Gynecology & Obstetrics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/ijgo.16036","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: This study sought to develop a multifactorial predictive model for preterm birth risk, with the goal of providing clinical practitioners with early prevention.

Methods: This retrospective cohort study utilized 2022 and 2018 National Vital Statistics System (NVSS) birth data, with the 2022 cohort arbitrarily split into training (70%) and internal verification (30%) subsets, and the 2018 cohort for external validation. Four machine learning algorithms-logistic regression, adaptive lasso regression, bootstrap forest, and boosted trees-identified features associated with preterm birth. The study then integrated the consensus features identified across the four models to construct a logistic regression-based preterm birth prediction nomogram. To evaluate the model's efficacy, calibration, receiver operating characteristic (ROC), and decision curve analysis were applied to both the internal and external validation sets.

Results: The study included 2 567 040 mother-infant pairs from the 2022 cohort and 2 688 568 mother-infant pairs from the 2018 cohort. All four machine learning models demonstrated high accuracy (area under the curve [AUC] >0.7) in predicting preterm birth, and the internal validation results indicated good model generalizability. Feature selection identified nine common risk factors associated with preterm birth. The prediction nomogram based on these nine common features achieved AUCs of 0.701, 0.702, and 0.704 in the training, internal validation, and external validation sets, respectively. The calibration curves showed good agreement, and the decision curve analysis confirmed the model's net clinical benefits.

Conclusion: This study developed a reliable preterm birth prediction tool using large-scale birth cohort data, filling the gap of lacking external validation for existing preterm birth prediction models.

基于机器学习模型的早产预测和临床提名图:一项大型回顾性队列研究。
研究目的本研究旨在开发早产风险的多因素预测模型,目的是为临床从业人员提供早期预防:这项回顾性队列研究利用了 2022 年和 2018 年美国国家人口动态统计系统(NVSS)的出生数据,其中 2022 年队列被任意分为训练子集(70%)和内部验证子集(30%),2018 年队列则用于外部验证。四种机器学习算法--逻辑回归、自适应套索回归、引导森林和助推树--识别出了与早产相关的特征。然后,该研究整合了四个模型所识别的共识特征,构建了基于逻辑回归的早产预测提名图。为了评估该模型的有效性,对内部和外部验证集进行了校准、接收器操作特征(ROC)和决策曲线分析:研究纳入了 2022 年队列中的 2 567 040 对母婴和 2018 年队列中的 2 688 568 对母婴。所有四个机器学习模型在预测早产方面都表现出较高的准确性(曲线下面积 [AUC] >0.7),内部验证结果表明模型具有良好的普适性。特征选择确定了与早产相关的九个常见风险因素。基于这九个常见特征的预测提名图在训练集、内部验证集和外部验证集上的 AUC 分别为 0.701、0.702 和 0.704。校准曲线显示出良好的一致性,决策曲线分析证实了该模型的临床净效益:本研究利用大规模出生队列数据开发了一种可靠的早产预测工具,填补了现有早产预测模型缺乏外部验证的空白。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.80
自引率
2.60%
发文量
493
审稿时长
3-6 weeks
期刊介绍: The International Journal of Gynecology & Obstetrics publishes articles on all aspects of basic and clinical research in the fields of obstetrics and gynecology and related subjects, with emphasis on matters of worldwide interest.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信