基于机器学习的多囊卵巢综合征IVF/ICSI患者新鲜胚胎移植后活产预测模型的构建与评价

IF 3.8 3区 医学 Q1 REPRODUCTIVE BIOLOGY
Suqin Zhu, Zhiqing Huang, Xiaojing Chen, Wenwen Jiang, Yuan Zhou, Beihong Zheng, Yan Sun
{"title":"基于机器学习的多囊卵巢综合征IVF/ICSI患者新鲜胚胎移植后活产预测模型的构建与评价","authors":"Suqin Zhu, Zhiqing Huang, Xiaojing Chen, Wenwen Jiang, Yuan Zhou, Beihong Zheng, Yan Sun","doi":"10.1186/s13048-025-01654-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To investigate the determinants affecting live birth outcomes in fresh embryo transfer among polycystic ovary syndrome (PCOS) patients using various machine learning (ML) algorithms and to construct predictive models, offering novel insights for enhancing live birth rates in this specific group.</p><p><strong>Methods: </strong>A sum of 1,062 fresh embryo transfer cycles involving PCOS patients were analyzed, with 466 resulting in live births. The dataset was split randomly into training and testing subsets at a 7:3 ratio. Least absolute shrinkage and selection operator and recursive feature elimination methods were utilized for feature selection within the training data. A grid search strategy identified the optimal parameters for seven ML models: decision tree (DT), K-nearest neighbors (KNN), light gradient boosting machine (LightGBM), naive Bayes model(NBM), random forest (RF), support vector machine (SVM) and extreme gradient boosting (XGBoost). The evaluation of model effectiveness incorporated diverse metrics, encompassing area under the curve (AUC), accuracy, positive predictive value, negative predictive value, F1 score, and Brier score. Calibration curves and decision curve analysis were employed to ascertain the optimal model. Furthermore, Shapley additive explanations were applied to elucidate the importance of predictor variables in the top-performing model.</p><p><strong>Results: </strong>The AUC values of DT, KNN, LightGBM, NBM, RF, SVM and XGBoost models in the training set were 0.813, 1.000, 0.724, 0.791, 1.000, 0.819 and 0.853, respectively. Corresponding values in the testing set were 0.773, 0.719, 0.705, 0.764, 0.794, 0.806 and 0.822. XGBoost emerged as the most effective ML model. SHAP analysis revealed that variables encompassing embryo transfer count, embryo type, maternal age, infertility duration, body mass index, serum testosterone (T) levels, and progesterone (P) levels on the day of human chorionic gonadotropin administration were pivotal predictors of live birth outcomes in individuals with PCOS receiving fresh embryo transfer.</p><p><strong>Conclusion: </strong>This study developed a live birth prediction model tailored for PCOS fresh embryo transfer cycles, leveraging ML algorithms to compare the efficacy of multiple models. The XGBoost model demonstrated superior predictive capacity, enabling prompt and precise identification of critical risk factors influencing live birth outcomes in PCOS patients. These findings offer actionable insights for clinical intervention, guiding strategies to improve pregnancy outcomes in this population.</p><p><strong>Clinical trial number: </strong>Not applicable.</p>","PeriodicalId":16610,"journal":{"name":"Journal of Ovarian Research","volume":"18 1","pages":"70"},"PeriodicalIF":3.8000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11969817/pdf/","citationCount":"0","resultStr":"{\"title\":\"Construction and evaluation of machine learning-based prediction model for live birth following fresh embryo transfer in IVF/ICSI patients with polycystic ovary syndrome.\",\"authors\":\"Suqin Zhu, Zhiqing Huang, Xiaojing Chen, Wenwen Jiang, Yuan Zhou, Beihong Zheng, Yan Sun\",\"doi\":\"10.1186/s13048-025-01654-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>To investigate the determinants affecting live birth outcomes in fresh embryo transfer among polycystic ovary syndrome (PCOS) patients using various machine learning (ML) algorithms and to construct predictive models, offering novel insights for enhancing live birth rates in this specific group.</p><p><strong>Methods: </strong>A sum of 1,062 fresh embryo transfer cycles involving PCOS patients were analyzed, with 466 resulting in live births. The dataset was split randomly into training and testing subsets at a 7:3 ratio. Least absolute shrinkage and selection operator and recursive feature elimination methods were utilized for feature selection within the training data. A grid search strategy identified the optimal parameters for seven ML models: decision tree (DT), K-nearest neighbors (KNN), light gradient boosting machine (LightGBM), naive Bayes model(NBM), random forest (RF), support vector machine (SVM) and extreme gradient boosting (XGBoost). The evaluation of model effectiveness incorporated diverse metrics, encompassing area under the curve (AUC), accuracy, positive predictive value, negative predictive value, F1 score, and Brier score. Calibration curves and decision curve analysis were employed to ascertain the optimal model. Furthermore, Shapley additive explanations were applied to elucidate the importance of predictor variables in the top-performing model.</p><p><strong>Results: </strong>The AUC values of DT, KNN, LightGBM, NBM, RF, SVM and XGBoost models in the training set were 0.813, 1.000, 0.724, 0.791, 1.000, 0.819 and 0.853, respectively. Corresponding values in the testing set were 0.773, 0.719, 0.705, 0.764, 0.794, 0.806 and 0.822. XGBoost emerged as the most effective ML model. SHAP analysis revealed that variables encompassing embryo transfer count, embryo type, maternal age, infertility duration, body mass index, serum testosterone (T) levels, and progesterone (P) levels on the day of human chorionic gonadotropin administration were pivotal predictors of live birth outcomes in individuals with PCOS receiving fresh embryo transfer.</p><p><strong>Conclusion: </strong>This study developed a live birth prediction model tailored for PCOS fresh embryo transfer cycles, leveraging ML algorithms to compare the efficacy of multiple models. The XGBoost model demonstrated superior predictive capacity, enabling prompt and precise identification of critical risk factors influencing live birth outcomes in PCOS patients. These findings offer actionable insights for clinical intervention, guiding strategies to improve pregnancy outcomes in this population.</p><p><strong>Clinical trial number: </strong>Not applicable.</p>\",\"PeriodicalId\":16610,\"journal\":{\"name\":\"Journal of Ovarian Research\",\"volume\":\"18 1\",\"pages\":\"70\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-04-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11969817/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Ovarian Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s13048-025-01654-x\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"REPRODUCTIVE BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Ovarian Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13048-025-01654-x","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REPRODUCTIVE BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

目的:利用各种机器学习(ML)算法探讨影响多囊卵巢综合征(PCOS)患者新鲜胚胎移植活产结局的因素,并构建预测模型,为提高该特定群体的活产率提供新的见解。方法:对1062例PCOS患者的新鲜胚胎移植周期进行分析,其中466例活产。数据集以7:3的比例随机分为训练子集和测试子集。利用最小绝对收缩和选择算子以及递归特征消除方法对训练数据进行特征选择。网格搜索策略确定了7种ML模型的最优参数:决策树(DT)、k近邻(KNN)、轻梯度增强机(LightGBM)、朴素贝叶斯模型(NBM)、随机森林(RF)、支持向量机(SVM)和极端梯度增强(XGBoost)。模型有效性评价采用多种指标,包括曲线下面积(AUC)、准确性、阳性预测值、阴性预测值、F1评分和Brier评分。采用标定曲线和决策曲线分析确定了最优模型。此外,运用Shapley加性解释阐明预测变量在最佳表现模型中的重要性。结果:DT、KNN、LightGBM、NBM、RF、SVM和XGBoost模型在训练集中的AUC值分别为0.813、1.000、0.724、0.791、1.000、0.819和0.853。测试集中对应的值分别为0.773、0.719、0.705、0.764、0.794、0.806、0.822。XGBoost成为了最有效的ML模型。SHAP分析显示,胚胎移植数量、胚胎类型、母亲年龄、不孕持续时间、体重指数、血清睾酮(T)水平和人绒毛膜促性腺激素给药当日的孕酮(P)水平是接受新鲜胚胎移植的多囊卵巢综合征患者活产结局的关键预测因素。结论:本研究建立了适合PCOS新鲜胚胎移植周期的活产预测模型,利用ML算法比较多种模型的有效性。XGBoost模型显示出卓越的预测能力,能够快速准确地识别影响PCOS患者活产结局的关键危险因素。这些发现为临床干预提供了可行的见解,指导了改善这一人群妊娠结局的策略。临床试验号:不适用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Construction and evaluation of machine learning-based prediction model for live birth following fresh embryo transfer in IVF/ICSI patients with polycystic ovary syndrome.

Objective: To investigate the determinants affecting live birth outcomes in fresh embryo transfer among polycystic ovary syndrome (PCOS) patients using various machine learning (ML) algorithms and to construct predictive models, offering novel insights for enhancing live birth rates in this specific group.

Methods: A sum of 1,062 fresh embryo transfer cycles involving PCOS patients were analyzed, with 466 resulting in live births. The dataset was split randomly into training and testing subsets at a 7:3 ratio. Least absolute shrinkage and selection operator and recursive feature elimination methods were utilized for feature selection within the training data. A grid search strategy identified the optimal parameters for seven ML models: decision tree (DT), K-nearest neighbors (KNN), light gradient boosting machine (LightGBM), naive Bayes model(NBM), random forest (RF), support vector machine (SVM) and extreme gradient boosting (XGBoost). The evaluation of model effectiveness incorporated diverse metrics, encompassing area under the curve (AUC), accuracy, positive predictive value, negative predictive value, F1 score, and Brier score. Calibration curves and decision curve analysis were employed to ascertain the optimal model. Furthermore, Shapley additive explanations were applied to elucidate the importance of predictor variables in the top-performing model.

Results: The AUC values of DT, KNN, LightGBM, NBM, RF, SVM and XGBoost models in the training set were 0.813, 1.000, 0.724, 0.791, 1.000, 0.819 and 0.853, respectively. Corresponding values in the testing set were 0.773, 0.719, 0.705, 0.764, 0.794, 0.806 and 0.822. XGBoost emerged as the most effective ML model. SHAP analysis revealed that variables encompassing embryo transfer count, embryo type, maternal age, infertility duration, body mass index, serum testosterone (T) levels, and progesterone (P) levels on the day of human chorionic gonadotropin administration were pivotal predictors of live birth outcomes in individuals with PCOS receiving fresh embryo transfer.

Conclusion: This study developed a live birth prediction model tailored for PCOS fresh embryo transfer cycles, leveraging ML algorithms to compare the efficacy of multiple models. The XGBoost model demonstrated superior predictive capacity, enabling prompt and precise identification of critical risk factors influencing live birth outcomes in PCOS patients. These findings offer actionable insights for clinical intervention, guiding strategies to improve pregnancy outcomes in this population.

Clinical trial number: Not applicable.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Ovarian Research
Journal of Ovarian Research REPRODUCTIVE BIOLOGY-
CiteScore
6.20
自引率
2.50%
发文量
125
审稿时长
>12 weeks
期刊介绍: Journal of Ovarian Research is an open access, peer reviewed, online journal that aims to provide a forum for high-quality basic and clinical research on ovarian function, abnormalities, and cancer. The journal focuses on research that provides new insights into ovarian functions as well as prevention and treatment of diseases afflicting the organ. Topical areas include, but are not restricted to: Ovary development, hormone secretion and regulation Follicle growth and ovulation Infertility and Polycystic ovarian syndrome Regulation of pituitary and other biological functions by ovarian hormones Ovarian cancer, its prevention, diagnosis and treatment Drug development and screening Role of stem cells in ovary development and function.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信