应用机器学习预测撒哈拉以南非洲妇女的延迟生育能力。

IF 3.4 Q2 REPRODUCTIVE BIOLOGY
Reproduction & fertility Pub Date : 2025-10-07 Print Date: 2025-10-01 DOI:10.1530/RAF-25-0068
Meron Asmamaw Alemayehu, Nebiyu Mekonnen Derseh, Tigist Kifle Tsegaw, Tilahun Yemanu Birhan, Banchlay Addis, Berhanie Addis Ayele, Emebet Birhanu Lealem, Eyob Akalewold Alemu, Fetlework Gubena Arage, Gebrie Getu Alemu, Getaneh Awoke Yismaw, Habtamu Abebe Getahun, Habtamu Wagnew Abuhay, Mekuriaw Nibret Aweke
{"title":"应用机器学习预测撒哈拉以南非洲妇女的延迟生育能力。","authors":"Meron Asmamaw Alemayehu, Nebiyu Mekonnen Derseh, Tigist Kifle Tsegaw, Tilahun Yemanu Birhan, Banchlay Addis, Berhanie Addis Ayele, Emebet Birhanu Lealem, Eyob Akalewold Alemu, Fetlework Gubena Arage, Gebrie Getu Alemu, Getaneh Awoke Yismaw, Habtamu Abebe Getahun, Habtamu Wagnew Abuhay, Mekuriaw Nibret Aweke","doi":"10.1530/RAF-25-0068","DOIUrl":null,"url":null,"abstract":"<p><strong>Graphical abstract: </strong></p><p><strong>Abstract: </strong>Delayed fecundability, defined as trying to conceive for ≥12 months without success, is a growing global concern due to the threat of fertility rates falling below the replacement level. This study aimed to predict delayed fecundability and identify influential predictors. Secondary data from recent Performance Monitoring for Action (PMA) surveys on fertility, contraception, and reproductive health in five sub-Saharan African countries were used. Preprocessing and feature engineering included imputation, encoding, and correlation filtering. Feature selection was done using the Boruta algorithm. Machine learning models, including random forest, XGBoost, and LightGBM, were developed and optimized via grid search with cross-validation. Models were compared using default hyperparameters. Interpretability was enhanced through SHapley Additive exPlanations (SHAP) plots, and heterogeneity was explored with subgroup SHAP analysis to identify context-specific predictor effects. Delayed fecundability was present in 31.01% of women. Grid search optimization improved model performance, with random forest achieving the highest accuracy (79.2%) and AUC (0.94). SHAP analysis identified key predictors, including age 36-49 (0.211), being married (0.208), ovulation-inducing treatment (0.173), and herbal remedy use (0.118). Subgroup SHAP analysis revealed heterogeneity: younger age reduced risk in 15-25-year-olds, fertility treatment history was the main risk driver in treated women, and marital status and childbirth had variable effects across subgroups. The random forest model best predicted delayed fecundability, with age, marital status, and treatment history as key predictors. Subgroup SHAP analysis revealed risk patterns across populations. Targeted screening and tailored fertility counseling, especially for couples with prior fertility treatments, are recommended to support timely conception.</p><p><strong>Lay summary: </strong>Many women struggle to get pregnant even after trying for a year or more, a condition called delayed fecundability. This issue is becoming more common worldwide and can signal problems with fertility. We used data from surveys in five African countries to find out which factors may predict this delay. Using computer models that can learn from data, we found that age, marital status, and past use of fertility treatments were strong predictors. Our best model correctly identified nearly 80% of women with delayed fecundability. To make the findings easy to understand, we used a method that explains how each factor influences the result. We also found that the effects of these factors vary by age and treatment history. Our results can help health workers identify women at higher risk earlier, especially in places where fertility services are limited, and provide them with better, more personalized care.</p>","PeriodicalId":101312,"journal":{"name":"Reproduction & fertility","volume":" ","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of machine learning to predict delayed fecundability among women in sub-Saharan Africa.\",\"authors\":\"Meron Asmamaw Alemayehu, Nebiyu Mekonnen Derseh, Tigist Kifle Tsegaw, Tilahun Yemanu Birhan, Banchlay Addis, Berhanie Addis Ayele, Emebet Birhanu Lealem, Eyob Akalewold Alemu, Fetlework Gubena Arage, Gebrie Getu Alemu, Getaneh Awoke Yismaw, Habtamu Abebe Getahun, Habtamu Wagnew Abuhay, Mekuriaw Nibret Aweke\",\"doi\":\"10.1530/RAF-25-0068\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Graphical abstract: </strong></p><p><strong>Abstract: </strong>Delayed fecundability, defined as trying to conceive for ≥12 months without success, is a growing global concern due to the threat of fertility rates falling below the replacement level. This study aimed to predict delayed fecundability and identify influential predictors. Secondary data from recent Performance Monitoring for Action (PMA) surveys on fertility, contraception, and reproductive health in five sub-Saharan African countries were used. Preprocessing and feature engineering included imputation, encoding, and correlation filtering. Feature selection was done using the Boruta algorithm. Machine learning models, including random forest, XGBoost, and LightGBM, were developed and optimized via grid search with cross-validation. Models were compared using default hyperparameters. Interpretability was enhanced through SHapley Additive exPlanations (SHAP) plots, and heterogeneity was explored with subgroup SHAP analysis to identify context-specific predictor effects. Delayed fecundability was present in 31.01% of women. Grid search optimization improved model performance, with random forest achieving the highest accuracy (79.2%) and AUC (0.94). SHAP analysis identified key predictors, including age 36-49 (0.211), being married (0.208), ovulation-inducing treatment (0.173), and herbal remedy use (0.118). Subgroup SHAP analysis revealed heterogeneity: younger age reduced risk in 15-25-year-olds, fertility treatment history was the main risk driver in treated women, and marital status and childbirth had variable effects across subgroups. The random forest model best predicted delayed fecundability, with age, marital status, and treatment history as key predictors. Subgroup SHAP analysis revealed risk patterns across populations. Targeted screening and tailored fertility counseling, especially for couples with prior fertility treatments, are recommended to support timely conception.</p><p><strong>Lay summary: </strong>Many women struggle to get pregnant even after trying for a year or more, a condition called delayed fecundability. This issue is becoming more common worldwide and can signal problems with fertility. We used data from surveys in five African countries to find out which factors may predict this delay. Using computer models that can learn from data, we found that age, marital status, and past use of fertility treatments were strong predictors. Our best model correctly identified nearly 80% of women with delayed fecundability. To make the findings easy to understand, we used a method that explains how each factor influences the result. We also found that the effects of these factors vary by age and treatment history. Our results can help health workers identify women at higher risk earlier, especially in places where fertility services are limited, and provide them with better, more personalized care.</p>\",\"PeriodicalId\":101312,\"journal\":{\"name\":\"Reproduction & fertility\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-10-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Reproduction & fertility\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1530/RAF-25-0068\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/10/1 0:00:00\",\"PubModel\":\"Print\",\"JCR\":\"Q2\",\"JCRName\":\"REPRODUCTIVE BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reproduction & fertility","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1530/RAF-25-0068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/10/1 0:00:00","PubModel":"Print","JCR":"Q2","JCRName":"REPRODUCTIVE BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

摘要:由于生育率低于更替水平的威胁,延迟生育能力日益成为全球关注的问题,延迟生育能力被定义为尝试怀孕≥12个月而未成功。本研究旨在预测延迟生育能力,并找出有影响的预测因子。本研究使用了最近在撒哈拉以南非洲5个国家开展的关于生育率、避孕和生殖健康的行动绩效监测(PMA)调查的二手数据。预处理和特征工程包括插值、编码和相关滤波。特征选择采用Boruta算法。通过交叉验证的网格搜索,开发和优化了Random Forest、XGBoost和LightGBM等机器学习模型。使用默认超参数比较模型。通过SHapley加性解释(SHAP)图增强可解释性,并通过亚组SHAP分析探讨异质性,以确定情境特定的预测效应。31.01%的女性存在延迟生育。网格搜索优化提高了模型性能,其中Random Forest达到了最高的准确率(79.2%)和AUC(0.94)。SHAP分析确定了关键预测因素,包括年龄36-49岁(0.211)、已婚(0.208)、促排卵治疗(0.173)和使用草药(0.118)。亚组SHAP分析显示了异质性:15-25岁人群中年龄越小风险越低,生育治疗史是治疗女性的主要风险驱动因素,婚姻状况和生育在不同亚组中有不同的影响。随机森林模型最能预测延迟生育能力,年龄、婚姻状况和治疗史是关键预测因素。亚组SHAP分析揭示了不同人群的风险模式。建议有针对性的筛查和量身定制的生育咨询,特别是对于先前接受过生育治疗的夫妇,以支持及时受孕。总结:许多女性在尝试了一年或更长时间后仍然难以怀孕,这种情况被称为生育延迟。这个问题在世界范围内变得越来越普遍,可能预示着生育问题。我们使用了来自五个非洲国家的调查数据来找出哪些因素可能导致这种延迟。使用可以从数据中学习的计算机模型,我们发现年龄、婚姻状况和过去使用的生育治疗是强有力的预测因素。我们最好的模型正确地识别了近80%的生育能力延迟的女性。为了使研究结果易于理解,我们使用了一种解释每个因素如何影响结果的方法。我们还发现,这些因素的影响因年龄和治疗史而异。我们的研究结果可以帮助卫生工作者更早地发现风险较高的妇女,特别是在生育服务有限的地方,并为她们提供更好、更个性化的护理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Application of machine learning to predict delayed fecundability among women in sub-Saharan Africa.

Graphical abstract:

Abstract: Delayed fecundability, defined as trying to conceive for ≥12 months without success, is a growing global concern due to the threat of fertility rates falling below the replacement level. This study aimed to predict delayed fecundability and identify influential predictors. Secondary data from recent Performance Monitoring for Action (PMA) surveys on fertility, contraception, and reproductive health in five sub-Saharan African countries were used. Preprocessing and feature engineering included imputation, encoding, and correlation filtering. Feature selection was done using the Boruta algorithm. Machine learning models, including random forest, XGBoost, and LightGBM, were developed and optimized via grid search with cross-validation. Models were compared using default hyperparameters. Interpretability was enhanced through SHapley Additive exPlanations (SHAP) plots, and heterogeneity was explored with subgroup SHAP analysis to identify context-specific predictor effects. Delayed fecundability was present in 31.01% of women. Grid search optimization improved model performance, with random forest achieving the highest accuracy (79.2%) and AUC (0.94). SHAP analysis identified key predictors, including age 36-49 (0.211), being married (0.208), ovulation-inducing treatment (0.173), and herbal remedy use (0.118). Subgroup SHAP analysis revealed heterogeneity: younger age reduced risk in 15-25-year-olds, fertility treatment history was the main risk driver in treated women, and marital status and childbirth had variable effects across subgroups. The random forest model best predicted delayed fecundability, with age, marital status, and treatment history as key predictors. Subgroup SHAP analysis revealed risk patterns across populations. Targeted screening and tailored fertility counseling, especially for couples with prior fertility treatments, are recommended to support timely conception.

Lay summary: Many women struggle to get pregnant even after trying for a year or more, a condition called delayed fecundability. This issue is becoming more common worldwide and can signal problems with fertility. We used data from surveys in five African countries to find out which factors may predict this delay. Using computer models that can learn from data, we found that age, marital status, and past use of fertility treatments were strong predictors. Our best model correctly identified nearly 80% of women with delayed fecundability. To make the findings easy to understand, we used a method that explains how each factor influences the result. We also found that the effects of these factors vary by age and treatment history. Our results can help health workers identify women at higher risk earlier, especially in places where fertility services are limited, and provide them with better, more personalized care.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.80
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信