Yuehong Ding, Lanqing Qu, Jinbiao Zhang, Tiezheng Li, Yunyan Wang, Weize Xu, Die Li
{"title":"Parental health at preconception and gestational age at birth: evidence from a population-based cohort using double machine learning.","authors":"Yuehong Ding, Lanqing Qu, Jinbiao Zhang, Tiezheng Li, Yunyan Wang, Weize Xu, Die Li","doi":"10.1136/wjps-2025-001078","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Gestational age at birth is shaped by complex maternal and paternal factors prior to conception, yet few studies have integrated multidimensional clinical indicators from both parents to estimate their quantitative effects.</p><p><strong>Methods: </strong>Using a large retrospective cohort with routinely collected preconception health data from both mothers and fathers, we applied a double machine learning (DML) framework combining Lasso, Random Forest and XGBoost regressors to estimate associations between parental biomarkers and gestational age at birth. Model interpretability was enhanced through Shapley Additive Explanations (SHAP) analysis, stratified interaction testing and estimation of individualized treatment effects (ITE).</p><p><strong>Results: </strong>Elevated maternal fasting glucose, alanine aminotransferase, platelet count and anti-hepatitis B core seropositivity were consistently associated with shortened gestational age at birth across all DML models. For instance, a 10 mmol/L increase in fasting glucose corresponded to 2.8-4.9 days shorter gestation. Paternal Treponema pallidum seropositivity and increased monocyte proportion, defined as the fraction of monocytes within total circulating white blood cells, also demonstrated significant associations, with the former linked to gestational shortening and the latter to modest extension. Stratified and interaction analyses revealed that paternal immune markers modified the associations of maternal metabolic indicators with gestational age at birth. SHAP-based interpretation confirmed model consistency and ITE analysis indicated marked heterogeneity, particularly for fasting glucose.</p><p><strong>Conclusions: </strong>This study demonstrates the utility of interpretable DML methods for quantifying the effects of multidimensional preconception health indicators from both parents on gestational age at birth. Our findings support integrating maternal and paternal screening into preconception risk assessments to enable early, targeted prevention of preterm birth. Given the established links between preterm birth and neonatal surgical conditions, early risk identification via parental screening may help inform perinatal care strategies and optimize resource allocation in pediatric surgery.</p>","PeriodicalId":23823,"journal":{"name":"World Journal of Pediatric Surgery","volume":"8 4","pages":"e001078"},"PeriodicalIF":1.3000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12481323/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Journal of Pediatric Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1136/wjps-2025-001078","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q4","JCRName":"PEDIATRICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Gestational age at birth is shaped by complex maternal and paternal factors prior to conception, yet few studies have integrated multidimensional clinical indicators from both parents to estimate their quantitative effects.
Methods: Using a large retrospective cohort with routinely collected preconception health data from both mothers and fathers, we applied a double machine learning (DML) framework combining Lasso, Random Forest and XGBoost regressors to estimate associations between parental biomarkers and gestational age at birth. Model interpretability was enhanced through Shapley Additive Explanations (SHAP) analysis, stratified interaction testing and estimation of individualized treatment effects (ITE).
Results: Elevated maternal fasting glucose, alanine aminotransferase, platelet count and anti-hepatitis B core seropositivity were consistently associated with shortened gestational age at birth across all DML models. For instance, a 10 mmol/L increase in fasting glucose corresponded to 2.8-4.9 days shorter gestation. Paternal Treponema pallidum seropositivity and increased monocyte proportion, defined as the fraction of monocytes within total circulating white blood cells, also demonstrated significant associations, with the former linked to gestational shortening and the latter to modest extension. Stratified and interaction analyses revealed that paternal immune markers modified the associations of maternal metabolic indicators with gestational age at birth. SHAP-based interpretation confirmed model consistency and ITE analysis indicated marked heterogeneity, particularly for fasting glucose.
Conclusions: This study demonstrates the utility of interpretable DML methods for quantifying the effects of multidimensional preconception health indicators from both parents on gestational age at birth. Our findings support integrating maternal and paternal screening into preconception risk assessments to enable early, targeted prevention of preterm birth. Given the established links between preterm birth and neonatal surgical conditions, early risk identification via parental screening may help inform perinatal care strategies and optimize resource allocation in pediatric surgery.