{"title":"Machine learning-based prediction of preterm birth risk using methylation changes in neonatal cord blood CpG sites.","authors":"Yuxin Feng, Ying Ni, Wenkai Wang, Fen Guo, Liyu Wang, Fan Zhu, Luyao Zhang, Ying Feng","doi":"10.1186/s12884-025-07884-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Preterm birth, defined as delivery before 37 weeks of gestation, is a major cause of neonatal morbidity and mortality. DNA methylation changes at CpG sites have been associated with the risk of preterm birth.</p><p><strong>Objective: </strong>This study aimed to identify differential CpG sites in cord blood and develop predictive machine learning models based on these methylation changes to assess preterm birth risk.</p><p><strong>Methods: </strong>Methylome data from 110 neonatal cord blood samples in the GSE110828 dataset were analyzed to identify CpG sites differing between preterm and full-term births (88 for training, and 22 for testing, respectively). Key CpG sites were selected using Lasso, Elastic Net, and Random Forest. Forty-five predictive models were constructed and evaluated for accuracy, precision, recall, and F1 score.</p><p><strong>Results: </strong>Sixty-six CpG sites showed significant differences between preterm and full-term groups. Four models, including Random Forest with Lasso and Gradient Boosting with Random Forest, achieved optimal predictive performance, each with a validation accuracy of 93.75%.</p><p><strong>Conclusion: </strong>DNA methylation changes at CpG sites in cord blood are associated with preterm birth risk. CpG-based methylation models demonstrate high predictive accuracy and hold promise for early clinical risk assessment.</p>","PeriodicalId":9033,"journal":{"name":"BMC Pregnancy and Childbirth","volume":"25 1","pages":"784"},"PeriodicalIF":2.7000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12285009/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Pregnancy and Childbirth","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12884-025-07884-7","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Preterm birth, defined as delivery before 37 weeks of gestation, is a major cause of neonatal morbidity and mortality. DNA methylation changes at CpG sites have been associated with the risk of preterm birth.
Objective: This study aimed to identify differential CpG sites in cord blood and develop predictive machine learning models based on these methylation changes to assess preterm birth risk.
Methods: Methylome data from 110 neonatal cord blood samples in the GSE110828 dataset were analyzed to identify CpG sites differing between preterm and full-term births (88 for training, and 22 for testing, respectively). Key CpG sites were selected using Lasso, Elastic Net, and Random Forest. Forty-five predictive models were constructed and evaluated for accuracy, precision, recall, and F1 score.
Results: Sixty-six CpG sites showed significant differences between preterm and full-term groups. Four models, including Random Forest with Lasso and Gradient Boosting with Random Forest, achieved optimal predictive performance, each with a validation accuracy of 93.75%.
Conclusion: DNA methylation changes at CpG sites in cord blood are associated with preterm birth risk. CpG-based methylation models demonstrate high predictive accuracy and hold promise for early clinical risk assessment.
期刊介绍:
BMC Pregnancy & Childbirth is an open access, peer-reviewed journal that considers articles on all aspects of pregnancy and childbirth. The journal welcomes submissions on the biomedical aspects of pregnancy, breastfeeding, labor, maternal health, maternity care, trends and sociological aspects of pregnancy and childbirth.