Chunfei Hu , Hongmei Lin , Yupin Xu , Xukun Fu , Xiaojing Qiu , Siqian Hu , Tong Jin , Hualin Xu , Qiong Luo
{"title":"Development and application of a machine learning-based antenatal depression prediction model","authors":"Chunfei Hu , Hongmei Lin , Yupin Xu , Xukun Fu , Xiaojing Qiu , Siqian Hu , Tong Jin , Hualin Xu , Qiong Luo","doi":"10.1016/j.jad.2025.01.099","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Antenatal depression (AND), occurring during pregnancy, is associated with severe outcomes. However, there is a lack of objective and universally applicable prediction methods for AND in clinical practice. We leveraged sociodemographic and pregnancy-related data to develop and validate a machine learning-based AND prediction model.</div></div><div><h3>Methods</h3><div>Data from 20,950 pregnant women form 3 hospitals were used and divided into training and test sets. AND was defined as an EPDS score of 10 or above. Using machine learning, we selected 34 characteristic variables and divided them into three categories based on clinical practice: Base Variables, General Variables, and Obstetric Variables. Based on this classification, we constructed four different AND random forest prediction models: the Base Model, the Base+General Model, the Base+Obstetric Model, and the Full Model.</div></div><div><h3>Results</h3><div>The AUC range in the test set was 0.687–0.710. The Base+General Model achieved the best performance with an AUC of 0.710 (95 % CI: 0.693–0.710) in predicting AND risk during the late pregnancy period. The AUC of the Base Model was only 0.022 lower than that of the top-performing model, indicating its solid foundation for early AND screening.</div></div><div><h3>Limitations</h3><div>We have only analyzed the dataset from two eastern cities, and have not yet validated our models in an external dataset.</div></div><div><h3>Conclusions</h3><div>Machine learning-based prediction models offer the capability to anticipate the risk of AND across different pregnancy stages. This enables the earlier and more accurate identification of pregnant women who may be at risk, facilitating timely interventions for improving outcomes for both mothers and their offspring.</div></div>","PeriodicalId":14963,"journal":{"name":"Journal of affective disorders","volume":"375 ","pages":"Pages 137-147"},"PeriodicalIF":4.9000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of affective disorders","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016503272500117X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Antenatal depression (AND), occurring during pregnancy, is associated with severe outcomes. However, there is a lack of objective and universally applicable prediction methods for AND in clinical practice. We leveraged sociodemographic and pregnancy-related data to develop and validate a machine learning-based AND prediction model.
Methods
Data from 20,950 pregnant women form 3 hospitals were used and divided into training and test sets. AND was defined as an EPDS score of 10 or above. Using machine learning, we selected 34 characteristic variables and divided them into three categories based on clinical practice: Base Variables, General Variables, and Obstetric Variables. Based on this classification, we constructed four different AND random forest prediction models: the Base Model, the Base+General Model, the Base+Obstetric Model, and the Full Model.
Results
The AUC range in the test set was 0.687–0.710. The Base+General Model achieved the best performance with an AUC of 0.710 (95 % CI: 0.693–0.710) in predicting AND risk during the late pregnancy period. The AUC of the Base Model was only 0.022 lower than that of the top-performing model, indicating its solid foundation for early AND screening.
Limitations
We have only analyzed the dataset from two eastern cities, and have not yet validated our models in an external dataset.
Conclusions
Machine learning-based prediction models offer the capability to anticipate the risk of AND across different pregnancy stages. This enables the earlier and more accurate identification of pregnant women who may be at risk, facilitating timely interventions for improving outcomes for both mothers and their offspring.
期刊介绍:
The Journal of Affective Disorders publishes papers concerned with affective disorders in the widest sense: depression, mania, mood spectrum, emotions and personality, anxiety and stress. It is interdisciplinary and aims to bring together different approaches for a diverse readership. Top quality papers will be accepted dealing with any aspect of affective disorders, including neuroimaging, cognitive neurosciences, genetics, molecular biology, experimental and clinical neurosciences, pharmacology, neuroimmunoendocrinology, intervention and treatment trials.