Early identification of postpartum depression using machine learning

IF 5 3区 医学 Q1 CLINICAL NEUROLOGY
Yukako Nakamura, Taro Ueno, Nagahide Takahashi, Daisuke Ichikawa, Aya Yamauchi, Norio Ozaki
{"title":"Early identification of postpartum depression using machine learning","authors":"Yukako Nakamura, Taro Ueno, Nagahide Takahashi, Daisuke Ichikawa, Aya Yamauchi, Norio Ozaki","doi":"10.1111/pcn.13659","DOIUrl":null,"url":null,"abstract":"<p>During the perinatal period, the risk of developing depression is high and it is estimated that approximately 10%–15% of mothers experience perinatal depression.<span><sup>1</sup></span> In recent years, machine learning has been widely used in the research of mental health, and it has been suggested that machine learning could be useful in the clinical management of mental disorders by providing accurate predictions for the diagnosis, prognosis. If an effective postpartum depression (PPD) prediction model can be established, it will enable early identification of high-risk individuals and early intervention by healthcare providers in high-risk individuals.<span><sup>2</sup></span></p>\n<p>In this study, we used machine learning methods to construct a prediction model for depression in the first postpartum month using demographic information and subjective ratings of pregnant women collected from the time of pregnancy to the fifth postpartum day after delivery. A verbal and written explanation of the study was given to all participants, and written informed consent was obtained from all those who agreed to participate. The study protocol was approved by the Ethics Committee of the Nagoya University Graduate School of Medicine. Detailed methods are described in the Supplementary materials Appendix S1. The flowchart of the study procedures is shown in Fig. S1. 1559 women participated in the study and 1416 women responded to all 10 Edinburgh Postnatal Depression Scale items 1 month after delivery. In this study, we included these 1416 women (mean age 32.4 years, standard deviation ±4.6 years) in our machine learning. The flowchart of the recruitment process is shown in Fig. S2. We show the method details in Appendix S1 and comparison of predictors between the PPD group and the non PPD group in Table S1. We also show missing percentages for each item in Table S3.</p>\n<p>We used a machine learning approach, logistic regression, decision tree, gradient-boosting decision tree (GBDT), and balanced GBDT to develop the PPD prediction model. GBDT gives a predictive model as an ensemble of decision tree and achieves high predictive ability with a differentiable loss function. Early prediction models for PPD need to be sensitive so as not to overlook women at high risk for PPD. Therefore, we set sensitivity to 80% and built the model using the machine learning approach. We calculated a 95% confidence interval for the AUC to confirm the predictive power of the model used. The results are shown in Table S2. The high accuracy (73.11%), high specificity (71.3%) and high positive predictive value (42.2%) was obtained in the balanced GBDT, and the high AUC value (0.8285) was obtained from the logistic regression model. Therefore, we thought that the balanced GBDT model would be the best. Figure 1 shows the feature value with high importance and AUC when using the balanced GBDT. In addition, we used the 28 features shown in Fig. 1 as predictors and built a predictive model again using the balanced GBDT. From the results, we were able to build a model with high accuracy and good sensitivity and specificity (accuracy 78.6%, AUC 0.83, sensitivity 71.1%, specificity 80.6%, positive predictive value 48.9%, negative predictive value 91.4%).</p>\n<figure><picture>\n<source media=\"(min-width: 1650px)\" srcset=\"/cms/asset/f5f142c6-3d77-4655-b7b8-ac28a7d0d2c3/pcn13659-fig-0001-m.jpg\"/><img alt=\"Details are in the caption following the image\" data-lg-src=\"/cms/asset/f5f142c6-3d77-4655-b7b8-ac28a7d0d2c3/pcn13659-fig-0001-m.jpg\" loading=\"lazy\" src=\"/cms/asset/849d12b2-ec3c-4527-8ec7-1edc3795a320/pcn13659-fig-0001-m.png\" title=\"Details are in the caption following the image\"/></picture><figcaption>\n<div><strong>Fig. 1<span style=\"font-weight:normal\"></span></strong><div>Open in figure viewer<i aria-hidden=\"true\"></i><span>PowerPoint</span></div>\n</div>\n<div>Ranked importance of predicters and AUC using the balanced GBDT (<i>n</i> = 1416). (a) Importance of predicters. (b) Receiver-operating characteristic curve for predicting PPD based on the optimal predictive model developed using the balanced GBDT model. Area under the curve for PPD is 0.82. EPDS, Edinburgh postnatal depression scale; Fa, father; GBDT, gradient-boosting decision tree; HA, Harm avoidance; MB, Stein's Scale; Mother-Infant Bonding Questionnaire, MB_B,C,M,G, the item name of MB scale, number after the scale is each item number of the scales; NP, number of persons; PBI, parental bonding instrument; PPD, postpartum depression; SSQ, social support questionnaire.</div>\n</figcaption>\n</figure>\n<p>In this study, we used machine learning technique and successfully constructed a highly accurate and convenient model for predicting depression in the first month after childbirth using data from pregnant women. In our model, we found that depression during pregnancy and immediately after delivery, maternity blues, age, parity, temperament, and other factors were important predictors. The prevalence of PPD is about 10%–15%, and the proportion of positives is smaller than the proportion of negatives. Therefore, the positive predictive value (PPV) tends to be low, as in previous studies<span><sup>3, 4</sup></span> and this study. Future studies will need to develop a prediction tool with high PPV. In addition to the data used as predictors in this study, previous studies using machine learning reported that any history of depression is an important factor.<span><sup>3</sup></span> In our previous studies, we also confirmed that a history of depression<span><sup>5</sup></span> is a risk factor for PPD. However, since the number of responses to this item of our study was still small, we could not add it to this analysis. We think that it is important to try to build a model that includes this item to the predictor in the future.</p>\n<p>We were able to build a simple and useful model for predicting PPD by using machine learning techniques. In the future, it will be important to build a more accurate model by adding data on the history of depression to the predictor.</p>","PeriodicalId":20938,"journal":{"name":"Psychiatry and Clinical Neurosciences","volume":"66 1","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2024-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychiatry and Clinical Neurosciences","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/pcn.13659","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

During the perinatal period, the risk of developing depression is high and it is estimated that approximately 10%–15% of mothers experience perinatal depression.1 In recent years, machine learning has been widely used in the research of mental health, and it has been suggested that machine learning could be useful in the clinical management of mental disorders by providing accurate predictions for the diagnosis, prognosis. If an effective postpartum depression (PPD) prediction model can be established, it will enable early identification of high-risk individuals and early intervention by healthcare providers in high-risk individuals.2

In this study, we used machine learning methods to construct a prediction model for depression in the first postpartum month using demographic information and subjective ratings of pregnant women collected from the time of pregnancy to the fifth postpartum day after delivery. A verbal and written explanation of the study was given to all participants, and written informed consent was obtained from all those who agreed to participate. The study protocol was approved by the Ethics Committee of the Nagoya University Graduate School of Medicine. Detailed methods are described in the Supplementary materials Appendix S1. The flowchart of the study procedures is shown in Fig. S1. 1559 women participated in the study and 1416 women responded to all 10 Edinburgh Postnatal Depression Scale items 1 month after delivery. In this study, we included these 1416 women (mean age 32.4 years, standard deviation ±4.6 years) in our machine learning. The flowchart of the recruitment process is shown in Fig. S2. We show the method details in Appendix S1 and comparison of predictors between the PPD group and the non PPD group in Table S1. We also show missing percentages for each item in Table S3.

We used a machine learning approach, logistic regression, decision tree, gradient-boosting decision tree (GBDT), and balanced GBDT to develop the PPD prediction model. GBDT gives a predictive model as an ensemble of decision tree and achieves high predictive ability with a differentiable loss function. Early prediction models for PPD need to be sensitive so as not to overlook women at high risk for PPD. Therefore, we set sensitivity to 80% and built the model using the machine learning approach. We calculated a 95% confidence interval for the AUC to confirm the predictive power of the model used. The results are shown in Table S2. The high accuracy (73.11%), high specificity (71.3%) and high positive predictive value (42.2%) was obtained in the balanced GBDT, and the high AUC value (0.8285) was obtained from the logistic regression model. Therefore, we thought that the balanced GBDT model would be the best. Figure 1 shows the feature value with high importance and AUC when using the balanced GBDT. In addition, we used the 28 features shown in Fig. 1 as predictors and built a predictive model again using the balanced GBDT. From the results, we were able to build a model with high accuracy and good sensitivity and specificity (accuracy 78.6%, AUC 0.83, sensitivity 71.1%, specificity 80.6%, positive predictive value 48.9%, negative predictive value 91.4%).

Abstract Image
Fig. 1
Open in figure viewerPowerPoint
Ranked importance of predicters and AUC using the balanced GBDT (n = 1416). (a) Importance of predicters. (b) Receiver-operating characteristic curve for predicting PPD based on the optimal predictive model developed using the balanced GBDT model. Area under the curve for PPD is 0.82. EPDS, Edinburgh postnatal depression scale; Fa, father; GBDT, gradient-boosting decision tree; HA, Harm avoidance; MB, Stein's Scale; Mother-Infant Bonding Questionnaire, MB_B,C,M,G, the item name of MB scale, number after the scale is each item number of the scales; NP, number of persons; PBI, parental bonding instrument; PPD, postpartum depression; SSQ, social support questionnaire.

In this study, we used machine learning technique and successfully constructed a highly accurate and convenient model for predicting depression in the first month after childbirth using data from pregnant women. In our model, we found that depression during pregnancy and immediately after delivery, maternity blues, age, parity, temperament, and other factors were important predictors. The prevalence of PPD is about 10%–15%, and the proportion of positives is smaller than the proportion of negatives. Therefore, the positive predictive value (PPV) tends to be low, as in previous studies3, 4 and this study. Future studies will need to develop a prediction tool with high PPV. In addition to the data used as predictors in this study, previous studies using machine learning reported that any history of depression is an important factor.3 In our previous studies, we also confirmed that a history of depression5 is a risk factor for PPD. However, since the number of responses to this item of our study was still small, we could not add it to this analysis. We think that it is important to try to build a model that includes this item to the predictor in the future.

We were able to build a simple and useful model for predicting PPD by using machine learning techniques. In the future, it will be important to build a more accurate model by adding data on the history of depression to the predictor.

利用机器学习早期识别产后抑郁症
我们认为,将来有必要尝试建立一个模型,在预测因子中加入这一项。我们能够利用机器学习技术建立一个简单而有用的模型,用于预测 PPD。今后,通过在预测因子中加入抑郁症病史数据来建立一个更准确的模型将非常重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.40
自引率
4.20%
发文量
181
审稿时长
6-12 weeks
期刊介绍: PCN (Psychiatry and Clinical Neurosciences) Publication Frequency: Published 12 online issues a year by JSPN Content Categories: Review Articles Regular Articles Letters to the Editor Peer Review Process: All manuscripts undergo peer review by anonymous reviewers, an Editorial Board Member, and the Editor Publication Criteria: Manuscripts are accepted based on quality, originality, and significance to the readership Authors must confirm that the manuscript has not been published or submitted elsewhere and has been approved by each author
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信