基于机器学习的nomogram预测女性抑郁症状:一项来自中国广东省的横断面研究

IF 3.4 4区 医学 Q1 PSYCHIATRY
Jia-Min Chen, Mei Rao, Yu-Ting Wei, Qiong-Gui Zhou, Jun-Long Tao, Shi-Bin Wang, Bo Bi
{"title":"基于机器学习的nomogram预测女性抑郁症状:一项来自中国广东省的横断面研究","authors":"Jia-Min Chen, Mei Rao, Yu-Ting Wei, Qiong-Gui Zhou, Jun-Long Tao, Shi-Bin Wang, Bo Bi","doi":"10.5498/wjp.v15.i8.106622","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Female depression is a prevalent and increasingly recognized mental health issue. Due to cultural and social factors, many female patients still face challenges in diagnosis and treatment, and traditional assessment methods often fail to identify high-risk individuals accurately. This highlights the necessity of developing more precise predictive tools. Utilizing machine learning (ML) algorithms to construct predictive models may overcome the limitations of traditional methods, providing more comprehensive support for women's mental health.</p><p><strong>Aim: </strong>To construct an ML-nomogram hybrid model that translates multivariate risk predictors of female depressive symptoms into actionable clinical scoring thresholds, optimizing predictive accuracy and interpretability for healthcare applications.</p><p><strong>Methods: </strong>We analyzed data from 7609 female participants aged 18 to 85 years from the Guangdong Provincial Sleep and Psychosomatic Health Survey. Sixteen variables, including anxiety symptoms, insomnia, chronic diseases, exercise habits, and age, were selected based on prior literature and comprehensively incorporated into ML models to maximize predictive information utilization. Three ML algorithms, extreme gradient boosting, support vector machine, and light gradient boosting machine, were employed to construct predictive models. Model performance was evaluated using accuracy, precision, recall, F1 score, and area under the curve (AUC). Feature importance was interpreted using SHapley Additive exPlanations (SHAP), with ablation studies validating the impact of the top five SHAP-derived features on predictive performance, and a nomogram was constructed based on these prioritized predictors. Clinical utility was assessed through decision curve analysis.</p><p><strong>Results: </strong>The prevalence of depressive symptoms was 6.8% among the sample. The evaluation of predictive models revealed that light gradient boosting machine achieved a top-performing AUC of 0.867, placing it ahead of extreme gradient boosting (AUC = 0.862) and support vector machine (AUC = 0.849). SHAP analysis identified insomnia, anxiety symptoms, age, chronic disease, and exercise as the top five predictors. The nomogram based on these features demonstrated excellent discrimination (AUC = 0.910) and calibration, with significant net benefits in decision curve analysis compared to baseline strategies. The model effectively stratifies depressive symptoms risk, facilitating personalized and quantitative assessments in clinical settings. We also developed an interactive digital version of the nomogram to facilitate its application in clinical practice.</p><p><strong>Conclusion: </strong>The ML-based model effectively predicts depressive symptoms in women, identifying insomnia, anxiety symptoms, age, chronic diseases, and exercise as key predictors, offering a practical tool for early detection and intervention.</p>","PeriodicalId":23896,"journal":{"name":"World Journal of Psychiatry","volume":"15 8","pages":"106622"},"PeriodicalIF":3.4000,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12362627/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine learning-based nomogram for predicting depressive symptoms in women: A cross-sectional study in Guangdong Province, China.\",\"authors\":\"Jia-Min Chen, Mei Rao, Yu-Ting Wei, Qiong-Gui Zhou, Jun-Long Tao, Shi-Bin Wang, Bo Bi\",\"doi\":\"10.5498/wjp.v15.i8.106622\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Female depression is a prevalent and increasingly recognized mental health issue. Due to cultural and social factors, many female patients still face challenges in diagnosis and treatment, and traditional assessment methods often fail to identify high-risk individuals accurately. This highlights the necessity of developing more precise predictive tools. Utilizing machine learning (ML) algorithms to construct predictive models may overcome the limitations of traditional methods, providing more comprehensive support for women's mental health.</p><p><strong>Aim: </strong>To construct an ML-nomogram hybrid model that translates multivariate risk predictors of female depressive symptoms into actionable clinical scoring thresholds, optimizing predictive accuracy and interpretability for healthcare applications.</p><p><strong>Methods: </strong>We analyzed data from 7609 female participants aged 18 to 85 years from the Guangdong Provincial Sleep and Psychosomatic Health Survey. Sixteen variables, including anxiety symptoms, insomnia, chronic diseases, exercise habits, and age, were selected based on prior literature and comprehensively incorporated into ML models to maximize predictive information utilization. Three ML algorithms, extreme gradient boosting, support vector machine, and light gradient boosting machine, were employed to construct predictive models. Model performance was evaluated using accuracy, precision, recall, F1 score, and area under the curve (AUC). Feature importance was interpreted using SHapley Additive exPlanations (SHAP), with ablation studies validating the impact of the top five SHAP-derived features on predictive performance, and a nomogram was constructed based on these prioritized predictors. Clinical utility was assessed through decision curve analysis.</p><p><strong>Results: </strong>The prevalence of depressive symptoms was 6.8% among the sample. The evaluation of predictive models revealed that light gradient boosting machine achieved a top-performing AUC of 0.867, placing it ahead of extreme gradient boosting (AUC = 0.862) and support vector machine (AUC = 0.849). SHAP analysis identified insomnia, anxiety symptoms, age, chronic disease, and exercise as the top five predictors. The nomogram based on these features demonstrated excellent discrimination (AUC = 0.910) and calibration, with significant net benefits in decision curve analysis compared to baseline strategies. The model effectively stratifies depressive symptoms risk, facilitating personalized and quantitative assessments in clinical settings. We also developed an interactive digital version of the nomogram to facilitate its application in clinical practice.</p><p><strong>Conclusion: </strong>The ML-based model effectively predicts depressive symptoms in women, identifying insomnia, anxiety symptoms, age, chronic diseases, and exercise as key predictors, offering a practical tool for early detection and intervention.</p>\",\"PeriodicalId\":23896,\"journal\":{\"name\":\"World Journal of Psychiatry\",\"volume\":\"15 8\",\"pages\":\"106622\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12362627/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"World Journal of Psychiatry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.5498/wjp.v15.i8.106622\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHIATRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Journal of Psychiatry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.5498/wjp.v15.i8.106622","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0

摘要

背景:女性抑郁症是一种普遍存在的心理健康问题。由于文化和社会因素,许多女性患者在诊断和治疗方面仍面临挑战,传统的评估方法往往不能准确识别高危人群。这凸显了开发更精确的预测工具的必要性。利用机器学习(ML)算法构建预测模型可以克服传统方法的局限性,为女性心理健康提供更全面的支持。目的:构建一个ML-nomogram混合模型,将女性抑郁症状的多变量风险预测因子转化为可操作的临床评分阈值,优化预测的准确性和可解释性,用于医疗保健应用。方法:我们分析了广东省睡眠与身心健康调查中7609名年龄在18至85岁之间的女性参与者的数据。根据已有文献选取焦虑症状、失眠、慢性病、运动习惯、年龄等16个变量,综合纳入ML模型,最大限度地利用预测信息。采用极端梯度增强、支持向量机和光梯度增强三种机器学习算法构建预测模型。使用准确性、精密度、召回率、F1分数和曲线下面积(AUC)来评估模型的性能。特征重要性使用SHapley加性解释(SHAP)进行解释,消融研究验证了SHapley衍生的前5个特征对预测性能的影响,并基于这些优先预测因子构建了nomogram。通过决策曲线分析评估临床效用。结果:样本中抑郁症状的患病率为6.8%。对预测模型的评价表明,光梯度增强机的AUC为0.867,领先于极端梯度增强机(AUC = 0.862)和支持向量机(AUC = 0.849)。SHAP分析确定失眠、焦虑症状、年龄、慢性疾病和运动是前五大预测因素。基于这些特征的nomogram表现出良好的辨别能力(AUC = 0.910)和校准能力,与基线策略相比,在决策曲线分析中具有显著的净收益。该模型有效地分层抑郁症状的风险,促进个性化和定量评估在临床设置。我们还开发了一个交互式的数字版本的图,以促进其在临床实践中的应用。结论:基于ml的模型可以有效预测女性抑郁症状,识别出失眠、焦虑症状、年龄、慢性疾病和运动作为关键预测因素,为早期发现和干预提供了实用的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Machine learning-based nomogram for predicting depressive symptoms in women: A cross-sectional study in Guangdong Province, China.

Machine learning-based nomogram for predicting depressive symptoms in women: A cross-sectional study in Guangdong Province, China.

Machine learning-based nomogram for predicting depressive symptoms in women: A cross-sectional study in Guangdong Province, China.

Machine learning-based nomogram for predicting depressive symptoms in women: A cross-sectional study in Guangdong Province, China.

Background: Female depression is a prevalent and increasingly recognized mental health issue. Due to cultural and social factors, many female patients still face challenges in diagnosis and treatment, and traditional assessment methods often fail to identify high-risk individuals accurately. This highlights the necessity of developing more precise predictive tools. Utilizing machine learning (ML) algorithms to construct predictive models may overcome the limitations of traditional methods, providing more comprehensive support for women's mental health.

Aim: To construct an ML-nomogram hybrid model that translates multivariate risk predictors of female depressive symptoms into actionable clinical scoring thresholds, optimizing predictive accuracy and interpretability for healthcare applications.

Methods: We analyzed data from 7609 female participants aged 18 to 85 years from the Guangdong Provincial Sleep and Psychosomatic Health Survey. Sixteen variables, including anxiety symptoms, insomnia, chronic diseases, exercise habits, and age, were selected based on prior literature and comprehensively incorporated into ML models to maximize predictive information utilization. Three ML algorithms, extreme gradient boosting, support vector machine, and light gradient boosting machine, were employed to construct predictive models. Model performance was evaluated using accuracy, precision, recall, F1 score, and area under the curve (AUC). Feature importance was interpreted using SHapley Additive exPlanations (SHAP), with ablation studies validating the impact of the top five SHAP-derived features on predictive performance, and a nomogram was constructed based on these prioritized predictors. Clinical utility was assessed through decision curve analysis.

Results: The prevalence of depressive symptoms was 6.8% among the sample. The evaluation of predictive models revealed that light gradient boosting machine achieved a top-performing AUC of 0.867, placing it ahead of extreme gradient boosting (AUC = 0.862) and support vector machine (AUC = 0.849). SHAP analysis identified insomnia, anxiety symptoms, age, chronic disease, and exercise as the top five predictors. The nomogram based on these features demonstrated excellent discrimination (AUC = 0.910) and calibration, with significant net benefits in decision curve analysis compared to baseline strategies. The model effectively stratifies depressive symptoms risk, facilitating personalized and quantitative assessments in clinical settings. We also developed an interactive digital version of the nomogram to facilitate its application in clinical practice.

Conclusion: The ML-based model effectively predicts depressive symptoms in women, identifying insomnia, anxiety symptoms, age, chronic diseases, and exercise as key predictors, offering a practical tool for early detection and intervention.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
6.50%
发文量
110
期刊介绍: The World Journal of Psychiatry (WJP) is a high-quality, peer reviewed, open-access journal. The primary task of WJP is to rapidly publish high-quality original articles, reviews, editorials, and case reports in the field of psychiatry. In order to promote productive academic communication, the peer review process for the WJP is transparent; to this end, all published manuscripts are accompanied by the anonymized reviewers’ comments as well as the authors’ responses. The primary aims of the WJP are to improve diagnostic, therapeutic and preventive modalities and the skills of clinicians and to guide clinical practice in psychiatry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信