Jia-Min Chen, Mei Rao, Yu-Ting Wei, Qiong-Gui Zhou, Jun-Long Tao, Shi-Bin Wang, Bo Bi
World Journal of Psychiatry, volume 15, issue 8, article 106622. Published 2025-08-19. Publication type: Journal Article. Impact factor: 3.4; JCR: Q1 (Psychiatry).
DOI: 10.5498/wjp.v15.i8.106622 (https://doi.org/10.5498/wjp.v15.i8.106622)
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12362627/pdf/
Machine learning-based nomogram for predicting depressive symptoms in women: A cross-sectional study in Guangdong Province, China.
Background: Depression in women is a prevalent and increasingly recognized mental health issue. Owing to cultural and social factors, many women with depression still face barriers to diagnosis and treatment, and traditional assessment methods often fail to identify high-risk individuals accurately. This highlights the need for more precise predictive tools. Predictive models built with machine learning (ML) algorithms may overcome the limitations of traditional methods and provide more comprehensive support for women's mental health.
Aim: To construct a hybrid ML-nomogram model that translates multivariate risk predictors of depressive symptoms in women into actionable clinical scoring thresholds, optimizing predictive accuracy and interpretability for healthcare applications.
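To make the idea of turning risk predictors into a clinical score concrete, here is a minimal sketch of how a nomogram is commonly derived from a logistic regression model. The abstract does not state which model underlies the nomogram, so the logistic form is an assumption, and every coefficient, the intercept, and the 0/1 coding below are hypothetical placeholders whose signs and sizes are arbitrary and not taken from the study.

```python
import math

# HYPOTHETICAL log-odds coefficients and value ranges; not taken from the study.
INTERCEPT = -3.5
PREDICTORS = {
    "insomnia": {"beta": 1.2, "low": 0.0, "high": 1.0},
    "anxiety_symptoms": {"beta": 2.0, "low": 0.0, "high": 1.0},
    "chronic_disease": {"beta": 0.5, "low": 0.0, "high": 1.0},
    "regular_exercise": {"beta": -0.4, "low": 0.0, "high": 1.0},
}


def reference_value(spec):
    """Value of the predictor that contributes the least to the log-odds."""
    return spec["low"] if spec["beta"] >= 0 else spec["high"]


def max_effect(predictors):
    """Largest log-odds span of any single predictor; it defines the 0-100 scale."""
    return max(abs(s["beta"]) * (s["high"] - s["low"]) for s in predictors.values())


def points(name, value, predictors=PREDICTORS):
    """Points for one predictor value, scaled so the strongest predictor spans 0-100."""
    spec = predictors[name]
    effect = spec["beta"] * (value - reference_value(spec))
    return 100.0 * effect / max_effect(predictors)


def total_points(patient, predictors=PREDICTORS):
    """Sum of points; `patient` must supply a value for every predictor."""
    return sum(points(name, value, predictors) for name, value in patient.items())


def probability_from_points(total, predictors=PREDICTORS, intercept=INTERCEPT):
    """Map total points back to a predicted probability via the logistic function."""
    baseline = intercept + sum(
        s["beta"] * reference_value(s) for s in predictors.values()
    )
    log_odds = baseline + total * max_effect(predictors) / 100.0
    return 1.0 / (1.0 + math.exp(-log_odds))


# Hypothetical respondent: insomnia and anxiety present, no chronic disease, no exercise.
patient = {
    "insomnia": 1.0,
    "anxiety_symptoms": 1.0,
    "chronic_disease": 0.0,
    "regular_exercise": 0.0,
}
score = total_points(patient)
risk = probability_from_points(score)
print(f"total points = {score:.0f}, predicted probability = {risk:.3f}")
```

The point scale is only a linear rescaling of the log-odds, so reading a probability off the total-points axis gives the same answer as evaluating the regression equation directly; the nomogram's value lies in letting a clinician do that sum by hand.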
Methods: We analyzed data from 7609 female participants aged 18 to 85 years in the Guangdong Provincial Sleep and Psychosomatic Health Survey. Sixteen variables, including anxiety symptoms, insomnia, chronic diseases, exercise habits, and age, were selected on the basis of prior literature, and all were entered into the ML models to make full use of the available predictive information. Three ML algorithms were used to construct predictive models: extreme gradient boosting, support vector machine, and light gradient boosting machine. Model performance was evaluated using accuracy, precision, recall, F1 score, and area under the curve (AUC). Feature importance was interpreted using SHapley Additive exPlanations (SHAP), and ablation studies validated the impact of the top five SHAP-derived features on predictive performance. A nomogram was then constructed from these prioritized predictors, and clinical utility was assessed through decision curve analysis.
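The evaluation metrics named above can be sketched in plain Python as follows. This is an illustrative sketch, not the authors' pipeline: the 0.5 classification threshold, the function names, and the eight-row toy data are assumptions, and AUC is computed here as the area under the ROC curve (the usual reading of the term, which the abstract does not spell out).

```python
from typing import Dict, Sequence


def classification_metrics(
    y_true: Sequence[int], y_prob: Sequence[float], threshold: float = 0.5
) -> Dict[str, float]:
    """Accuracy, precision, recall and F1 for 0/1 labels at a given threshold."""
    y_pred = [1 if p >= threshold else 0 for p in y_prob]
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


def roc_auc(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """ROC AUC as the chance that a random positive outscores a random negative.

    Ties count as half a win. This pairwise form is O(n_pos * n_neg), which is
    fine for a sketch but slow for large samples.
    """
    pos = [p for t, p in zip(y_true, y_prob) if t == 1]
    neg = [p for t, p in zip(y_true, y_prob) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy labels and predicted probabilities (made up for illustration only).
toy_true = [0, 0, 0, 0, 1, 1, 0, 1]
toy_prob = [0.1, 0.2, 0.3, 0.6, 0.7, 0.8, 0.4, 0.25]
toy_metrics = classification_metrics(toy_true, toy_prob)
toy_auc = roc_auc(toy_true, toy_prob)
print(toy_metrics, toy_auc)
```

Accuracy, precision, recall, and F1 depend on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds. With an outcome as uncommon as the one studied here, accuracy alone can look high for a model that rarely flags anyone, which is one reason to report the full set.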
Results: The prevalence of depressive symptoms in the sample was 6.8%. Among the predictive models, light gradient boosting machine achieved the highest AUC (0.867), ahead of extreme gradient boosting (AUC = 0.862) and support vector machine (AUC = 0.849). SHAP analysis identified insomnia, anxiety symptoms, age, chronic diseases, and exercise as the top five predictors. The nomogram based on these features demonstrated excellent discrimination (AUC = 0.910) and calibration, with significant net benefits in decision curve analysis compared with baseline strategies. The model effectively stratifies the risk of depressive symptoms, facilitating personalized and quantitative assessment in clinical settings. We also developed an interactive digital version of the nomogram to facilitate its use in clinical practice.
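The net benefit reported in decision curve analysis follows a standard formula: at a threshold probability p_t, net benefit = TP/n - (FP/n) * p_t / (1 - p_t), compared against the baseline strategies of intervening on everyone or on no one (net benefit 0). The sketch below illustrates that formula; the function names and the ten-row toy data are assumptions and do not reproduce the study's curves.

```python
from typing import Sequence


def net_benefit(
    y_true: Sequence[int], y_prob: Sequence[float], threshold: float
) -> float:
    """Net benefit of acting on everyone whose predicted risk is >= threshold."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    n = len(y_true)
    tp = sum(1 for t, p in zip(y_true, y_prob) if p >= threshold and t == 1)
    fp = sum(1 for t, p in zip(y_true, y_prob) if p >= threshold and t == 0)
    # Odds of the threshold: how many false positives one true positive is worth.
    weight = threshold / (1.0 - threshold)
    return tp / n - (fp / n) * weight


def net_benefit_treat_all(y_true: Sequence[int], threshold: float) -> float:
    """Baseline strategy: act on every participant regardless of predicted risk."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)


# Toy data (made up): 2 cases among 10 participants.
dca_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
dca_prob = [0.9, 0.4, 0.6, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1]

for pt in (0.1, 0.2, 0.3):
    model_nb = net_benefit(dca_true, dca_prob, pt)
    all_nb = net_benefit_treat_all(dca_true, pt)
    print(f"threshold={pt:.1f} model={model_nb:.3f} treat_all={all_nb:.3f} treat_none=0.000")
```

A model is considered clinically useful over the range of thresholds where its net benefit exceeds both baselines; plotting the three quantities against the threshold gives the decision curve.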
Conclusion: The ML-based model effectively predicts depressive symptoms in women and identifies insomnia, anxiety symptoms, age, chronic diseases, and exercise as key predictors, offering a practical tool for early detection and intervention.
About the journal:
The World Journal of Psychiatry (WJP) is a high-quality, peer-reviewed, open-access journal. The primary task of WJP is to rapidly publish high-quality original articles, reviews, editorials, and case reports in the field of psychiatry. To promote productive academic communication, the peer review process for WJP is transparent: all published manuscripts are accompanied by the anonymized reviewers’ comments and the authors’ responses. The primary aims of WJP are to improve diagnostic, therapeutic, and preventive modalities and the skills of clinicians, and to guide clinical practice in psychiatry.