使用广泛性焦虑障碍量表和患者健康问卷对焦虑和抑郁的严重程度进行分类：应用分类和回归树模型的全国横断面研究

IF 3.9 2区医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH

JMIR Public Health and Surveillance Pub Date : 2025-09-30 DOI:10.2196/72591

Andre Faro, Julian Tejada, Wael Al-Delaimy

{"title":"使用广泛性焦虑障碍量表和患者健康问卷对焦虑和抑郁的严重程度进行分类：应用分类和回归树模型的全国横断面研究","authors":"Andre Faro, Julian Tejada, Wael Al-Delaimy","doi":"10.2196/72591","DOIUrl":null,"url":null,"abstract":"Background: Scalable and accurate screening tools are critical for public mental health strategies, especially in low- and middle-income countries (LMICs). While the Generalized Anxiety Disorder Scale (GAD-7) and Patient Health Questionnaire (PHQ-9) are widely used, their full application in large-scale programs can pose feasibility challenges. By contrast, shorter versions like GAD-2 and PHQ-2 reduce burdens but fail to capture symptom diversity.Objective: This study aimed to optimize screening for anxiety and depression severity using classification and regression tree (CART) models, identifying concise and high-performing decision rules based on the GAD-7 and PHQ-9 items, and to test their reproducibility in 5 independent datasets.Methods: A cross-sectional, nonprobabilistic study was conducted with 20,585 Brazilian adults from all 27 states and more than 3,000 cities, collected using digital outreach. Anxiety and depression symptoms were assessed using the GAD-7 and PHQ-9. CART models were trained and tested on bootstrapped samples (70% training, 30% testing), totaling 45,000 trees per scale. Each model used combinations of scale items and sociodemographic predictors. Robustness was evaluated via 10-fold cross-validation and evaluation across 3 hyperparameter configurations (minsplit and minbucket=500, 1000, 2000). Performance metrics included accuracy, sensitivity, specificity, precision, F1-score, and area under the curve (AUC).Results: The CART models produced concise, high-performing decision rules-using only 2 items for the GAD-7 and 3 for the PHQ-9. No sociodemographic variable appeared in the final classification paths. For GAD-7, the models achieved an accuracy of 86.1% for minimal or mild severity and 85.1% for severe cases, with both categories showing AUC values above 0.900. By contrast, the moderate severity class had lower performance, with accuracy around 51% and an AUC of 0.728. For PHQ-9, the models achieved 81.7% accuracy for minimal or mild cases and 78.8% for severe cases, with AUCs again exceeding 0.900 for the extreme classes; the moderate or moderately severe class showed 66.9% accuracy and an AUC of 0.776. The most frequently repeated rules included the following: \"GAD2<2 and GAD4<2\" for identifying minimal or mild anxiety and \"GAD2≥2 and GAD4=3\" for severe anxiety; for depression, \"PHQ2<2and PHQ4<2\" for minimal or mild cases and \"PHQ2≥2 and PHQ8≥2\" for severe cases. These rule-based models demonstrated stable performance across thousands of bootstrapped replications and showed reproducibility in 5 independent datasets through external validation.Conclusions: CART models enabled simplified, symptom-specific pathways for stratifying anxiety and depression severity with high precision and minimal item burden. These rule-based shortcuts offer an efficient alternative to fixed short forms (eg, GAD-2, PHQ-2) by preserving symptom diversity and severity discrimination. The findings support and lay the groundwork for adaptive, cost-effective screening and intervention models, especially in resource-limited settings and LMICs.","PeriodicalId":14765,"journal":{"name":"JMIR Public Health and Surveillance","volume":"11 ","pages":"e72591"},"PeriodicalIF":3.9000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12483340/pdf/","citationCount":"0","resultStr":"{\"title\":\"Severity Classification of Anxiety and Depression Using Generalized Anxiety Disorder Scale and Patient Health Questionnaire: National Cross-Sectional Study Applying Classification and Regression Tree Models.\",\"authors\":\"Andre Faro, Julian Tejada, Wael Al-Delaimy\",\"doi\":\"10.2196/72591\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Scalable and accurate screening tools are critical for public mental health strategies, especially in low- and middle-income countries (LMICs). While the Generalized Anxiety Disorder Scale (GAD-7) and Patient Health Questionnaire (PHQ-9) are widely used, their full application in large-scale programs can pose feasibility challenges. By contrast, shorter versions like GAD-2 and PHQ-2 reduce burdens but fail to capture symptom diversity.Objective: This study aimed to optimize screening for anxiety and depression severity using classification and regression tree (CART) models, identifying concise and high-performing decision rules based on the GAD-7 and PHQ-9 items, and to test their reproducibility in 5 independent datasets.Methods: A cross-sectional, nonprobabilistic study was conducted with 20,585 Brazilian adults from all 27 states and more than 3,000 cities, collected using digital outreach. Anxiety and depression symptoms were assessed using the GAD-7 and PHQ-9. CART models were trained and tested on bootstrapped samples (70% training, 30% testing), totaling 45,000 trees per scale. Each model used combinations of scale items and sociodemographic predictors. Robustness was evaluated via 10-fold cross-validation and evaluation across 3 hyperparameter configurations (minsplit and minbucket=500, 1000, 2000). Performance metrics included accuracy, sensitivity, specificity, precision, F1-score, and area under the curve (AUC).Results: The CART models produced concise, high-performing decision rules-using only 2 items for the GAD-7 and 3 for the PHQ-9. No sociodemographic variable appeared in the final classification paths. For GAD-7, the models achieved an accuracy of 86.1% for minimal or mild severity and 85.1% for severe cases, with both categories showing AUC values above 0.900. By contrast, the moderate severity class had lower performance, with accuracy around 51% and an AUC of 0.728. For PHQ-9, the models achieved 81.7% accuracy for minimal or mild cases and 78.8% for severe cases, with AUCs again exceeding 0.900 for the extreme classes; the moderate or moderately severe class showed 66.9% accuracy and an AUC of 0.776. The most frequently repeated rules included the following: \\\"GAD2<2 and GAD4<2\\\" for identifying minimal or mild anxiety and \\\"GAD2≥2 and GAD4=3\\\" for severe anxiety; for depression, \\\"PHQ2<2and PHQ4<2\\\" for minimal or mild cases and \\\"PHQ2≥2 and PHQ8≥2\\\" for severe cases. These rule-based models demonstrated stable performance across thousands of bootstrapped replications and showed reproducibility in 5 independent datasets through external validation.Conclusions: CART models enabled simplified, symptom-specific pathways for stratifying anxiety and depression severity with high precision and minimal item burden. These rule-based shortcuts offer an efficient alternative to fixed short forms (eg, GAD-2, PHQ-2) by preserving symptom diversity and severity discrimination. The findings support and lay the groundwork for adaptive, cost-effective screening and intervention models, especially in resource-limited settings and LMICs.\",\"PeriodicalId\":14765,\"journal\":{\"name\":\"JMIR Public Health and Surveillance\",\"volume\":\"11 \",\"pages\":\"e72591\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12483340/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR Public Health and Surveillance\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.2196/72591\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Public Health and Surveillance","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/72591","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}

引用次数: 0

摘要

背景：可扩展和准确的筛查工具对公共精神卫生战略至关重要，特别是在低收入和中等收入国家（LMICs）。虽然广泛性焦虑障碍量表（GAD-7）和患者健康问卷（PHQ-9）被广泛使用，但它们在大规模项目中的全面应用存在可行性挑战。相比之下，GAD-2和PHQ-2等较短的版本减轻了负担，但未能捕捉到症状的多样性。目的：利用分类回归树（CART）模型优化焦虑和抑郁严重程度的筛选，确定基于GAD-7和PHQ-9的简洁高效的决策规则，并在5个独立数据集上检验其可重复性。方法：对来自27个州和3000多个城市的20,585名巴西成年人进行了一项横断面非概率研究，这些成年人是通过数字外展收集的。使用GAD-7和PHQ-9评估焦虑和抑郁症状。CART模型在bootstrap样本上进行训练和测试（70%训练，30%测试），每个尺度共45,000棵树。每个模型都使用了量表项目和社会人口预测因子的组合。鲁棒性评估通过10倍交叉验证和评估跨3个超参数配置（minsplit和minbucket=500, 1000, 2000）。性能指标包括准确性、敏感性、特异性、精密度、f1评分和曲线下面积（AUC）。结果：CART模型产生了简洁、高效的决策规则——GAD-7只使用2个项目，PHQ-9只使用3个项目。在最终的分类路径中没有出现社会人口学变量。对于GAD-7，模型在轻度或轻度严重程度上的准确率为86.1%，在重度病例上的准确率为85.1%，两个类别的AUC值都在0.900以上。相比之下，中度严重等级的性能较低，准确率约为51%，AUC为0.728。对于PHQ-9，模型对轻微或轻微病例的准确率为81.7%，对严重病例的准确率为78.8%，极端类别的auc再次超过0.900；中度或中度重度的准确率为66.9%，AUC为0.776。最常重复的规则包括：“gad2结论：CART模型简化了症状特异性途径，以高精度和最小的项目负担对焦虑和抑郁严重程度进行分层。这些基于规则的快捷方式通过保留症状多样性和严重程度区分，为固定的简短形式（例如GAD-2、PHQ-2）提供了一种有效的替代方法。这些发现支持并奠定了适应性的、具有成本效益的筛查和干预模式的基础，特别是在资源有限的环境和中低收入国家。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Severity Classification of Anxiety and Depression Using Generalized Anxiety Disorder Scale and Patient Health Questionnaire: National Cross-Sectional Study Applying Classification and Regression Tree Models.

Background: Scalable and accurate screening tools are critical for public mental health strategies, especially in low- and middle-income countries (LMICs). While the Generalized Anxiety Disorder Scale (GAD-7) and Patient Health Questionnaire (PHQ-9) are widely used, their full application in large-scale programs can pose feasibility challenges. By contrast, shorter versions like GAD-2 and PHQ-2 reduce burdens but fail to capture symptom diversity.

Objective: This study aimed to optimize screening for anxiety and depression severity using classification and regression tree (CART) models, identifying concise and high-performing decision rules based on the GAD-7 and PHQ-9 items, and to test their reproducibility in 5 independent datasets.

Methods: A cross-sectional, nonprobabilistic study was conducted with 20,585 Brazilian adults from all 27 states and more than 3,000 cities, collected using digital outreach. Anxiety and depression symptoms were assessed using the GAD-7 and PHQ-9. CART models were trained and tested on bootstrapped samples (70% training, 30% testing), totaling 45,000 trees per scale. Each model used combinations of scale items and sociodemographic predictors. Robustness was evaluated via 10-fold cross-validation and evaluation across 3 hyperparameter configurations (minsplit and minbucket=500, 1000, 2000). Performance metrics included accuracy, sensitivity, specificity, precision, F1-score, and area under the curve (AUC).

Results: The CART models produced concise, high-performing decision rules-using only 2 items for the GAD-7 and 3 for the PHQ-9. No sociodemographic variable appeared in the final classification paths. For GAD-7, the models achieved an accuracy of 86.1% for minimal or mild severity and 85.1% for severe cases, with both categories showing AUC values above 0.900. By contrast, the moderate severity class had lower performance, with accuracy around 51% and an AUC of 0.728. For PHQ-9, the models achieved 81.7% accuracy for minimal or mild cases and 78.8% for severe cases, with AUCs again exceeding 0.900 for the extreme classes; the moderate or moderately severe class showed 66.9% accuracy and an AUC of 0.776. The most frequently repeated rules included the following: "GAD2<2 and GAD4<2" for identifying minimal or mild anxiety and "GAD2≥2 and GAD4=3" for severe anxiety; for depression, "PHQ2<2and PHQ4<2" for minimal or mild cases and "PHQ2≥2 and PHQ8≥2" for severe cases. These rule-based models demonstrated stable performance across thousands of bootstrapped replications and showed reproducibility in 5 independent datasets through external validation.

Conclusions: CART models enabled simplified, symptom-specific pathways for stratifying anxiety and depression severity with high precision and minimal item burden. These rule-based shortcuts offer an efficient alternative to fixed short forms (eg, GAD-2, PHQ-2) by preserving symptom diversity and severity discrimination. The findings support and lay the groundwork for adaptive, cost-effective screening and intervention models, especially in resource-limited settings and LMICs.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

JMIR Public Health and Surveillance Multiple-

CiteScore

13.70

自引率

2.40%

发文量

136

审稿时长

12 weeks

期刊介绍： JMIR Public Health & Surveillance (JPHS) is a renowned scholarly journal indexed on PubMed. It follows a rigorous peer-review process and covers a wide range of disciplines. The journal distinguishes itself by its unique focus on the intersection of technology and innovation in the field of public health. JPHS delves into diverse topics such as public health informatics, surveillance systems, rapid reports, participatory epidemiology, infodemiology, infoveillance, digital disease detection, digital epidemiology, electronic public health interventions, mass media and social media campaigns, health communication, and emerging population health analysis systems and tools.