Bayesian Networks for Prescreening in Depression: Algorithm Development and Validation.

IF 4.8 2区 医学 Q1 PSYCHIATRY
Jmir Mental Health Pub Date : 2024-07-04 DOI:10.2196/52045
Eduardo Maekawa, Eoin Martino Grua, Carina Akemi Nakamura, Marcia Scazufca, Ricardo Araya, Tim Peters, Pepijn van de Ven
{"title":"Bayesian Networks for Prescreening in Depression: Algorithm Development and Validation.","authors":"Eduardo Maekawa, Eoin Martino Grua, Carina Akemi Nakamura, Marcia Scazufca, Ricardo Araya, Tim Peters, Pepijn van de Ven","doi":"10.2196/52045","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Identifying individuals with depressive symptomatology (DS) promptly and effectively is of paramount importance for providing timely treatment. Machine learning models have shown promise in this area; however, studies often fall short in demonstrating the practical benefits of using these models and fail to provide tangible real-world applications.</p><p><strong>Objective: </strong>This study aims to establish a novel methodology for identifying individuals likely to exhibit DS, identify the most influential features in a more explainable way via probabilistic measures, and propose tools that can be used in real-world applications.</p><p><strong>Methods: </strong>The study used 3 data sets: PROACTIVE, the Brazilian National Health Survey (Pesquisa Nacional de Saúde [PNS]) 2013, and PNS 2019, comprising sociodemographic and health-related features. A Bayesian network was used for feature selection. Selected features were then used to train machine learning models to predict DS, operationalized as a score of ≥10 on the 9-item Patient Health Questionnaire. The study also analyzed the impact of varying sensitivity rates on the reduction of screening interviews compared to a random approach.</p><p><strong>Results: </strong>The methodology allows the users to make an informed trade-off among sensitivity, specificity, and a reduction in the number of interviews. At the thresholds of 0.444, 0.412, and 0.472, determined by maximizing the Youden index, the models achieved sensitivities of 0.717, 0.741, and 0.718, and specificities of 0.644, 0.737, and 0.766 for PROACTIVE, PNS 2013, and PNS 2019, respectively. The area under the receiver operating characteristic curve was 0.736, 0.801, and 0.809 for these 3 data sets, respectively. For the PROACTIVE data set, the most influential features identified were postural balance, shortness of breath, and how old people feel they are. In the PNS 2013 data set, the features were the ability to do usual activities, chest pain, sleep problems, and chronic back problems. The PNS 2019 data set shared 3 of the most influential features with the PNS 2013 data set. However, the difference was the replacement of chronic back problems with verbal abuse. It is important to note that the features contained in the PNS data sets differ from those found in the PROACTIVE data set. An empirical analysis demonstrated that using the proposed model led to a potential reduction in screening interviews of up to 52% while maintaining a sensitivity of 0.80.</p><p><strong>Conclusions: </strong>This study developed a novel methodology for identifying individuals with DS, demonstrating the utility of using Bayesian networks to identify the most significant features. Moreover, this approach has the potential to substantially reduce the number of screening interviews while maintaining high sensitivity, thereby facilitating improved early identification and intervention strategies for individuals experiencing DS.</p>","PeriodicalId":48616,"journal":{"name":"Jmir Mental Health","volume":"11 ","pages":"e52045"},"PeriodicalIF":4.8000,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11258528/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jmir Mental Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/52045","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Identifying individuals with depressive symptomatology (DS) promptly and effectively is of paramount importance for providing timely treatment. Machine learning models have shown promise in this area; however, studies often fall short in demonstrating the practical benefits of using these models and fail to provide tangible real-world applications.

Objective: This study aims to establish a novel methodology for identifying individuals likely to exhibit DS, identify the most influential features in a more explainable way via probabilistic measures, and propose tools that can be used in real-world applications.

Methods: The study used 3 data sets: PROACTIVE, the Brazilian National Health Survey (Pesquisa Nacional de Saúde [PNS]) 2013, and PNS 2019, comprising sociodemographic and health-related features. A Bayesian network was used for feature selection. Selected features were then used to train machine learning models to predict DS, operationalized as a score of ≥10 on the 9-item Patient Health Questionnaire. The study also analyzed the impact of varying sensitivity rates on the reduction of screening interviews compared to a random approach.

Results: The methodology allows the users to make an informed trade-off among sensitivity, specificity, and a reduction in the number of interviews. At the thresholds of 0.444, 0.412, and 0.472, determined by maximizing the Youden index, the models achieved sensitivities of 0.717, 0.741, and 0.718, and specificities of 0.644, 0.737, and 0.766 for PROACTIVE, PNS 2013, and PNS 2019, respectively. The area under the receiver operating characteristic curve was 0.736, 0.801, and 0.809 for these 3 data sets, respectively. For the PROACTIVE data set, the most influential features identified were postural balance, shortness of breath, and how old people feel they are. In the PNS 2013 data set, the features were the ability to do usual activities, chest pain, sleep problems, and chronic back problems. The PNS 2019 data set shared 3 of the most influential features with the PNS 2013 data set. However, the difference was the replacement of chronic back problems with verbal abuse. It is important to note that the features contained in the PNS data sets differ from those found in the PROACTIVE data set. An empirical analysis demonstrated that using the proposed model led to a potential reduction in screening interviews of up to 52% while maintaining a sensitivity of 0.80.

Conclusions: This study developed a novel methodology for identifying individuals with DS, demonstrating the utility of using Bayesian networks to identify the most significant features. Moreover, this approach has the potential to substantially reduce the number of screening interviews while maintaining high sensitivity, thereby facilitating improved early identification and intervention strategies for individuals experiencing DS.

用于抑郁症预检的贝叶斯网络:算法开发与验证
背景:及时有效地识别抑郁症状(DS)患者对于提供及时治疗至关重要。机器学习模型在这一领域大有可为;然而,研究往往无法证明使用这些模型的实际益处,也未能提供切实的实际应用:本研究旨在建立一种新的方法来识别可能表现出 DS 的个体,通过概率测量以更易于解释的方式识别最具影响力的特征,并提出可用于实际应用的工具:研究使用了 3 个数据集:方法:研究使用了三组数据:PROACTIVE、2013 年巴西全国健康调查(Pesquisa Nacional de Saúde [PNS])和 2019 年巴西全国健康调查(PNS 2019),其中包括社会人口学特征和健康相关特征。贝叶斯网络用于特征选择。选定的特征随后被用于训练机器学习模型,以预测 DS,即在 9 个项目的患者健康问卷中得分≥10。与随机方法相比,该研究还分析了不同灵敏度对减少筛查面谈的影响:结果:该方法允许用户在灵敏度、特异性和减少面谈次数之间做出明智的权衡。在尤登指数最大化确定的 0.444、0.412 和 0.472 临界值下,PROACTIVE、PNS 2013 和 PNS 2019 模型的灵敏度分别为 0.717、0.741 和 0.718,特异度分别为 0.644、0.737 和 0.766。这三个数据集的接收者操作特征曲线下面积分别为 0.736、0.801 和 0.809。在 PROACTIVE 数据集中,最有影响力的特征是姿势平衡、呼吸急促和人们感觉自己的年龄。在 PNS 2013 数据集中,特征是进行日常活动的能力、胸痛、睡眠问题和慢性背部问题。PNS 2019 数据集与 PNS 2013 数据集共享 3 个最具影响力的特征。然而,不同之处在于用辱骂代替了慢性背部问题。值得注意的是,PNS 数据集中的特征与 PROACTIVE 数据集中的特征有所不同。实证分析表明,使用所提出的模型可将筛查面谈次数减少 52%,同时保持 0.80 的灵敏度:本研究开发了一种用于识别 DS 患者的新方法,证明了使用贝叶斯网络识别最重要特征的实用性。此外,这种方法有可能在保持高灵敏度的同时大幅减少筛查面谈的次数,从而有助于改进对 DS 患者的早期识别和干预策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Jmir Mental Health
Jmir Mental Health Medicine-Psychiatry and Mental Health
CiteScore
10.80
自引率
3.80%
发文量
104
审稿时长
16 weeks
期刊介绍: JMIR Mental Health (JMH, ISSN 2368-7959) is a PubMed-indexed, peer-reviewed sister journal of JMIR, the leading eHealth journal (Impact Factor 2016: 5.175). JMIR Mental Health focusses on digital health and Internet interventions, technologies and electronic innovations (software and hardware) for mental health, addictions, online counselling and behaviour change. This includes formative evaluation and system descriptions, theoretical papers, review papers, viewpoint/vision papers, and rigorous evaluations.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信