Unsupervised Machine Learning to Identify Depressive Subtypes.

IF 2.3 Q3 MEDICAL INFORMATICS
Healthcare Informatics Research Pub Date : 2022-07-01 Epub Date: 2022-07-31 DOI:10.4258/hir.2022.28.3.256
Benson Kung, Maurice Chiang, Gayan Perera, Megan Pritchard, Robert Stewart
{"title":"Unsupervised Machine Learning to Identify Depressive Subtypes.","authors":"Benson Kung, Maurice Chiang, Gayan Perera, Megan Pritchard, Robert Stewart","doi":"10.4258/hir.2022.28.3.256","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>This study evaluated an unsupervised machine learning method, latent Dirichlet allocation (LDA), as a method for identifying subtypes of depression within symptom data.</p><p><strong>Methods: </strong>Data from 18,314 depressed patients were used to create LDA models. The outcomes included future emergency presentations, crisis events, and behavioral problems. One model was chosen for further analysis based upon its potential as a clinically meaningful construct. The associations between patient groups created with the final LDA model and outcomes were tested. These steps were repeated with a commonly-used latent variable model to provide additional context to the LDA results.</p><p><strong>Results: </strong>Five subtypes were identified using the final LDA model. Prior to the outcome analysis, the subtypes were labeled based upon the symptom distributions they produced: psychotic, severe, mild, agitated, and anergic-apathetic. The patient groups largely aligned with the outcome data. For example, the psychotic and severe subgroups were more likely to have emergency presentations (odds ratio [OR] = 1.29; 95% confidence interval [CI], 1.17-1.43 and OR = 1.16; 95% CI, 1.05-1.29, respectively), whereas these outcomes were less likely in the mild subgroup (OR = 0.86; 95% CI, 0.78-0.94). We found that the LDA subtypes were characterized by clusters of unique symptoms. This contrasted with the latent variable model subtypes, which were largely stratified by severity.</p><p><strong>Conclusions: </strong>This study suggests that LDA can surface clinically meaningful, qualitative subtypes. Future work could be incorporated into studies concerning the biological bases of depression, thereby contributing to the development of new psychiatric therapeutics.</p>","PeriodicalId":12947,"journal":{"name":"Healthcare Informatics Research","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/97/14/hir-2022-28-3-256.PMC9388921.pdf","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare Informatics Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4258/hir.2022.28.3.256","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/7/31 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 3

Abstract

Objectives: This study evaluated an unsupervised machine learning method, latent Dirichlet allocation (LDA), as a method for identifying subtypes of depression within symptom data.

Methods: Data from 18,314 depressed patients were used to create LDA models. The outcomes included future emergency presentations, crisis events, and behavioral problems. One model was chosen for further analysis based upon its potential as a clinically meaningful construct. The associations between patient groups created with the final LDA model and outcomes were tested. These steps were repeated with a commonly-used latent variable model to provide additional context to the LDA results.

Results: Five subtypes were identified using the final LDA model. Prior to the outcome analysis, the subtypes were labeled based upon the symptom distributions they produced: psychotic, severe, mild, agitated, and anergic-apathetic. The patient groups largely aligned with the outcome data. For example, the psychotic and severe subgroups were more likely to have emergency presentations (odds ratio [OR] = 1.29; 95% confidence interval [CI], 1.17-1.43 and OR = 1.16; 95% CI, 1.05-1.29, respectively), whereas these outcomes were less likely in the mild subgroup (OR = 0.86; 95% CI, 0.78-0.94). We found that the LDA subtypes were characterized by clusters of unique symptoms. This contrasted with the latent variable model subtypes, which were largely stratified by severity.

Conclusions: This study suggests that LDA can surface clinically meaningful, qualitative subtypes. Future work could be incorporated into studies concerning the biological bases of depression, thereby contributing to the development of new psychiatric therapeutics.

Abstract Image

Abstract Image

Abstract Image

无监督机器学习识别抑郁症亚型。
目的:本研究评估了一种无监督的机器学习方法,潜在狄利克雷分配(LDA),作为在症状数据中识别抑郁亚型的方法。方法:采用18314例抑郁症患者的数据建立LDA模型。结果包括未来的紧急情况介绍、危机事件和行为问题。根据其作为临床意义构建体的潜力,选择了一种模型进行进一步分析。测试了使用最终LDA模型创建的患者组与结果之间的关联。用常用的潜在变量模型重复这些步骤,为LDA结果提供额外的上下文。结果:使用最终LDA模型鉴定出5种亚型。在结果分析之前,根据亚型产生的症状分布对其进行标记:精神病、严重、轻度、激动和无反应性冷漠。患者组与结果数据基本一致。例如,精神病亚组和重症亚组更有可能出现紧急情况(比值比[OR]=1.29;95%置信区间[CI]分别为1.17-1.43和OR=1.16;95%CI分别为1.05-1.29),而轻症亚组出现紧急情况的可能性较小(OR=0.86;95%CI为0.78-0.94)。这与潜在变量模型亚型形成对比,后者在很大程度上按严重程度分层。结论:本研究表明LDA可以表现出具有临床意义的定性亚型。未来的工作可以纳入有关抑郁症生物学基础的研究,从而有助于开发新的精神疗法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Healthcare Informatics Research
Healthcare Informatics Research MEDICAL INFORMATICS-
CiteScore
4.90
自引率
6.90%
发文量
44
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信