Clinical decision support for pharmacologic management of treatment-resistant depression with augmented large language models

Roy H. Perlis , Pilar F. Verhaak , Joseph Goldberg , Cristina Cusin , Michael Ostacher , Gin S. Malhi , Carlos A. Zarate , Richard C. Shelton , Dan V. Iosifescu , Mauricio Tohen , Manish Kumar Jha , Martha Sajatovic , Michael Berk
{"title":"Clinical decision support for pharmacologic management of treatment-resistant depression with augmented large language models","authors":"Roy H. Perlis ,&nbsp;Pilar F. Verhaak ,&nbsp;Joseph Goldberg ,&nbsp;Cristina Cusin ,&nbsp;Michael Ostacher ,&nbsp;Gin S. Malhi ,&nbsp;Carlos A. Zarate ,&nbsp;Richard C. Shelton ,&nbsp;Dan V. Iosifescu ,&nbsp;Mauricio Tohen ,&nbsp;Manish Kumar Jha ,&nbsp;Martha Sajatovic ,&nbsp;Michael Berk","doi":"10.1016/j.xjmad.2025.100142","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>We evaluated whether a large language model could assist in selecting psychopharmacological treatments for adults with treatment-resistant depression.</div></div><div><h3>Methods</h3><div>We generated 20 clinical vignettes reflecting treatment-resistant depression among adults based on distributions drawn from electronic health records. Each vignette was evaluated by 2 expert psychopharmacologists to determine and rank the 5 best next-step pharmacologic interventions, as well as contraindicated or poor next-step treatments. Vignettes were then presented in random order, permuting gender and race, to a large language model (Qwen 2.5:7B), augmented with a synopsis of published treatment guidelines. Model output was compared to expert rankings, as well as to those of a convenience sample of community clinicians and an additional group of expert clinicians.</div></div><div><h3>Results</h3><div>The augmented model prioritized the expert-designated optimal choice for 114/320 vignettes (35.6 %, 95 % CI 30.6 %–41.0 %; Cohen’s kappa = 0.34, 95 % CI 0.28–0.39). There were no vignettes for which any of the model choices were among the poor or contraindicated treatments. Results were not meaningfully different when gender or race of the vignette was permuted to examine risk for bias. A sample of community clinicians identified the optimal treatment choice for 12/91 vignettes (13.2 %, 95 % CI: 7.7–21.6 %; Cohen’s kappa = 0.10, 95 % CI 0.03–0.18), while an additional group of expert psychopharmacologists identified optimal treatment for 9/140 (6.4 %, 95 %CI: 3.4–11.8 %; Cohen’s kappa = 0.03, 95 % CI 0.01–0.08).</div></div><div><h3>Conclusion</h3><div>An augmented language model demonstrated moderate agreement with expert recommendations and avoided contraindicated treatments, suggesting potential as a tool for supporting complex psychopharmacologic decision-making in treatment-resistant depression.</div></div>","PeriodicalId":73841,"journal":{"name":"Journal of mood and anxiety disorders","volume":"12 ","pages":"Article 100142"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of mood and anxiety disorders","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2950004425000392","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background

We evaluated whether a large language model could assist in selecting psychopharmacological treatments for adults with treatment-resistant depression.

Methods

We generated 20 clinical vignettes reflecting treatment-resistant depression among adults based on distributions drawn from electronic health records. Each vignette was evaluated by 2 expert psychopharmacologists to determine and rank the 5 best next-step pharmacologic interventions, as well as contraindicated or poor next-step treatments. Vignettes were then presented in random order, permuting gender and race, to a large language model (Qwen 2.5:7B), augmented with a synopsis of published treatment guidelines. Model output was compared to expert rankings, as well as to those of a convenience sample of community clinicians and an additional group of expert clinicians.

Results

The augmented model prioritized the expert-designated optimal choice for 114/320 vignettes (35.6 %, 95 % CI 30.6 %–41.0 %; Cohen’s kappa = 0.34, 95 % CI 0.28–0.39). There were no vignettes for which any of the model choices were among the poor or contraindicated treatments. Results were not meaningfully different when gender or race of the vignette was permuted to examine risk for bias. A sample of community clinicians identified the optimal treatment choice for 12/91 vignettes (13.2 %, 95 % CI: 7.7–21.6 %; Cohen’s kappa = 0.10, 95 % CI 0.03–0.18), while an additional group of expert psychopharmacologists identified optimal treatment for 9/140 (6.4 %, 95 %CI: 3.4–11.8 %; Cohen’s kappa = 0.03, 95 % CI 0.01–0.08).

Conclusion

An augmented language model demonstrated moderate agreement with expert recommendations and avoided contraindicated treatments, suggesting potential as a tool for supporting complex psychopharmacologic decision-making in treatment-resistant depression.
增强大语言模型对难治性抑郁症药物管理的临床决策支持
背景:我们评估了一个大型语言模型是否可以帮助选择精神药物治疗成人难治性抑郁症。方法:基于电子健康记录的分布,我们生成了20个反映成人治疗难治性抑郁症的临床小片段。每个小插曲由2名精神药理学专家进行评估,以确定并排名5个最佳的下一步药物干预措施,以及禁忌症或不良的下一步治疗。然后将小插图按性别和种族的随机顺序呈现给一个大型语言模型(Qwen 2.5:7B),并辅以已发表的治疗指南摘要。将模型输出与专家排名,以及社区临床医生的方便样本和额外的专家临床医生组进行比较。结果增强模型在114/320个样本中对专家指定的最优选择进行了优先排序(35.6 %,95 % CI 30.6 % -41.0 %;Cohen’s kappa = 0.34, 95 % CI 0.28-0.39)。没有任何模型选择属于不良或禁忌治疗的小插曲。当小插曲的性别或种族被排列以检查偏倚风险时,结果没有显著差异。社区临床医生样本确定了12/91的最佳治疗选择(13.2 %,95 % CI: 7.7-21.6 %;Cohen’s kappa = 0.10, 95 %CI 0.03-0.18),而另一组精神药理学专家确定了9/140的最佳治疗方法(6.4 %,95 %CI: 3.4-11.8 %;Cohen’s kappa = 0.03, 95 % CI 0.01-0.08)。结论增强语言模型与专家建议有一定程度的一致性,并避免了禁忌症的治疗,这表明它有可能成为支持难治性抑郁症复杂精神药理学决策的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of mood and anxiety disorders
Journal of mood and anxiety disorders Applied Psychology, Experimental and Cognitive Psychology, Clinical Psychology, Psychiatry and Mental Health, Psychology (General), Behavioral Neuroscience
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信