Characterizing research domain criteria symptoms among psychiatric inpatients using large language models

Thomas H. McCoy , Roy H. Perlis
{"title":"Characterizing research domain criteria symptoms among psychiatric inpatients using large language models","authors":"Thomas H. McCoy ,&nbsp;Roy H. Perlis","doi":"10.1016/j.xjmad.2024.100079","DOIUrl":null,"url":null,"abstract":"<div><p>We sought to characterize the ability of large language models to estimate NIMH Research Domain Criteria dimensions from narrative clinical notes of adult psychiatric inpatients, deriving estimate of overall burden of symptoms in each domain. We extracted consecutive admissions to a psychiatric inpatient unit between December 23, 2009 and September 27, 2015 from the electronic health records of a large academic medical center. Admission and discharge notes were scored with a HIPAA-compliant instance of a large language model (gpt-4–1106-preview). To examine convergent validity, the resulting estimates were correlated with those derived using an earlier method; for predictive validity, they were examined for association with length of hospitalization and probability of readmission. The cohort included 3619 individuals, 1779 female (49 %), 1840 male (51 %) with mean age 44 (SD=16.6). We identified modest correlations between LLM-derived RDoC scores and a previously validated scoring method, with Kendall’s tau between from.07 for arousal and 0.27 for positive and cognitive domains (p &lt; .001 for all of these). For admission notes, greater scores on cognitive, sensorimotor, negative, and social domains were significantly associated with longer length of hospitalization in linear regression models including sociodemographic features (p &lt; .01 for all of these); positive valence was associated with shorter hospitalization (p &lt; .001). For discharge notes, social, arousal, and positive valence were associated with likelihood of readmission within 180 days in adjusted logistic regression models (p &lt; .05 for social and arousal, p &lt; .001 for positive valence). Overall, LLM-derived estimates of RDoC psychopathology demonstrated promising convergent and predictive validity, suggesting this approach may make real-world application of the RDoC framework more feasible.</p></div>","PeriodicalId":73841,"journal":{"name":"Journal of mood and anxiety disorders","volume":"8 ","pages":"Article 100079"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2950004424000336/pdfft?md5=713aa34765aff14b5d678c6003870dac&pid=1-s2.0-S2950004424000336-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of mood and anxiety disorders","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2950004424000336","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We sought to characterize the ability of large language models to estimate NIMH Research Domain Criteria dimensions from narrative clinical notes of adult psychiatric inpatients, deriving estimate of overall burden of symptoms in each domain. We extracted consecutive admissions to a psychiatric inpatient unit between December 23, 2009 and September 27, 2015 from the electronic health records of a large academic medical center. Admission and discharge notes were scored with a HIPAA-compliant instance of a large language model (gpt-4–1106-preview). To examine convergent validity, the resulting estimates were correlated with those derived using an earlier method; for predictive validity, they were examined for association with length of hospitalization and probability of readmission. The cohort included 3619 individuals, 1779 female (49 %), 1840 male (51 %) with mean age 44 (SD=16.6). We identified modest correlations between LLM-derived RDoC scores and a previously validated scoring method, with Kendall’s tau between from.07 for arousal and 0.27 for positive and cognitive domains (p < .001 for all of these). For admission notes, greater scores on cognitive, sensorimotor, negative, and social domains were significantly associated with longer length of hospitalization in linear regression models including sociodemographic features (p < .01 for all of these); positive valence was associated with shorter hospitalization (p < .001). For discharge notes, social, arousal, and positive valence were associated with likelihood of readmission within 180 days in adjusted logistic regression models (p < .05 for social and arousal, p < .001 for positive valence). Overall, LLM-derived estimates of RDoC psychopathology demonstrated promising convergent and predictive validity, suggesting this approach may make real-world application of the RDoC framework more feasible.

使用大型语言模型表征精神病住院患者的研究领域标准症状
我们试图描述大型语言模型从成年精神病住院患者的叙述性临床笔记中估算 NIMH 研究领域标准维度的能力,从而得出每个领域的总体症状负担估算值。我们从一家大型学术医疗中心的电子健康记录中提取了 2009 年 12 月 23 日至 2015 年 9 月 27 日期间精神科住院病人的连续入院记录。入院和出院记录使用符合 HIPAA 标准的大型语言模型实例(gpt-4-1106-preview)进行评分。为了检查收敛有效性,我们将得出的估计值与使用早期方法得出的估计值进行了关联;为了检查预测有效性,我们检查了估计值与住院时间和再入院概率的关联。队列包括 3619 人,其中女性 1779 人(占 49%),男性 1840 人(占 51%),平均年龄 44 岁(SD=16.6)。我们发现,LLM 导出的 RDoC 分数与之前验证的评分方法之间存在适度的相关性,唤醒的 Kendall's tau 值为 0.07,积极和认知领域的 Kendall's tau 值为 0.27(均为 0.001)。就入院记录而言,在包括社会人口特征在内的线性回归模型中,认知、感觉运动、消极和社交领域得分越高,住院时间越长(所有这些因素的 p < .01);积极情绪与住院时间越短越相关(p < .001)。就出院记录而言,在调整后的逻辑回归模型中,社交、唤醒和积极情绪与 180 天内再次入院的可能性相关(社交和唤醒的 p < .05,积极情绪的 p < .001)。总之,LLM 得出的 RDoC 精神病理学估计值表现出了良好的收敛性和预测性,这表明这种方法可能会使 RDoC 框架在现实世界中的应用更加可行。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of mood and anxiety disorders
Journal of mood and anxiety disorders Applied Psychology, Experimental and Cognitive Psychology, Clinical Psychology, Psychiatry and Mental Health, Psychology (General), Behavioral Neuroscience
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信