使用大型语言模型对儿童和青少年的精神病理学进行维度测量。

IF 9.6 1区 医学 Q1 NEUROSCIENCES
Biological Psychiatry Pub Date : 2024-12-15 Epub Date: 2024-06-10 DOI:10.1016/j.biopsych.2024.05.008
Thomas H McCoy, Roy H Perlis
{"title":"使用大型语言模型对儿童和青少年的精神病理学进行维度测量。","authors":"Thomas H McCoy, Roy H Perlis","doi":"10.1016/j.biopsych.2024.05.008","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>To enable greater use of National Institute of Mental Health Research Domain Criteria (RDoC) in real-world settings, we applied large language models (LLMs) to estimate dimensional psychopathology from narrative clinical notes.</p><p><strong>Methods: </strong>We conducted a cohort study using health records from individuals age ≤18 years evaluated in the psychiatric emergency department of a large academic medical center between November 2008 and March 2015. Outcomes were hospital admission and length of emergency department stay. RDoC domains were estimated using a Health Insurance Portability and Accountability Act-compliant LLM (gpt-4-1106-preview) and compared with a previously validated token-based approach.</p><p><strong>Results: </strong>The cohort included 3059 individuals (median age 16 years [interquartile range, 13-18]; 1580 [52%] female, 1479 [48%] male; 105 [3.4%] identified as Asian, 329 [11%] as Black, 288 [9.4%] as Hispanic, 474 [15%] as other race, and 1863 [61%] as White), of whom 1695 (55%) were admitted. Correlation between LLM-extracted RDoC scores and the token-based scores ranged from small to medium as assessed by Kendall's tau (0.14-0.22). In logistic regression models adjusting for sociodemographic and clinical features, admission likelihood was associated with greater scores on all domains, with the exception of the sensorimotor domain, which was inversely associated (p < .001 for all adjusted associations). Tests for bias suggested modest but statistically significant differences in positive valence scores by race (p < .05 for Asian, Black, and Hispanic individuals).</p><p><strong>Conclusions: </strong>An LLM extracted estimates of 6 RDoC domains in an explainable manner, which were associated with clinical outcomes. This approach can contribute to a new generation of prediction models or biological investigations based on dimensional psychopathology.</p>","PeriodicalId":8918,"journal":{"name":"Biological Psychiatry","volume":null,"pages":null},"PeriodicalIF":9.6000,"publicationDate":"2024-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dimensional Measures of Psychopathology in Children and Adolescents Using Large Language Models.\",\"authors\":\"Thomas H McCoy, Roy H Perlis\",\"doi\":\"10.1016/j.biopsych.2024.05.008\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>To enable greater use of National Institute of Mental Health Research Domain Criteria (RDoC) in real-world settings, we applied large language models (LLMs) to estimate dimensional psychopathology from narrative clinical notes.</p><p><strong>Methods: </strong>We conducted a cohort study using health records from individuals age ≤18 years evaluated in the psychiatric emergency department of a large academic medical center between November 2008 and March 2015. Outcomes were hospital admission and length of emergency department stay. RDoC domains were estimated using a Health Insurance Portability and Accountability Act-compliant LLM (gpt-4-1106-preview) and compared with a previously validated token-based approach.</p><p><strong>Results: </strong>The cohort included 3059 individuals (median age 16 years [interquartile range, 13-18]; 1580 [52%] female, 1479 [48%] male; 105 [3.4%] identified as Asian, 329 [11%] as Black, 288 [9.4%] as Hispanic, 474 [15%] as other race, and 1863 [61%] as White), of whom 1695 (55%) were admitted. Correlation between LLM-extracted RDoC scores and the token-based scores ranged from small to medium as assessed by Kendall's tau (0.14-0.22). In logistic regression models adjusting for sociodemographic and clinical features, admission likelihood was associated with greater scores on all domains, with the exception of the sensorimotor domain, which was inversely associated (p < .001 for all adjusted associations). Tests for bias suggested modest but statistically significant differences in positive valence scores by race (p < .05 for Asian, Black, and Hispanic individuals).</p><p><strong>Conclusions: </strong>An LLM extracted estimates of 6 RDoC domains in an explainable manner, which were associated with clinical outcomes. This approach can contribute to a new generation of prediction models or biological investigations based on dimensional psychopathology.</p>\",\"PeriodicalId\":8918,\"journal\":{\"name\":\"Biological Psychiatry\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":9.6000,\"publicationDate\":\"2024-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biological Psychiatry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.biopsych.2024.05.008\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/6/10 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biological Psychiatry","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.biopsych.2024.05.008","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/6/10 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

背景:为了在真实世界环境中更广泛地使用 NIMH 研究领域标准(RDoC),我们应用大型语言模型从叙述性临床笔记中估算出精神病理学的维度:我们利用 2008 年 11 月至 2015 年 3 月间在一家大型学术医疗中心精神科急诊室接受评估的 18 岁或以下患者的健康记录开展了一项队列研究。研究结果包括入院率和急诊科住院时间。使用符合 HIPAA 标准的大型语言模型(gpt-4-1106-preview)估算了 RDoC 域,并与之前经过验证的基于标记的方法进行了比较:研究对象包括 3059 人(中位年龄 16 岁(25%-75% 为 13-18 岁);女性 1580 人(52%),男性 1479 人(48%);亚裔 105 人(3.4%),黑人 329 人(11%),西班牙裔 288 人(9.4%),其他种族 474 人(15%),白人 1863 人(61%)),其中 1695 人(55%)入院。根据 Kendall's Tau(0.14-0.22),LLM 提取的 RDoC 分数与基于代币的分数之间的相关性从较小到中等不等。在调整了社会人口学和临床特征的逻辑回归模型中,入院可能性与所有领域的较高得分相关,但感觉运动领域除外,该领域与入院可能性成反比(p 结论:一个大型语言模型以可解释的方式提取了 6 个 RDoC 领域的估计值,这些估计值与临床结果相关。这种方法有助于新一代的预测模型或基于维度精神病理学的生物学研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Dimensional Measures of Psychopathology in Children and Adolescents Using Large Language Models.

Background: To enable greater use of National Institute of Mental Health Research Domain Criteria (RDoC) in real-world settings, we applied large language models (LLMs) to estimate dimensional psychopathology from narrative clinical notes.

Methods: We conducted a cohort study using health records from individuals age ≤18 years evaluated in the psychiatric emergency department of a large academic medical center between November 2008 and March 2015. Outcomes were hospital admission and length of emergency department stay. RDoC domains were estimated using a Health Insurance Portability and Accountability Act-compliant LLM (gpt-4-1106-preview) and compared with a previously validated token-based approach.

Results: The cohort included 3059 individuals (median age 16 years [interquartile range, 13-18]; 1580 [52%] female, 1479 [48%] male; 105 [3.4%] identified as Asian, 329 [11%] as Black, 288 [9.4%] as Hispanic, 474 [15%] as other race, and 1863 [61%] as White), of whom 1695 (55%) were admitted. Correlation between LLM-extracted RDoC scores and the token-based scores ranged from small to medium as assessed by Kendall's tau (0.14-0.22). In logistic regression models adjusting for sociodemographic and clinical features, admission likelihood was associated with greater scores on all domains, with the exception of the sensorimotor domain, which was inversely associated (p < .001 for all adjusted associations). Tests for bias suggested modest but statistically significant differences in positive valence scores by race (p < .05 for Asian, Black, and Hispanic individuals).

Conclusions: An LLM extracted estimates of 6 RDoC domains in an explainable manner, which were associated with clinical outcomes. This approach can contribute to a new generation of prediction models or biological investigations based on dimensional psychopathology.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biological Psychiatry
Biological Psychiatry 医学-精神病学
CiteScore
18.80
自引率
2.80%
发文量
1398
审稿时长
33 days
期刊介绍: Biological Psychiatry is an official journal of the Society of Biological Psychiatry and was established in 1969. It is the first journal in the Biological Psychiatry family, which also includes Biological Psychiatry: Cognitive Neuroscience and Neuroimaging and Biological Psychiatry: Global Open Science. The Society's main goal is to promote excellence in scientific research and education in the fields related to the nature, causes, mechanisms, and treatments of disorders pertaining to thought, emotion, and behavior. To fulfill this mission, Biological Psychiatry publishes peer-reviewed, rapid-publication articles that present new findings from original basic, translational, and clinical mechanistic research, ultimately advancing our understanding of psychiatric disorders and their treatment. The journal also encourages the submission of reviews and commentaries on current research and topics of interest.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信