A highly scalable deep learning language model for common risks prediction among psychiatric inpatients.

IF 7 1区医学 Q1 MEDICINE, GENERAL & INTERNAL

BMC Medicine Pub Date : 2025-05-28 DOI:10.1186/s12916-025-04150-7

Enzhao Zhu, Jiayi Wang, Guoquan Zhou, Chunbo Li, Fazhan Chen, Kang Ju, Liangliang Chen, Yichao Yin, Yi Chen, Yanping Zhang, Xu Zhang, Xinlin Zhou, Zongyuan Wang, Jianping Qiu, Hui Wang, Weizhong Shi, Feng Wang, Dong Wang, Zhihao Chen, Jiaojiao Hou, Hui Li, Zisheng Ai

{"title":"A highly scalable deep learning language model for common risks prediction among psychiatric inpatients.","authors":"Enzhao Zhu, Jiayi Wang, Guoquan Zhou, Chunbo Li, Fazhan Chen, Kang Ju, Liangliang Chen, Yichao Yin, Yi Chen, Yanping Zhang, Xu Zhang, Xinlin Zhou, Zongyuan Wang, Jianping Qiu, Hui Wang, Weizhong Shi, Feng Wang, Dong Wang, Zhihao Chen, Jiaojiao Hou, Hui Li, Zisheng Ai","doi":"10.1186/s12916-025-04150-7","DOIUrl":null,"url":null,"abstract":"Background: There is a lack of studies exploring the performance of Transformers-based language models in common risks assessment among psychiatric inpatients. We aim to develop a scalable risk assessment model using multidimensional textualized data and test the stability, robustness, and benefit of this approach.Methods: In this real-world cohort study, a deep learning language model was developed and validated using first hospitalized cases diagnosed with schizophrenia, bipolar disorder, and depressive disorder between January 2016 and March 2023 in three hospitals. The algorithm was externally validated on an independent testing cohort comprising 1180 patients. A total of 140 features, including first medical records (FMR), laboratory examinations, medical orders, and psychological scales, were assessed for analysis. The outcomes were short- and long-term impulsivity (STI and LTI), risk of suicide (STSS and LTSS), and need of physical restraint (STPR and LTPR) assessed by qualified nurses or clinicians. Analysis was carried out between August 2024 and June 2024. Models with different architectures and input settings were compared with each other. The area under the receiver operating characteristic curve (AUROC) was used to assess the primary performance of models. The clinical utility was determined by the net benefit under Youden's threshold.Results: Of 7451 patients included in this study, 2982 (47.6%) were male, and the median (interquartile range) age was 42 (28-57) years. The overall incidence of outcomes was 635 (8.5%), 728 (10.5%), 659 (8.8%), 803 (10.8%), 588 (7.9%), and 728 (9.8%) for STPR, LTPR, STSS, LTSS, STI, and LTI, respectively. The multitask semi-structured Transformers-based language (SSTL) model showed more promising AUROCs (STPR: 0.915; LTPR: 0.844; STSS: 0.867; LTSS: 0.879; STI: 0.899; LTI: 0.894) in the prediction of these outcomes than single-tasked or multimodal language models and traditional structured data models. Combining FMR with other data from electronic health records led to significant improvements in the performance and clinical utility of SSTL models based on demographic, diagnosis, laboratory tests, treatment, and psychological scales.Conclusions: The SSTL model shows potential advantages in prognostic evaluation. FMR is a strong predictor for common risks prediction and may benefit other tasks in psychiatry with minimum requirements for data and data processing.","PeriodicalId":9188,"journal":{"name":"BMC Medicine","volume":"23 1","pages":"308"},"PeriodicalIF":7.0000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12121029/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12916-025-04150-7","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}

引用次数: 0

Abstract

Background: There is a lack of studies exploring the performance of Transformers-based language models in common risks assessment among psychiatric inpatients. We aim to develop a scalable risk assessment model using multidimensional textualized data and test the stability, robustness, and benefit of this approach.

Methods: In this real-world cohort study, a deep learning language model was developed and validated using first hospitalized cases diagnosed with schizophrenia, bipolar disorder, and depressive disorder between January 2016 and March 2023 in three hospitals. The algorithm was externally validated on an independent testing cohort comprising 1180 patients. A total of 140 features, including first medical records (FMR), laboratory examinations, medical orders, and psychological scales, were assessed for analysis. The outcomes were short- and long-term impulsivity (STI and LTI), risk of suicide (STSS and LTSS), and need of physical restraint (STPR and LTPR) assessed by qualified nurses or clinicians. Analysis was carried out between August 2024 and June 2024. Models with different architectures and input settings were compared with each other. The area under the receiver operating characteristic curve (AUROC) was used to assess the primary performance of models. The clinical utility was determined by the net benefit under Youden's threshold.

Results: Of 7451 patients included in this study, 2982 (47.6%) were male, and the median (interquartile range) age was 42 (28-57) years. The overall incidence of outcomes was 635 (8.5%), 728 (10.5%), 659 (8.8%), 803 (10.8%), 588 (7.9%), and 728 (9.8%) for STPR, LTPR, STSS, LTSS, STI, and LTI, respectively. The multitask semi-structured Transformers-based language (SSTL) model showed more promising AUROCs (STPR: 0.915; LTPR: 0.844; STSS: 0.867; LTSS: 0.879; STI: 0.899; LTI: 0.894) in the prediction of these outcomes than single-tasked or multimodal language models and traditional structured data models. Combining FMR with other data from electronic health records led to significant improvements in the performance and clinical utility of SSTL models based on demographic, diagnosis, laboratory tests, treatment, and psychological scales.

Conclusions: The SSTL model shows potential advantages in prognostic evaluation. FMR is a strong predictor for common risks prediction and may benefit other tasks in psychiatry with minimum requirements for data and data processing.

查看原文本刊更多论文

用于精神科住院患者常见风险预测的高度可扩展深度学习语言模型。

背景：基于transformer的语言模型在精神科住院患者常见风险评估中的表现缺乏研究。我们的目标是使用多维文本化数据开发可扩展的风险评估模型，并测试该方法的稳定性、鲁棒性和优势。方法：在这项现实世界的队列研究中，研究人员开发了一个深度学习语言模型，并对2016年1月至2023年3月期间三家医院诊断为精神分裂症、双相情感障碍和抑郁症的首次住院病例进行了验证。该算法在包括1180名患者的独立测试队列中进行了外部验证。共评估140项特征，包括首次医疗记录（FMR）、实验室检查、医嘱和心理量表。结果包括短期和长期冲动（STI和LTI）、自杀风险（STSS和LTSS）和身体约束需求（STPR和LTPR），由合格的护士或临床医生评估。分析在2024年8月至2024年6月期间进行。对不同体系结构和输入设置的模型进行了比较。使用受试者工作特征曲线下面积（AUROC）来评估模型的主要性能。临床效用由约登阈值下的净收益决定。结果：本研究纳入的7451例患者中，男性2982例（47.6%），年龄中位数（四分位数间距）为42岁（28-57岁）。STPR、LTPR、STSS、LTSS、STI和LTI的总结局发生率分别为635（8.5%）、728（10.5%）、659（8.8%）、803（10.8%）、588（7.9%）和728（9.8%）。多任务半结构化的基于transformer的语言（SSTL）模型显示出更有希望的auroc (STPR: 0.915；LTPR: 0.844;STSS: 0.867;lts: 0.879;STI: 0.899;LTI: 0.894)在预测这些结果方面优于单任务或多模态语言模型和传统的结构化数据模型。将FMR与来自电子健康记录的其他数据相结合，可以显著改善基于人口统计学、诊断、实验室测试、治疗和心理量表的SSTL模型的性能和临床效用。结论：SSTL模型在预后评估中具有潜在优势。FMR是一种对常见风险预测的强大预测器，并且可能对精神病学中对数据和数据处理要求最低的其他任务有益。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Medicine 医学-医学：内科

CiteScore

13.10

自引率

1.10%

发文量

435

审稿时长

4-8 weeks

期刊介绍： BMC Medicine is an open access, transparent peer-reviewed general medical journal. It is the flagship journal of the BMC series and publishes outstanding and influential research in various areas including clinical practice, translational medicine, medical and health advances, public health, global health, policy, and general topics of interest to the biomedical and sociomedical professional communities. In addition to research articles, the journal also publishes stimulating debates, reviews, unique forum articles, and concise tutorials. All articles published in BMC Medicine are included in various databases such as Biological Abstracts, BIOSIS, CAS, Citebase, Current contents, DOAJ, Embase, MEDLINE, PubMed, Science Citation Index Expanded, OAIster, SCImago, Scopus, SOCOLAR, and Zetoc.