Prediction of early‐onset bipolar using electronic health records

Impact factor: 6.5; Zone 1, Medicine; JCR Q1, Psychiatry
Bo Wang, Yi‐Han Sheu, Hyunjoon Lee, Robert G. Mealer, Victor M. Castro, Jordan W. Smoller
Journal: Journal of Child Psychology and Psychiatry
Publication date: 2025-02-19
Publication type: Journal Article
DOI: https://doi.org/10.1111/jcpp.14131

Abstract

Background: Early identification of bipolar disorder (BD) provides an important opportunity for timely intervention. In this study, we aimed to develop machine learning models using large‐scale electronic health record (EHR) data, including clinical notes, for predicting early‐onset BD.

Methods: Structured and unstructured data were extracted from the longitudinal EHR of the Mass General Brigham health system. We defined three cohorts aged 10–25 years: (1) the full youth cohort (N = 300,398); (2) a subcohort defined by having a mental health visit (N = 105,461); and (3) a subcohort defined by having a diagnosis of mood disorder or ADHD (N = 35,213). By adopting a prospective landmark modeling approach that aligns with clinical practice, we developed and validated a range of machine learning models across different cohorts and prediction windows.

Results: The two tree‐based models, random forests (RF) and light gradient‐boosting machine (LGBM), achieved good discriminative performance across different clinical settings (area under the receiver operating characteristic curve 0.76–0.88 for RF and 0.74–0.89 for LGBM). In addition, we showed that comparable performance can be achieved with a greatly reduced set of features, demonstrating that computational efficiency can be attained without significant compromise of model accuracy.

Conclusions: Good discriminative performance for models predicting early‐onset BD can be achieved using large‐scale EHR data. Our study offers a scalable and accurate method for identifying youth at risk for BD that could help inform clinical decision‐making and facilitate early intervention. Future work includes evaluating the portability of our approach to other healthcare systems and exploring considerations regarding possible implementation.
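The abstract names a landmark design and reports AUROC, but gives no implementation details. The following is a minimal sketch of those two ideas under stated assumptions, not the authors' code: a patient is labeled at a landmark date by whether a first BD diagnosis falls within a fixed prediction window, patients already diagnosed at the landmark are excluded, and AUROC is computed as the fraction of positive/negative pairs ranked correctly. The names `landmark_label`, `auroc`, and `window_days` are hypothetical, and censoring (patients without follow-up through the whole window) is deliberately not handled.

```python
from datetime import date, timedelta
from typing import Optional, Sequence


def landmark_label(landmark: date,
                   first_bd_dx: Optional[date],
                   window_days: int) -> Optional[int]:
    """Label one patient at one landmark date (illustrative assumption).

    Returns None if BD was diagnosed on or before the landmark (the patient
    is no longer at risk), 1 if the first BD diagnosis falls inside the
    prediction window after the landmark, and 0 otherwise.
    """
    if first_bd_dx is not None and first_bd_dx <= landmark:
        return None
    window_end = landmark + timedelta(days=window_days)
    if first_bd_dx is not None and first_bd_dx <= window_end:
        return 1
    return 0


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case.
    Tied scores count as one half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Synthetic example: landmark on 2020-01-01 with a one-year window.
    lm = date(2020, 1, 1)
    print(landmark_label(lm, date(2020, 6, 1), 365))  # diagnosed inside window
    print(landmark_label(lm, None, 365))              # never diagnosed
    print(auroc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))
```

The pairwise AUROC is quadratic in cohort size, so it is suitable only for illustration; at the cohort sizes reported above, a rank-based implementation from an established library would be the practical choice.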
Source journal metrics: CiteScore 13.80; self-citation rate 5.30%; articles published: 169; review time: 1 month.
About the journal: The Journal of Child Psychology and Psychiatry (JCPP) is an international publication focused on child and adolescent psychology and psychiatry. It publishes clinically relevant research across the disciplines related to these areas, has a broad global readership, and covers a diverse range of topics, including:

Epidemiology: Studies on the prevalence and distribution of mental health issues in children and adolescents.
Diagnosis: Research on the identification and classification of childhood disorders.
Treatments: Psychotherapeutic and psychopharmacological interventions for child and adolescent mental health.
Behavior and Cognition: Studies on the behavioral and cognitive aspects of childhood disorders.
Neuroscience and Neurobiology: Research on the neural and biological underpinnings of child mental health.
Genetics: Genetic factors contributing to the development of childhood disorders.

JCPP integrates empirical research, clinical studies, and reviews from diverse perspectives, theoretical viewpoints, and disciplines. The journal is published 12 times a year and is affiliated with the Association for Child and Adolescent Mental Health (ACAMH), which supports its mission to advance knowledge and practice in child and adolescent mental health.