利用电子健康记录识别阿尔茨海默病的常见疾病轨迹

Mingzhou Fu, Timothy S Chang
{"title":"利用电子健康记录识别阿尔茨海默病的常见疾病轨迹","authors":"Mingzhou Fu, Timothy S Chang","doi":"10.1101/2024.07.26.24311084","DOIUrl":null,"url":null,"abstract":"Backgrounds: Alzheimer's disease (AD), a leading cause of dementia, poses a growing global public health challenge. While recent studies have identified AD risk factors, they often focus on specific comorbidities, neglecting the complex interrelations and temporal dynamics. Our study addresses this by analyzing AD progression through longitudinal trajectories, utilizing clinical diagnoses over time. Using machine learning and network analysis, we created a computational framework to identify common AD progression patterns. Methods: We analyzed patient diagnoses from UC Health Data Warehouse's Electronic Health Records, coded with the International Classification of Diseases, version 10 (ICD-10). Using the Fine and Gray model to detect significant temporal risk factors between diagnoses, we examined associations between diagnosis pairs and refined the patients' diagnostic trajectories, delineating all possible trajectory combinations. These refined trajectories were compared using Dynamic Time Warping and grouped into clusters with hierarchical clustering. We investigated common AD trajectories through network analysis and compared patient demographics, symptoms, and AD manifestations across clusters. The Greedy Equivalence Search algorithm was used to infer causal relationships within these trajectories. We rigorously evaluated these trajectories through association tests and comparison to controls, Results: Our analysis included 24,473 eligible AD patients, which was filtered to include 5,762 patients with 6,794 unique AD progression trajectories. We identified four trajectory clusters: 1) a mental health cluster (e.g., anxiety disorder → depressive episode) (N_patient = 1,448); 2) an encephalopathy cluster (e.g., hypertension → other disorders of brain) (N_patient = 3,223); 3) a neurodegenerative disease cluster (e.g., transient cerebral ischemic attacks → other degenerative disease of nervous system) (N_patient = 1,502); and 4) a vascular disease cluster (e.g. hypertension → other cerebrovascular diseases) (N_patient = 1,446). Significant differences were observed in demographics, symptoms, and AD features across clusters. Causal analysis indicated that 26.2% of the identified trajectory connections were causal. We also observed patients with risk trajectories faced higher risks of AD compared to those without the trajectory or with only a single risk factor. Conclusion: We uncovered AD diagnosis trajectories, incorporating temporal aspects and causal relationships. These insights improve our understanding of AD development and AD subtypes, and can enhance risk assessment. Our findings can significantly benefit patient care and medical research by moving toward earlier and more accurate diagnoses, along with personalized treatment, such as medical risk factors management and lifestyle modifications.","PeriodicalId":501367,"journal":{"name":"medRxiv - Neurology","volume":"78 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identifying common disease trajectories of Alzheimer's disease with electronic health records\",\"authors\":\"Mingzhou Fu, Timothy S Chang\",\"doi\":\"10.1101/2024.07.26.24311084\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Backgrounds: Alzheimer's disease (AD), a leading cause of dementia, poses a growing global public health challenge. While recent studies have identified AD risk factors, they often focus on specific comorbidities, neglecting the complex interrelations and temporal dynamics. Our study addresses this by analyzing AD progression through longitudinal trajectories, utilizing clinical diagnoses over time. Using machine learning and network analysis, we created a computational framework to identify common AD progression patterns. Methods: We analyzed patient diagnoses from UC Health Data Warehouse's Electronic Health Records, coded with the International Classification of Diseases, version 10 (ICD-10). Using the Fine and Gray model to detect significant temporal risk factors between diagnoses, we examined associations between diagnosis pairs and refined the patients' diagnostic trajectories, delineating all possible trajectory combinations. These refined trajectories were compared using Dynamic Time Warping and grouped into clusters with hierarchical clustering. We investigated common AD trajectories through network analysis and compared patient demographics, symptoms, and AD manifestations across clusters. The Greedy Equivalence Search algorithm was used to infer causal relationships within these trajectories. We rigorously evaluated these trajectories through association tests and comparison to controls, Results: Our analysis included 24,473 eligible AD patients, which was filtered to include 5,762 patients with 6,794 unique AD progression trajectories. We identified four trajectory clusters: 1) a mental health cluster (e.g., anxiety disorder → depressive episode) (N_patient = 1,448); 2) an encephalopathy cluster (e.g., hypertension → other disorders of brain) (N_patient = 3,223); 3) a neurodegenerative disease cluster (e.g., transient cerebral ischemic attacks → other degenerative disease of nervous system) (N_patient = 1,502); and 4) a vascular disease cluster (e.g. hypertension → other cerebrovascular diseases) (N_patient = 1,446). Significant differences were observed in demographics, symptoms, and AD features across clusters. Causal analysis indicated that 26.2% of the identified trajectory connections were causal. We also observed patients with risk trajectories faced higher risks of AD compared to those without the trajectory or with only a single risk factor. Conclusion: We uncovered AD diagnosis trajectories, incorporating temporal aspects and causal relationships. These insights improve our understanding of AD development and AD subtypes, and can enhance risk assessment. Our findings can significantly benefit patient care and medical research by moving toward earlier and more accurate diagnoses, along with personalized treatment, such as medical risk factors management and lifestyle modifications.\",\"PeriodicalId\":501367,\"journal\":{\"name\":\"medRxiv - Neurology\",\"volume\":\"78 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv - Neurology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.07.26.24311084\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Neurology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.07.26.24311084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

背景:阿尔茨海默病(AD)是痴呆症的主要病因,对全球公共卫生构成日益严峻的挑战。虽然最近的研究已经确定了阿尔茨海默病的风险因素,但这些研究往往只关注特定的合并症,而忽视了复杂的相互关系和时间动态。为了解决这一问题,我们的研究通过纵向轨迹,利用一段时间内的临床诊断结果,分析渐冻人症的进展情况。利用机器学习和网络分析,我们创建了一个计算框架来识别常见的渐冻症进展模式。方法我们分析了加州大学健康数据仓库电子病历中的患者诊断,这些诊断是按照《国际疾病分类》第 10 版(ICD-10)编码的。利用 Fine 和 Gray 模型检测诊断之间的重要时间风险因素,我们检查了诊断对之间的关联,并完善了患者的诊断轨迹,划定了所有可能的轨迹组合。我们使用动态时间扭曲对这些细化的轨迹进行了比较,并通过分层聚类将其归类为群组。我们通过网络分析研究了常见的注意力缺失症轨迹,并比较了不同群组中患者的人口统计学特征、症状和注意力缺失症表现。我们使用贪婪等价搜索算法来推断这些轨迹中的因果关系。我们通过关联测试和与对照组的比较对这些轨迹进行了严格评估:我们的分析包括了 24,473 名符合条件的 AD 患者,经过筛选后包括了 5,762 名患者和 6,794 条独特的 AD 进展轨迹。我们确定了四个轨迹集群:1)心理健康集群(如焦虑症→抑郁发作)(N_patient = 1,448);2)脑病集群(如高血压→其他脑部疾病)(N_patient = 3,223);3)神经退行性疾病集群(如、一过性脑缺血发作 → 其他神经系统变性疾病)(患者人数 = 1,502 人);以及 4) 血管疾病群(如高血压 → 其他脑血管疾病)(患者人数 = 1,446 人)。不同群组在人口统计学、症状和注意力缺失症特征方面存在显著差异。因果分析表明,26.2% 的已识别轨迹联系是因果关系。我们还观察到,与没有风险轨迹或只有单一风险因素的患者相比,有风险轨迹的患者面临更高的注意力缺失症风险。结论我们发现了结合时间因素和因果关系的注意力缺失症诊断轨迹。这些见解增进了我们对注意力缺失症发展和注意力缺失症亚型的了解,并可加强风险评估。我们的研究结果将为患者护理和医学研究带来极大益处,使诊断更早、更准确,并提供个性化治疗,如医疗风险因素管理和生活方式调整。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Identifying common disease trajectories of Alzheimer's disease with electronic health records
Backgrounds: Alzheimer's disease (AD), a leading cause of dementia, poses a growing global public health challenge. While recent studies have identified AD risk factors, they often focus on specific comorbidities, neglecting the complex interrelations and temporal dynamics. Our study addresses this by analyzing AD progression through longitudinal trajectories, utilizing clinical diagnoses over time. Using machine learning and network analysis, we created a computational framework to identify common AD progression patterns. Methods: We analyzed patient diagnoses from UC Health Data Warehouse's Electronic Health Records, coded with the International Classification of Diseases, version 10 (ICD-10). Using the Fine and Gray model to detect significant temporal risk factors between diagnoses, we examined associations between diagnosis pairs and refined the patients' diagnostic trajectories, delineating all possible trajectory combinations. These refined trajectories were compared using Dynamic Time Warping and grouped into clusters with hierarchical clustering. We investigated common AD trajectories through network analysis and compared patient demographics, symptoms, and AD manifestations across clusters. The Greedy Equivalence Search algorithm was used to infer causal relationships within these trajectories. We rigorously evaluated these trajectories through association tests and comparison to controls, Results: Our analysis included 24,473 eligible AD patients, which was filtered to include 5,762 patients with 6,794 unique AD progression trajectories. We identified four trajectory clusters: 1) a mental health cluster (e.g., anxiety disorder → depressive episode) (N_patient = 1,448); 2) an encephalopathy cluster (e.g., hypertension → other disorders of brain) (N_patient = 3,223); 3) a neurodegenerative disease cluster (e.g., transient cerebral ischemic attacks → other degenerative disease of nervous system) (N_patient = 1,502); and 4) a vascular disease cluster (e.g. hypertension → other cerebrovascular diseases) (N_patient = 1,446). Significant differences were observed in demographics, symptoms, and AD features across clusters. Causal analysis indicated that 26.2% of the identified trajectory connections were causal. We also observed patients with risk trajectories faced higher risks of AD compared to those without the trajectory or with only a single risk factor. Conclusion: We uncovered AD diagnosis trajectories, incorporating temporal aspects and causal relationships. These insights improve our understanding of AD development and AD subtypes, and can enhance risk assessment. Our findings can significantly benefit patient care and medical research by moving toward earlier and more accurate diagnoses, along with personalized treatment, such as medical risk factors management and lifestyle modifications.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信