Machine learning to detect Alzheimer's disease with data on drugs and diagnoses.

IF 4.3 Q2 BUSINESS
Johanna Wallensten, Caroline Wachtler, Nenad Bogdanovic, Anna Olofsson, Miia Kivipelto, Linus Jönsson, Predrag Petrovic, Axel C Carlsson
{"title":"Machine learning to detect Alzheimer's disease with data on drugs and diagnoses.","authors":"Johanna Wallensten, Caroline Wachtler, Nenad Bogdanovic, Anna Olofsson, Miia Kivipelto, Linus Jönsson, Predrag Petrovic, Axel C Carlsson","doi":"10.1016/j.tjpad.2025.100115","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Integrating machine learning with medical records offers potential for early detection of Alzheimer's disease (AD), enabling timely interventions.</p><p><strong>Objectives: </strong>This study aimed to evaluate the effectiveness of machine learning in constructing a predictive model for AD, designed to predict AD with data up to three years before diagnosis. Using clinical data, including prior diagnoses and medical treatments, we sought to enhance sensitivity and specificity in diagnostic procedures. A second aim was to identify the most important factors in the machine learning models, as these may be important predictors of AD.</p><p><strong>Design: </strong>The study employed Stochastic Gradient Boosting, a machine learning method, to identify diagnoses predictive of AD using primary healthcare data. The analyses were stratified by sex and age groups.</p><p><strong>Setting: </strong>The study included individuals within Region Stockholm, Sweden, using medical records from 2010 to 2022.</p><p><strong>Participants: </strong>The study analyzed clinical data for individuals over the age of 40. Patients with an AD diagnosis (ICD-10-SE codes F00 or G30) during 2010-2012 were excluded to ensure prospective modeling. In total, AD was identified in 3,407 patients aged 41-69 years and 25,796 patients aged over 69.</p><p><strong>Measurements: </strong>The machine learning model ranked predictive diagnoses, with performance assessed by the area under the receiver operating characteristic curve (AUC). Known and novel predictors were evaluated for their contribution to AD risk.</p><p><strong>Results: </strong>AUC values ranged from 0.748 (women aged 41-69) to 0.816 (women over 69), with men across age groups falling within this range. Sensitivity and specificity ranged from 0.73 to 0.79 and 0.66 to 0.79, respectively, across age and gender groups. Negative predictive values were consistently high (≥0.954), while positive predictive values were lower (0.199-0.351). Additionally, we confirmed known risk factors as predictors and identified novel predictors that warrant further investigation. Key predictors included medical observations, cognitive symptoms, antidepressant treatment, visit frequency, and vitamin B12/folic acid treatment.</p><p><strong>Conclusions: </strong>Machine learning applied to clinical data shows promise in predicting AD, with robust model performance across age and sex groups. The findings confirmed known risk factors, such as depression and vitamin B12 deficiency, while also identifying novel predictors that may guide future research. Clinically, this approach could enhance early detection and risk stratification, facilitating timely interventions and improving patient outcomes.</p>","PeriodicalId":22711,"journal":{"name":"The Journal of Prevention of Alzheimer's Disease","volume":" ","pages":"100115"},"PeriodicalIF":4.3000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Prevention of Alzheimer's Disease","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.tjpad.2025.100115","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/8 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"BUSINESS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Integrating machine learning with medical records offers potential for early detection of Alzheimer's disease (AD), enabling timely interventions.

Objectives: This study aimed to evaluate the effectiveness of machine learning in constructing a predictive model for AD, designed to predict AD with data up to three years before diagnosis. Using clinical data, including prior diagnoses and medical treatments, we sought to enhance sensitivity and specificity in diagnostic procedures. A second aim was to identify the most important factors in the machine learning models, as these may be important predictors of AD.

Design: The study employed Stochastic Gradient Boosting, a machine learning method, to identify diagnoses predictive of AD using primary healthcare data. The analyses were stratified by sex and age groups.

Setting: The study included individuals within Region Stockholm, Sweden, using medical records from 2010 to 2022.

Participants: The study analyzed clinical data for individuals over the age of 40. Patients with an AD diagnosis (ICD-10-SE codes F00 or G30) during 2010-2012 were excluded to ensure prospective modeling. In total, AD was identified in 3,407 patients aged 41-69 years and 25,796 patients aged over 69.

Measurements: The machine learning model ranked predictive diagnoses, with performance assessed by the area under the receiver operating characteristic curve (AUC). Known and novel predictors were evaluated for their contribution to AD risk.

Results: AUC values ranged from 0.748 (women aged 41-69) to 0.816 (women over 69), with men across age groups falling within this range. Sensitivity and specificity ranged from 0.73 to 0.79 and 0.66 to 0.79, respectively, across age and gender groups. Negative predictive values were consistently high (≥0.954), while positive predictive values were lower (0.199-0.351). Additionally, we confirmed known risk factors as predictors and identified novel predictors that warrant further investigation. Key predictors included medical observations, cognitive symptoms, antidepressant treatment, visit frequency, and vitamin B12/folic acid treatment.

Conclusions: Machine learning applied to clinical data shows promise in predicting AD, with robust model performance across age and sex groups. The findings confirmed known risk factors, such as depression and vitamin B12 deficiency, while also identifying novel predictors that may guide future research. Clinically, this approach could enhance early detection and risk stratification, facilitating timely interventions and improving patient outcomes.

机器学习通过药物和诊断数据来检测阿尔茨海默病。
背景:将机器学习与医疗记录集成为早期发现阿尔茨海默病(AD)提供了可能,从而实现及时干预。目的:本研究旨在评估机器学习在构建阿尔茨海默病预测模型中的有效性,该模型旨在利用诊断前三年的数据预测阿尔茨海默病。利用临床数据,包括先前的诊断和治疗,我们试图提高诊断程序的敏感性和特异性。第二个目标是确定机器学习模型中最重要的因素,因为这些因素可能是AD的重要预测因素。设计:该研究采用随机梯度增强(一种机器学习方法),利用初级医疗保健数据识别AD的预测诊断。这些分析是按性别和年龄组分层的。环境:该研究包括瑞典斯德哥尔摩地区的个人,使用2010年至2022年的医疗记录。参与者:该研究分析了40岁以上个体的临床数据。2010-2012年期间被诊断为AD (ICD-10-SE代码F00或G30)的患者被排除,以确保建模的前瞻性。总共有3407例41-69岁的患者和25796例69岁以上的患者被确诊为AD。测量:机器学习模型对预测诊断进行排名,并通过接受者工作特征曲线(AUC)下的面积评估性能。评估了已知的和新的预测因子对AD风险的贡献。结果:AUC值从0.748(41-69岁女性)到0.816(69岁以上女性),各个年龄段的男性都在这个范围内。不同年龄和性别的敏感性和特异性分别为0.73 ~ 0.79和0.66 ~ 0.79。阴性预测值持续较高(≥0.954),阳性预测值持续较低(0.199 ~ 0.351)。此外,我们确认了已知的风险因素作为预测因素,并确定了值得进一步研究的新的预测因素。主要预测因素包括医学观察、认知症状、抗抑郁治疗、就诊频率和维生素B12/叶酸治疗。结论:机器学习应用于临床数据显示,在预测阿尔茨海默病方面有希望,模型在不同年龄和性别群体中的表现都很稳健。研究结果证实了已知的风险因素,如抑郁症和维生素B12缺乏,同时也确定了可能指导未来研究的新预测因素。在临床上,这种方法可以加强早期发现和风险分层,促进及时干预,改善患者预后。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
The Journal of Prevention of Alzheimer's Disease
The Journal of Prevention of Alzheimer's Disease Medicine-Psychiatry and Mental Health
CiteScore
9.20
自引率
0.00%
发文量
0
期刊介绍: The JPAD Journal of Prevention of Alzheimer’Disease will publish reviews, original research articles and short reports to improve our knowledge in the field of Alzheimer prevention including: neurosciences, biomarkers, imaging, epidemiology, public health, physical cognitive exercise, nutrition, risk and protective factors, drug development, trials design, and heath economic outcomes.JPAD will publish also the meeting abstracts from Clinical Trial on Alzheimer Disease (CTAD) and will be distributed both in paper and online version worldwide.We hope that JPAD with your contribution will play a role in the development of Alzheimer prevention.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信