Mica Xu Ji, Marjola Thanaj, Léna Nehale-Ezzine, Brandon Whitcher, E Louise Thomas, Jimmy D Bell
{"title":"深度学习从英国生物银行的血液和身体成分生物标志物中预测38种与年龄相关的疾病的发病加速。","authors":"Mica Xu Ji, Marjola Thanaj, Léna Nehale-Ezzine, Brandon Whitcher, E Louise Thomas, Jimmy D Bell","doi":"10.1007/s11357-025-01702-w","DOIUrl":null,"url":null,"abstract":"<p><p>A major challenge in multimorbid aging is understanding how diseases co-occur and identifying high-risk groups for accelerated disease development, but to date associations in the relative onset acceleration of disease diagnoses have not been used to characterize disease patterns. This study presents the development and evaluation of a neural network Cox model for predicting onset acceleration risk for age-associated conditions, using demographic, anthropomorphic, imaging, and blood biomarker traits from 60,396 individuals and 218,530 outcome events from the UK Biobank. Risk prediction was evaluated with Harrell's concordance index (C-index). The model performed well on internal (C-index <math><mrow><mn>0.6830</mn> <mo>±</mo> <mn>0.0902</mn></mrow> </math> , <math><mrow><mi>n</mi> <mo>=</mo> <mn>8</mn> <mo>,</mo> <mn>931</mn></mrow> </math> ) and external (C-index <math><mrow><mn>0.6461</mn> <mo>±</mo> <mn>0.1264</mn></mrow> </math> , <math><mrow><mi>n</mi> <mo>=</mo> <mn>855</mn></mrow> </math> ) test sets, attaining C-index <math><mrow><mo>≥</mo> <mn>0.6</mn></mrow> </math> on 38 out of 47 ( <math><mrow><mn>80.9</mn> <mo>%</mo></mrow> </math> ) conditions. Inclusion of body composition and blood biomarker input traits was independently important for predictive performance. Kaplan-Meier curves for predicted risk quartiles (log-rank <math><mrow><mi>p</mi> <mo>≤</mo> <mn>1.16</mn> <mi>E</mi> <mo>-</mo> <mn>16</mn></mrow> </math> ) indicated robust stratification of individuals into high and low risk groups. Analysis of risk quartiles revealed cardiometabolic, vascular-neuropsychiatric, and digestive-neuropsychiatric disease clusters with strong statistically significant inter-correlated onset acceleration ( <math><mrow><mi>r</mi> <mo>≥</mo> <mn>0.6</mn></mrow> </math> , <math><mrow><mi>p</mi> <mo>≤</mo> <mn>3.46</mn> <mi>E</mi> <mo>-</mo> <mn>5</mn></mrow> </math> ), while 13 and 19 conditions were strongly associated with onset acceleration of all-cause mortality and all-cause morbidity, respectively. In prognostic survival analysis, the proportional hazards assumption was met (Schoenfeld residual <math><mrow><mi>p</mi> <mo>></mo> <mn>0.05</mn></mrow> </math> ) in 435 out of 435 or 100% (1238 out of 1334 or 92.8%) of cases across outcomes, <math><mrow><mi>a</mi> <mi>H</mi> <mi>R</mi> <mo>=</mo> <mn>6.11</mn> <mo>±</mo> <mn>9.00</mn></mrow> </math> ( <math><mrow><mi>a</mi> <mi>H</mi> <mi>R</mi> <mo>=</mo> <mn>3.67</mn> <mo>±</mo> <mn>5.78</mn></mrow> </math> ) with (without) Bonferroni correction. The neural architecture of OnsetNet was interpreted with saliency analysis, and several significant body composition and blood biomarkers were identified. The results demonstrate that neural network survival models are able to estimate prognostically informative onset acceleration risk, which could be used to improve understanding of synchronicity in the onset of age-associated diseases and reprioritize patients based on disease-specific risk.</p>","PeriodicalId":12730,"journal":{"name":"GeroScience","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep learning predicts onset acceleration of 38 age-associated diseases from blood and body composition biomarkers in the UK Biobank.\",\"authors\":\"Mica Xu Ji, Marjola Thanaj, Léna Nehale-Ezzine, Brandon Whitcher, E Louise Thomas, Jimmy D Bell\",\"doi\":\"10.1007/s11357-025-01702-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>A major challenge in multimorbid aging is understanding how diseases co-occur and identifying high-risk groups for accelerated disease development, but to date associations in the relative onset acceleration of disease diagnoses have not been used to characterize disease patterns. This study presents the development and evaluation of a neural network Cox model for predicting onset acceleration risk for age-associated conditions, using demographic, anthropomorphic, imaging, and blood biomarker traits from 60,396 individuals and 218,530 outcome events from the UK Biobank. Risk prediction was evaluated with Harrell's concordance index (C-index). The model performed well on internal (C-index <math><mrow><mn>0.6830</mn> <mo>±</mo> <mn>0.0902</mn></mrow> </math> , <math><mrow><mi>n</mi> <mo>=</mo> <mn>8</mn> <mo>,</mo> <mn>931</mn></mrow> </math> ) and external (C-index <math><mrow><mn>0.6461</mn> <mo>±</mo> <mn>0.1264</mn></mrow> </math> , <math><mrow><mi>n</mi> <mo>=</mo> <mn>855</mn></mrow> </math> ) test sets, attaining C-index <math><mrow><mo>≥</mo> <mn>0.6</mn></mrow> </math> on 38 out of 47 ( <math><mrow><mn>80.9</mn> <mo>%</mo></mrow> </math> ) conditions. Inclusion of body composition and blood biomarker input traits was independently important for predictive performance. Kaplan-Meier curves for predicted risk quartiles (log-rank <math><mrow><mi>p</mi> <mo>≤</mo> <mn>1.16</mn> <mi>E</mi> <mo>-</mo> <mn>16</mn></mrow> </math> ) indicated robust stratification of individuals into high and low risk groups. Analysis of risk quartiles revealed cardiometabolic, vascular-neuropsychiatric, and digestive-neuropsychiatric disease clusters with strong statistically significant inter-correlated onset acceleration ( <math><mrow><mi>r</mi> <mo>≥</mo> <mn>0.6</mn></mrow> </math> , <math><mrow><mi>p</mi> <mo>≤</mo> <mn>3.46</mn> <mi>E</mi> <mo>-</mo> <mn>5</mn></mrow> </math> ), while 13 and 19 conditions were strongly associated with onset acceleration of all-cause mortality and all-cause morbidity, respectively. In prognostic survival analysis, the proportional hazards assumption was met (Schoenfeld residual <math><mrow><mi>p</mi> <mo>></mo> <mn>0.05</mn></mrow> </math> ) in 435 out of 435 or 100% (1238 out of 1334 or 92.8%) of cases across outcomes, <math><mrow><mi>a</mi> <mi>H</mi> <mi>R</mi> <mo>=</mo> <mn>6.11</mn> <mo>±</mo> <mn>9.00</mn></mrow> </math> ( <math><mrow><mi>a</mi> <mi>H</mi> <mi>R</mi> <mo>=</mo> <mn>3.67</mn> <mo>±</mo> <mn>5.78</mn></mrow> </math> ) with (without) Bonferroni correction. The neural architecture of OnsetNet was interpreted with saliency analysis, and several significant body composition and blood biomarkers were identified. The results demonstrate that neural network survival models are able to estimate prognostically informative onset acceleration risk, which could be used to improve understanding of synchronicity in the onset of age-associated diseases and reprioritize patients based on disease-specific risk.</p>\",\"PeriodicalId\":12730,\"journal\":{\"name\":\"GeroScience\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":5.4000,\"publicationDate\":\"2025-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"GeroScience\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s11357-025-01702-w\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GERIATRICS & GERONTOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"GeroScience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s11357-025-01702-w","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
多病性衰老的一个主要挑战是了解疾病是如何共同发生的,并确定疾病加速发展的高危人群,但迄今为止,疾病诊断的相对发病加速之间的关联尚未用于表征疾病模式。本研究提出了一个神经网络Cox模型的开发和评估,该模型用于预测年龄相关疾病的发病加速风险,使用来自英国生物银行的60,396个人和218,530个结果事件的人口统计学、拟人化、成像和血液生物标志物特征。采用Harrell’s concordance index (C-index)评价风险预测。该模型在内部(C-index 0.6830±0.0902,n = 8,931)和外部(C-index 0.6461±0.1264,n = 855)测试集上表现良好,47个条件中有38个条件(80.9%)的C-index≥0.6。包括身体成分和血液生物标志物输入特征是独立的重要预测性能。预测风险四分位数的Kaplan-Meier曲线(log-rank p≤1.16 E - 16)表明个体分为高风险组和低风险组。风险四分位数分析显示,心脏代谢、血管-神经精神和消化-神经精神疾病群的发病加速有很强的统计学意义(r≥0.6,p≤3.46 E - 5),而13种和19种疾病分别与全因死亡率和全因发病率的发病加速有很强的相关性。在预后生存分析中,435 / 435或100%(1334 / 1238或92.8%)的病例符合比例风险假设(Schoenfeld残差p < 0.05),经(不经)Bonferroni校正后,风险比= 6.11±9.00(风险比= 3.67±5.78)。通过显著性分析来解释OnsetNet的神经结构,并确定了几个重要的身体成分和血液生物标志物。结果表明,神经网络生存模型能够估计预后信息的发病加速风险,可用于提高对年龄相关疾病发病同步性的理解,并根据疾病特异性风险对患者进行重新排序。
Deep learning predicts onset acceleration of 38 age-associated diseases from blood and body composition biomarkers in the UK Biobank.
A major challenge in multimorbid aging is understanding how diseases co-occur and identifying high-risk groups for accelerated disease development, but to date associations in the relative onset acceleration of disease diagnoses have not been used to characterize disease patterns. This study presents the development and evaluation of a neural network Cox model for predicting onset acceleration risk for age-associated conditions, using demographic, anthropomorphic, imaging, and blood biomarker traits from 60,396 individuals and 218,530 outcome events from the UK Biobank. Risk prediction was evaluated with Harrell's concordance index (C-index). The model performed well on internal (C-index , ) and external (C-index , ) test sets, attaining C-index on 38 out of 47 ( ) conditions. Inclusion of body composition and blood biomarker input traits was independently important for predictive performance. Kaplan-Meier curves for predicted risk quartiles (log-rank ) indicated robust stratification of individuals into high and low risk groups. Analysis of risk quartiles revealed cardiometabolic, vascular-neuropsychiatric, and digestive-neuropsychiatric disease clusters with strong statistically significant inter-correlated onset acceleration ( , ), while 13 and 19 conditions were strongly associated with onset acceleration of all-cause mortality and all-cause morbidity, respectively. In prognostic survival analysis, the proportional hazards assumption was met (Schoenfeld residual ) in 435 out of 435 or 100% (1238 out of 1334 or 92.8%) of cases across outcomes, ( ) with (without) Bonferroni correction. The neural architecture of OnsetNet was interpreted with saliency analysis, and several significant body composition and blood biomarkers were identified. The results demonstrate that neural network survival models are able to estimate prognostically informative onset acceleration risk, which could be used to improve understanding of synchronicity in the onset of age-associated diseases and reprioritize patients based on disease-specific risk.
GeroScienceMedicine-Complementary and Alternative Medicine
CiteScore
10.50
自引率
5.40%
发文量
182
期刊介绍:
GeroScience is a bi-monthly, international, peer-reviewed journal that publishes articles related to research in the biology of aging and research on biomedical applications that impact aging. The scope of articles to be considered include evolutionary biology, biophysics, genetics, genomics, proteomics, molecular biology, cell biology, biochemistry, endocrinology, immunology, physiology, pharmacology, neuroscience, and psychology.