可解释的机器学习框架,以预测个性化的生理衰老

IF 7.1 1区 医学 Q1 CELL BIOLOGY
Aging Cell Pub Date : 2023-06-10 DOI:10.1111/acel.13872
David Bernard, Emmanuel Doumard, Isabelle Ader, Philippe Kemoun, Jean-Christophe Pagès, Anne Galinier, Sylvain Cussat-Blanc, Felix Furger, Luigi Ferrucci, Julien Aligon, Cyrille Delpierre, Luc Pénicaud, Paul Monsarrat, Louis Casteilla
{"title":"可解释的机器学习框架,以预测个性化的生理衰老","authors":"David Bernard,&nbsp;Emmanuel Doumard,&nbsp;Isabelle Ader,&nbsp;Philippe Kemoun,&nbsp;Jean-Christophe Pagès,&nbsp;Anne Galinier,&nbsp;Sylvain Cussat-Blanc,&nbsp;Felix Furger,&nbsp;Luigi Ferrucci,&nbsp;Julien Aligon,&nbsp;Cyrille Delpierre,&nbsp;Luc Pénicaud,&nbsp;Paul Monsarrat,&nbsp;Louis Casteilla","doi":"10.1111/acel.13872","DOIUrl":null,"url":null,"abstract":"<p>Attaining personalized healthy aging requires accurate monitoring of physiological changes and identifying subclinical markers that predict accelerated or delayed aging. Classic biostatistical methods most rely on supervised variables to estimate physiological aging and do not capture the full complexity of inter-parameter interactions. Machine learning (ML) is promising, but its black box nature eludes direct understanding, substantially limiting physician confidence and clinical usage. Using a broad population dataset from the National Health and Nutrition Examination Survey (NHANES) study including routine biological variables and after selection of XGBoost as the most appropriate algorithm, we created an innovative explainable ML framework to determine a Personalized physiological age (PPA). PPA predicted both chronic disease and mortality independently of chronological age. Twenty-six variables were sufficient to predict PPA. Using SHapley Additive exPlanations (SHAP), we implemented a precise quantitative associated metric for each variable explaining physiological (i.e., accelerated or delayed) deviations from age-specific normative data. Among the variables, glycated hemoglobin (HbA1c) displays a major relative weight in the estimation of PPA. Finally, clustering profiles of identical contextualized explanations reveal different aging trajectories opening opportunities to specific clinical follow-up. These data show that PPA is a robust, quantitative and explainable ML-based metric that monitors personalized health status. Our approach also provides a complete framework applicable to different datasets or variables, allowing precision physiological age estimation.</p>","PeriodicalId":119,"journal":{"name":"Aging Cell","volume":"22 8","pages":""},"PeriodicalIF":7.1000,"publicationDate":"2023-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/acel.13872","citationCount":"3","resultStr":"{\"title\":\"Explainable machine learning framework to predict personalized physiological aging\",\"authors\":\"David Bernard,&nbsp;Emmanuel Doumard,&nbsp;Isabelle Ader,&nbsp;Philippe Kemoun,&nbsp;Jean-Christophe Pagès,&nbsp;Anne Galinier,&nbsp;Sylvain Cussat-Blanc,&nbsp;Felix Furger,&nbsp;Luigi Ferrucci,&nbsp;Julien Aligon,&nbsp;Cyrille Delpierre,&nbsp;Luc Pénicaud,&nbsp;Paul Monsarrat,&nbsp;Louis Casteilla\",\"doi\":\"10.1111/acel.13872\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Attaining personalized healthy aging requires accurate monitoring of physiological changes and identifying subclinical markers that predict accelerated or delayed aging. Classic biostatistical methods most rely on supervised variables to estimate physiological aging and do not capture the full complexity of inter-parameter interactions. Machine learning (ML) is promising, but its black box nature eludes direct understanding, substantially limiting physician confidence and clinical usage. Using a broad population dataset from the National Health and Nutrition Examination Survey (NHANES) study including routine biological variables and after selection of XGBoost as the most appropriate algorithm, we created an innovative explainable ML framework to determine a Personalized physiological age (PPA). PPA predicted both chronic disease and mortality independently of chronological age. Twenty-six variables were sufficient to predict PPA. Using SHapley Additive exPlanations (SHAP), we implemented a precise quantitative associated metric for each variable explaining physiological (i.e., accelerated or delayed) deviations from age-specific normative data. Among the variables, glycated hemoglobin (HbA1c) displays a major relative weight in the estimation of PPA. Finally, clustering profiles of identical contextualized explanations reveal different aging trajectories opening opportunities to specific clinical follow-up. These data show that PPA is a robust, quantitative and explainable ML-based metric that monitors personalized health status. Our approach also provides a complete framework applicable to different datasets or variables, allowing precision physiological age estimation.</p>\",\"PeriodicalId\":119,\"journal\":{\"name\":\"Aging Cell\",\"volume\":\"22 8\",\"pages\":\"\"},\"PeriodicalIF\":7.1000,\"publicationDate\":\"2023-06-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/acel.13872\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Aging Cell\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/acel.13872\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CELL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aging Cell","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/acel.13872","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 3

摘要

实现个性化的健康老龄化需要准确监测生理变化并识别预测加速或延迟衰老的亚临床标志物。经典的生物统计学方法大多依赖于监督变量来估计生理衰老,而不能捕捉到参数间相互作用的全部复杂性。机器学习(ML)很有前途,但它的黑箱性质无法直接理解,极大地限制了医生的信心和临床应用。使用来自国家健康和营养检查调查(NHANES)研究的广泛人口数据集,包括常规生物变量,并在选择XGBoost作为最合适的算法后,我们创建了一个创新的可解释的ML框架来确定个性化生理年龄(PPA)。PPA预测慢性疾病和死亡率与实际年龄无关。26个变量足以预测PPA。使用SHapley加性解释(SHAP),我们对解释生理(即加速或延迟)偏离特定年龄规范数据的每个变量实施了精确的定量关联度量。在这些变量中,糖化血红蛋白(HbA1c)在PPA的估计中显示出主要的相对权重。最后,相同情境化解释的聚类概况揭示了不同的衰老轨迹,为特定的临床随访提供了机会。这些数据表明,PPA是一种稳健、定量且可解释的基于ml的指标,可监测个性化的健康状况。我们的方法还提供了一个适用于不同数据集或变量的完整框架,允许精确的生理年龄估计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Explainable machine learning framework to predict personalized physiological aging

Explainable machine learning framework to predict personalized physiological aging

Attaining personalized healthy aging requires accurate monitoring of physiological changes and identifying subclinical markers that predict accelerated or delayed aging. Classic biostatistical methods most rely on supervised variables to estimate physiological aging and do not capture the full complexity of inter-parameter interactions. Machine learning (ML) is promising, but its black box nature eludes direct understanding, substantially limiting physician confidence and clinical usage. Using a broad population dataset from the National Health and Nutrition Examination Survey (NHANES) study including routine biological variables and after selection of XGBoost as the most appropriate algorithm, we created an innovative explainable ML framework to determine a Personalized physiological age (PPA). PPA predicted both chronic disease and mortality independently of chronological age. Twenty-six variables were sufficient to predict PPA. Using SHapley Additive exPlanations (SHAP), we implemented a precise quantitative associated metric for each variable explaining physiological (i.e., accelerated or delayed) deviations from age-specific normative data. Among the variables, glycated hemoglobin (HbA1c) displays a major relative weight in the estimation of PPA. Finally, clustering profiles of identical contextualized explanations reveal different aging trajectories opening opportunities to specific clinical follow-up. These data show that PPA is a robust, quantitative and explainable ML-based metric that monitors personalized health status. Our approach also provides a complete framework applicable to different datasets or variables, allowing precision physiological age estimation.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Aging Cell
Aging Cell Biochemistry, Genetics and Molecular Biology-Cell Biology
自引率
2.60%
发文量
212
期刊介绍: Aging Cell is an Open Access journal that focuses on the core aspects of the biology of aging, encompassing the entire spectrum of geroscience. The journal's content is dedicated to publishing research that uncovers the mechanisms behind the aging process and explores the connections between aging and various age-related diseases. This journal aims to provide a comprehensive understanding of the biological underpinnings of aging and its implications for human health. The journal is widely recognized and its content is abstracted and indexed by numerous databases and services, which facilitates its accessibility and impact in the scientific community. These include: Academic Search (EBSCO Publishing) Academic Search Alumni Edition (EBSCO Publishing) Academic Search Premier (EBSCO Publishing) Biological Science Database (ProQuest) CAS: Chemical Abstracts Service (ACS) Embase (Elsevier) InfoTrac (GALE Cengage) Ingenta Select ISI Alerting Services Journal Citation Reports/Science Edition (Clarivate Analytics) MEDLINE/PubMed (NLM) Natural Science Collection (ProQuest) PubMed Dietary Supplement Subset (NLM) Science Citation Index Expanded (Clarivate Analytics) SciTech Premium Collection (ProQuest) Web of Science (Clarivate Analytics) Being indexed in these databases ensures that the research published in Aging Cell is discoverable by researchers, clinicians, and other professionals interested in the field of aging and its associated health issues. This broad coverage helps to disseminate the journal's findings and contributes to the advancement of knowledge in geroscience.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信