英国生物库队列中 2 型糖尿病及相关特征的血浆蛋白质组特征

Trisha P. Gupte, Zahra Azizi, Pik Fang Kho, Jiayan Zhou, Kevin Nzenkue, Ming-Li Chen, Daniel J. Panyard, Rodrigo Guarischi-Sousa, Austin T. Hilliard, Disha Sharma, Kathleen Watson, Fahim Abbasi, Philip S. Tsao, Shoa L. Clarke, Themistocles L. Assimes
{"title":"英国生物库队列中 2 型糖尿病及相关特征的血浆蛋白质组特征","authors":"Trisha P. Gupte, Zahra Azizi, Pik Fang Kho, Jiayan Zhou, Kevin Nzenkue, Ming-Li Chen, Daniel J. Panyard, Rodrigo Guarischi-Sousa, Austin T. Hilliard, Disha Sharma, Kathleen Watson, Fahim Abbasi, Philip S. Tsao, Shoa L. Clarke, Themistocles L. Assimes","doi":"10.1101/2024.09.13.24313501","DOIUrl":null,"url":null,"abstract":"Aims/hypothesis: The plasma proteome holds promise as a diagnostic and prognostic tool that can accurately reflect complex human traits and disease processes. We assessed the ability of plasma proteins to predict type 2 diabetes mellitus (T2DM) and related traits. Methods: Clinical, genetic, and high-throughput proteomic data from three subcohorts of UK Biobank participants were analyzed for association with dual-energy x-ray absorptiometry (DXA) derived truncal fat (in the adiposity subcohort), estimated maximum oxygen consumption (VO2max) (in the fitness subcohort), and incident T2DM (in the T2DM subcohort). We used least absolute shrinkage and selection operator (LASSO) regression to assess the relative ability of non-proteomic and proteomic variables to associate with each trait by comparing variance explained (R2) and area under the curve (AUC) statistics between data types. Stability selection with randomized LASSO regression identified the most robustly associated proteins for each trait. The benefit of proteomic signatures (PSs) over QDiabetes, a T2DM clinical risk score, was evaluated through the derivation of delta (∆) AUC values. We also assessed the incremental gain in model performance metrics using proteomic datasets with varying numbers of proteins. A series of two-sample Mendelian randomization (MR) analyses were conducted to identify potentially causal proteins for adiposity, fitness, and T2DM. Results: Across all three subcohorts, the mean age was 56.7 years and 54.9% were female. In the T2DM subcohort, 5.8% developed incident T2DM over a median follow-up of 7.6 years. LASSO-derived PSs increased the R2 of truncal fat and VO2max over clinical and genetic factors by 0.074 and 0.057, respectively. We observed a similar improvement in T2DM prediction over the QDiabetes score [Δ AUC: 0.016 (95% CI 0.008, 0.024)] when using a robust PS derived strictly from the T2DM outcome versus a model further augmented with non-overlapping proteins associated with adiposity and fitness. A small number of proteins (29 for truncal adiposity, 18 for VO2max, and 26 for T2DM) identified by stability selection algorithms offered most of the improvement in prediction of each outcome. Filtered and clustered versions of the full proteomic dataset supplied by the UK Biobank (ranging between 600-1,500 proteins) performed comparably to the full dataset for T2DM prediction. Using MR, we identified 4 proteins as potentially causal for adiposity, 1 as potentially causal for fitness, and 4 as potentially causal for T2DM. Conclusions/Interpretation: Plasma PSs modestly improve the prediction of incident T2DM over that possible with clinical and genetic factors. Further studies are warranted to better elucidate the clinical utility of these signatures in predicting the risk of T2DM over the standard practice of using the QDiabetes score. Candidate causally associated proteins identified through MR deserve further study as potential novel therapeutic targets for T2DM.","PeriodicalId":501375,"journal":{"name":"medRxiv - Genetic and Genomic Medicine","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Plasma proteomic signatures for type 2 diabetes mellitus and related traits in the UK Biobank cohort\",\"authors\":\"Trisha P. Gupte, Zahra Azizi, Pik Fang Kho, Jiayan Zhou, Kevin Nzenkue, Ming-Li Chen, Daniel J. Panyard, Rodrigo Guarischi-Sousa, Austin T. Hilliard, Disha Sharma, Kathleen Watson, Fahim Abbasi, Philip S. Tsao, Shoa L. Clarke, Themistocles L. Assimes\",\"doi\":\"10.1101/2024.09.13.24313501\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aims/hypothesis: The plasma proteome holds promise as a diagnostic and prognostic tool that can accurately reflect complex human traits and disease processes. We assessed the ability of plasma proteins to predict type 2 diabetes mellitus (T2DM) and related traits. Methods: Clinical, genetic, and high-throughput proteomic data from three subcohorts of UK Biobank participants were analyzed for association with dual-energy x-ray absorptiometry (DXA) derived truncal fat (in the adiposity subcohort), estimated maximum oxygen consumption (VO2max) (in the fitness subcohort), and incident T2DM (in the T2DM subcohort). We used least absolute shrinkage and selection operator (LASSO) regression to assess the relative ability of non-proteomic and proteomic variables to associate with each trait by comparing variance explained (R2) and area under the curve (AUC) statistics between data types. Stability selection with randomized LASSO regression identified the most robustly associated proteins for each trait. The benefit of proteomic signatures (PSs) over QDiabetes, a T2DM clinical risk score, was evaluated through the derivation of delta (∆) AUC values. We also assessed the incremental gain in model performance metrics using proteomic datasets with varying numbers of proteins. A series of two-sample Mendelian randomization (MR) analyses were conducted to identify potentially causal proteins for adiposity, fitness, and T2DM. Results: Across all three subcohorts, the mean age was 56.7 years and 54.9% were female. In the T2DM subcohort, 5.8% developed incident T2DM over a median follow-up of 7.6 years. LASSO-derived PSs increased the R2 of truncal fat and VO2max over clinical and genetic factors by 0.074 and 0.057, respectively. We observed a similar improvement in T2DM prediction over the QDiabetes score [Δ AUC: 0.016 (95% CI 0.008, 0.024)] when using a robust PS derived strictly from the T2DM outcome versus a model further augmented with non-overlapping proteins associated with adiposity and fitness. A small number of proteins (29 for truncal adiposity, 18 for VO2max, and 26 for T2DM) identified by stability selection algorithms offered most of the improvement in prediction of each outcome. Filtered and clustered versions of the full proteomic dataset supplied by the UK Biobank (ranging between 600-1,500 proteins) performed comparably to the full dataset for T2DM prediction. Using MR, we identified 4 proteins as potentially causal for adiposity, 1 as potentially causal for fitness, and 4 as potentially causal for T2DM. Conclusions/Interpretation: Plasma PSs modestly improve the prediction of incident T2DM over that possible with clinical and genetic factors. Further studies are warranted to better elucidate the clinical utility of these signatures in predicting the risk of T2DM over the standard practice of using the QDiabetes score. Candidate causally associated proteins identified through MR deserve further study as potential novel therapeutic targets for T2DM.\",\"PeriodicalId\":501375,\"journal\":{\"name\":\"medRxiv - Genetic and Genomic Medicine\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv - Genetic and Genomic Medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.09.13.24313501\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Genetic and Genomic Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.09.13.24313501","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目的/假设:血浆蛋白质组有望成为一种诊断和预后工具,准确反映复杂的人体特征和疾病过程。我们评估了血浆蛋白质预测 2 型糖尿病(T2DM)及相关特征的能力。研究方法我们分析了英国生物库三个子队列参与者的临床、遗传和高通量蛋白质组数据与双能 X 射线吸收测定法(DXA)得出的躯干脂肪(在肥胖子队列中)、估计最大耗氧量(VO2max)(在体能子队列中)和 T2DM 事件(在 T2DM 子队列中)之间的关联。我们使用最小绝对收缩和选择算子(LASSO)回归法,通过比较数据类型之间的解释方差(R2)和曲线下面积(AUC)统计量,评估非蛋白质组变量和蛋白质组变量与每个性状相关联的相对能力。通过随机 LASSO 回归进行稳定性选择,确定了与每个性状关联性最强的蛋白质。通过得出 delta (∆) AUC 值,评估了蛋白质组特征(PSs)相对于 T2DM 临床风险评分 QDiabetes 的优势。我们还利用蛋白质数量不同的蛋白质组数据集评估了模型性能指标的增益。我们还进行了一系列双样本孟德尔随机化(MR)分析,以确定脂肪、体能和 T2DM 的潜在因果蛋白。分析结果所有三个亚群的平均年龄为 56.7 岁,54.9% 为女性。在 T2DM 亚群中,5.8% 的人在中位 7.6 年的随访期间患上了 T2DM。LASSO 衍生的 PS 使截肢脂肪和 VO2max 的 R2 分别比临床因素和遗传因素增加了 0.074 和 0.057。我们观察到,与 QDiabetes 评分相比,当使用严格从 T2DM 结果得出的稳健 PS 与使用与脂肪和体能相关的非重叠蛋白进一步增强的模型时,T2DM 预测结果也有类似的改善[Δ AUC:0.016 (95% CI 0.008, 0.024)]。通过稳定性选择算法确定的少量蛋白质(29 个用于预测躯干脂肪,18 个用于预测 VO2max,26 个用于预测 T2DM)为每种结果的预测提供了大部分改进。英国生物库提供的完整蛋白质组数据集的过滤和聚类版本(蛋白质数量在 600-1,500 个之间)在 T2DM 预测方面的表现与完整数据集相当。利用磁共振技术,我们确定了 4 种蛋白质可能与肥胖有关,1 种可能与体质有关,4 种可能与 T2DM 有关。结论/解释:与临床和遗传因素相比,血浆 PS 可适度改善对 T2DM 发病的预测。与使用 QDiabetes 评分的标准做法相比,这些特征在预测 T2DM 风险方面的临床实用性有待进一步研究。通过磁共振鉴定出的候选因果相关蛋白作为 T2DM 的潜在新型治疗靶点值得进一步研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Plasma proteomic signatures for type 2 diabetes mellitus and related traits in the UK Biobank cohort
Aims/hypothesis: The plasma proteome holds promise as a diagnostic and prognostic tool that can accurately reflect complex human traits and disease processes. We assessed the ability of plasma proteins to predict type 2 diabetes mellitus (T2DM) and related traits. Methods: Clinical, genetic, and high-throughput proteomic data from three subcohorts of UK Biobank participants were analyzed for association with dual-energy x-ray absorptiometry (DXA) derived truncal fat (in the adiposity subcohort), estimated maximum oxygen consumption (VO2max) (in the fitness subcohort), and incident T2DM (in the T2DM subcohort). We used least absolute shrinkage and selection operator (LASSO) regression to assess the relative ability of non-proteomic and proteomic variables to associate with each trait by comparing variance explained (R2) and area under the curve (AUC) statistics between data types. Stability selection with randomized LASSO regression identified the most robustly associated proteins for each trait. The benefit of proteomic signatures (PSs) over QDiabetes, a T2DM clinical risk score, was evaluated through the derivation of delta (∆) AUC values. We also assessed the incremental gain in model performance metrics using proteomic datasets with varying numbers of proteins. A series of two-sample Mendelian randomization (MR) analyses were conducted to identify potentially causal proteins for adiposity, fitness, and T2DM. Results: Across all three subcohorts, the mean age was 56.7 years and 54.9% were female. In the T2DM subcohort, 5.8% developed incident T2DM over a median follow-up of 7.6 years. LASSO-derived PSs increased the R2 of truncal fat and VO2max over clinical and genetic factors by 0.074 and 0.057, respectively. We observed a similar improvement in T2DM prediction over the QDiabetes score [Δ AUC: 0.016 (95% CI 0.008, 0.024)] when using a robust PS derived strictly from the T2DM outcome versus a model further augmented with non-overlapping proteins associated with adiposity and fitness. A small number of proteins (29 for truncal adiposity, 18 for VO2max, and 26 for T2DM) identified by stability selection algorithms offered most of the improvement in prediction of each outcome. Filtered and clustered versions of the full proteomic dataset supplied by the UK Biobank (ranging between 600-1,500 proteins) performed comparably to the full dataset for T2DM prediction. Using MR, we identified 4 proteins as potentially causal for adiposity, 1 as potentially causal for fitness, and 4 as potentially causal for T2DM. Conclusions/Interpretation: Plasma PSs modestly improve the prediction of incident T2DM over that possible with clinical and genetic factors. Further studies are warranted to better elucidate the clinical utility of these signatures in predicting the risk of T2DM over the standard practice of using the QDiabetes score. Candidate causally associated proteins identified through MR deserve further study as potential novel therapeutic targets for T2DM.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信