基于代谢组学和蛋白质组学的机器学习模型预测帕金森病的认知障碍

IF 6.7 1区 医学 Q1 NEUROSCIENCES
Baiyuan Yang, Yongyun Zhu, Kelu Li, Fang Wang, Bin Liu, Qian Zhou, Yuchao Tai, Zhaochao Liu, Lin Yang, Ruiqiong Ba, Chunyan Lei, Hui Ren, Zhong Xu, Ailan Pang, Xinglong Yang
{"title":"基于代谢组学和蛋白质组学的机器学习模型预测帕金森病的认知障碍","authors":"Baiyuan Yang, Yongyun Zhu, Kelu Li, Fang Wang, Bin Liu, Qian Zhou, Yuchao Tai, Zhaochao Liu, Lin Yang, Ruiqiong Ba, Chunyan Lei, Hui Ren, Zhong Xu, Ailan Pang, Xinglong Yang","doi":"10.1038/s41531-024-00795-y","DOIUrl":null,"url":null,"abstract":"<p>There is an urgent need to identify predictive biomarkers of Parkinson’s disease (PD) with cognitive impairment (PDCI) in order to individualize patient management, ensure timely intervention, and improve prognosis. The aim of this study was to screen for these biomarkers by comparing the plasma proteome and metabolome of PD patients with or without cognitive impairment. Proteomics and metabolomics analyses were performed on a discover cohort. A machine learning model was used to identify candidate protein and metabolite biomarkers of PDCI, which were validated in an independent cohort. The predictive ability of these biomarkers for PDCI was evaluated by plotting receiver operating characteristic curves and calculating the area under the curve (AUC). Moreover, we assessed the predictive ability of these proteins in combination with neuroimaging. In the discover cohort (<i>n</i> = 100), we identified 25 protein features with best results in the machine learning model, including top-ranked PSAP and H3C15. The two-proteins were used for model construction, achieving an Area under the curve (AUC) of 0.951 in the train set and AUC of 0.981 in the test set. Similarly, the model gives a rank list of endogenous metabolite features, Glycocholic Acid and 6-Methylnicotinamide were two top features. Combining these two markers further got the AUC of 0.969 in train set and 0.867 in the test set. To validate the performance of the protein biomarkers, we performed targeted analysis of selected proteins (H3C15 and PSAP) and proteins likely associated with PDCI (NCAM2 and LAMB2) using parallel reaction monitoring in validation cohort (<i>n</i> = 116). The AUC of the classifier built with H3C15 and PSAP is 0.813. Moreover, when combining H3C15, PSAP, NCAM2, and LAMB2, the model achieved AUC of 0.983 in the train set, AUC of 0.981 in the test set, and AUC of 0.839 in the validation set. Furthermore, we verified that these protein markers we discovered can improve the predictive effect of neuroimaging on PDCI: the classifier built with neuroimaging features had AUC of 0.833, which improved to 0.905 when combined with H3C15. Taken together, our integrated proteomics and metabolomics analysis successfully identified potential biomarkers for PDCI. Additionally, H3C15 showed promise in enhancing the predictive performance of neuroimaging for cognitive impairment.</p>","PeriodicalId":19706,"journal":{"name":"NPJ Parkinson's Disease","volume":"14 1","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning model base on metabolomics and proteomics to predict cognitive impairment in Parkinson’s disease\",\"authors\":\"Baiyuan Yang, Yongyun Zhu, Kelu Li, Fang Wang, Bin Liu, Qian Zhou, Yuchao Tai, Zhaochao Liu, Lin Yang, Ruiqiong Ba, Chunyan Lei, Hui Ren, Zhong Xu, Ailan Pang, Xinglong Yang\",\"doi\":\"10.1038/s41531-024-00795-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>There is an urgent need to identify predictive biomarkers of Parkinson’s disease (PD) with cognitive impairment (PDCI) in order to individualize patient management, ensure timely intervention, and improve prognosis. The aim of this study was to screen for these biomarkers by comparing the plasma proteome and metabolome of PD patients with or without cognitive impairment. Proteomics and metabolomics analyses were performed on a discover cohort. A machine learning model was used to identify candidate protein and metabolite biomarkers of PDCI, which were validated in an independent cohort. The predictive ability of these biomarkers for PDCI was evaluated by plotting receiver operating characteristic curves and calculating the area under the curve (AUC). Moreover, we assessed the predictive ability of these proteins in combination with neuroimaging. In the discover cohort (<i>n</i> = 100), we identified 25 protein features with best results in the machine learning model, including top-ranked PSAP and H3C15. The two-proteins were used for model construction, achieving an Area under the curve (AUC) of 0.951 in the train set and AUC of 0.981 in the test set. Similarly, the model gives a rank list of endogenous metabolite features, Glycocholic Acid and 6-Methylnicotinamide were two top features. Combining these two markers further got the AUC of 0.969 in train set and 0.867 in the test set. To validate the performance of the protein biomarkers, we performed targeted analysis of selected proteins (H3C15 and PSAP) and proteins likely associated with PDCI (NCAM2 and LAMB2) using parallel reaction monitoring in validation cohort (<i>n</i> = 116). The AUC of the classifier built with H3C15 and PSAP is 0.813. Moreover, when combining H3C15, PSAP, NCAM2, and LAMB2, the model achieved AUC of 0.983 in the train set, AUC of 0.981 in the test set, and AUC of 0.839 in the validation set. Furthermore, we verified that these protein markers we discovered can improve the predictive effect of neuroimaging on PDCI: the classifier built with neuroimaging features had AUC of 0.833, which improved to 0.905 when combined with H3C15. Taken together, our integrated proteomics and metabolomics analysis successfully identified potential biomarkers for PDCI. Additionally, H3C15 showed promise in enhancing the predictive performance of neuroimaging for cognitive impairment.</p>\",\"PeriodicalId\":19706,\"journal\":{\"name\":\"NPJ Parkinson's Disease\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2024-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NPJ Parkinson's Disease\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1038/s41531-024-00795-y\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Parkinson's Disease","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41531-024-00795-y","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

目前迫切需要确定帕金森病(PD)伴认知障碍(PDCI)的预测性生物标志物,以便对患者进行个体化管理、确保及时干预并改善预后。本研究旨在通过比较帕金森病伴或不伴认知障碍患者的血浆蛋白质组和代谢组来筛选这些生物标志物。蛋白质组学和代谢组学分析是在一个发现队列中进行的。利用机器学习模型确定了 PDCI 的候选蛋白质和代谢物生物标志物,并在一个独立队列中进行了验证。通过绘制接收者操作特征曲线和计算曲线下面积(AUC),评估了这些生物标志物对 PDCI 的预测能力。此外,我们还评估了这些蛋白质与神经影像学相结合的预测能力。在发现队列(n = 100)中,我们确定了机器学习模型中效果最好的 25 个蛋白质特征,包括排名第一的 PSAP 和 H3C15。这两种蛋白质被用于构建模型,训练集的曲线下面积(AUC)为 0.951,测试集的曲线下面积(AUC)为 0.981。同样,该模型给出了内源性代谢物特征的等级列表,其中甘氨胆酸和 6-甲基烟酰胺是最重要的两个特征。结合这两个标记,训练集的 AUC 为 0.969,测试集为 0.867。为了验证蛋白质生物标记物的性能,我们在验证队列(n = 116)中使用平行反应监测对选定的蛋白质(H3C15 和 PSAP)和可能与 PDCI 相关的蛋白质(NCAM2 和 LAMB2)进行了靶向分析。使用 H3C15 和 PSAP 建立的分类器的 AUC 为 0.813。此外,当结合使用 H3C15、PSAP、NCAM2 和 LAMB2 时,模型在训练集中的 AUC 为 0.983,在测试集中的 AUC 为 0.981,在验证集中的 AUC 为 0.839。此外,我们还验证了我们发现的这些蛋白质标记物可以提高神经影像学对 PDCI 的预测效果:利用神经影像学特征建立的分类器的 AUC 为 0.833,当与 H3C15 结合使用时,AUC 提高到了 0.905。综上所述,我们的蛋白质组学和代谢组学综合分析成功地鉴定出了 PDCI 的潜在生物标记物。此外,H3C15 在提高神经影像学对认知障碍的预测性能方面也显示出了前景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Machine learning model base on metabolomics and proteomics to predict cognitive impairment in Parkinson’s disease

Machine learning model base on metabolomics and proteomics to predict cognitive impairment in Parkinson’s disease

There is an urgent need to identify predictive biomarkers of Parkinson’s disease (PD) with cognitive impairment (PDCI) in order to individualize patient management, ensure timely intervention, and improve prognosis. The aim of this study was to screen for these biomarkers by comparing the plasma proteome and metabolome of PD patients with or without cognitive impairment. Proteomics and metabolomics analyses were performed on a discover cohort. A machine learning model was used to identify candidate protein and metabolite biomarkers of PDCI, which were validated in an independent cohort. The predictive ability of these biomarkers for PDCI was evaluated by plotting receiver operating characteristic curves and calculating the area under the curve (AUC). Moreover, we assessed the predictive ability of these proteins in combination with neuroimaging. In the discover cohort (n = 100), we identified 25 protein features with best results in the machine learning model, including top-ranked PSAP and H3C15. The two-proteins were used for model construction, achieving an Area under the curve (AUC) of 0.951 in the train set and AUC of 0.981 in the test set. Similarly, the model gives a rank list of endogenous metabolite features, Glycocholic Acid and 6-Methylnicotinamide were two top features. Combining these two markers further got the AUC of 0.969 in train set and 0.867 in the test set. To validate the performance of the protein biomarkers, we performed targeted analysis of selected proteins (H3C15 and PSAP) and proteins likely associated with PDCI (NCAM2 and LAMB2) using parallel reaction monitoring in validation cohort (n = 116). The AUC of the classifier built with H3C15 and PSAP is 0.813. Moreover, when combining H3C15, PSAP, NCAM2, and LAMB2, the model achieved AUC of 0.983 in the train set, AUC of 0.981 in the test set, and AUC of 0.839 in the validation set. Furthermore, we verified that these protein markers we discovered can improve the predictive effect of neuroimaging on PDCI: the classifier built with neuroimaging features had AUC of 0.833, which improved to 0.905 when combined with H3C15. Taken together, our integrated proteomics and metabolomics analysis successfully identified potential biomarkers for PDCI. Additionally, H3C15 showed promise in enhancing the predictive performance of neuroimaging for cognitive impairment.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
NPJ Parkinson's Disease
NPJ Parkinson's Disease Medicine-Neurology (clinical)
CiteScore
9.80
自引率
5.70%
发文量
156
审稿时长
11 weeks
期刊介绍: npj Parkinson's Disease is a comprehensive open access journal that covers a wide range of research areas related to Parkinson's disease. It publishes original studies in basic science, translational research, and clinical investigations. The journal is dedicated to advancing our understanding of Parkinson's disease by exploring various aspects such as anatomy, etiology, genetics, cellular and molecular physiology, neurophysiology, epidemiology, and therapeutic development. By providing free and immediate access to the scientific and Parkinson's disease community, npj Parkinson's Disease promotes collaboration and knowledge sharing among researchers and healthcare professionals.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信