Identification of Associations Between Peripheral Blood Gene Expression and Cerebrospinal Fluid Biomarkers in Alzheimer's Disease Using an Improved Joint Multi-Task Sparse Canonical Correlation Analysis Algorithm.

IF 3.3 4区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Qianqian Wu, Zhihui Ma, Feng Wang
{"title":"Identification of Associations Between Peripheral Blood Gene Expression and Cerebrospinal Fluid Biomarkers in Alzheimer's Disease Using an Improved Joint Multi-Task Sparse Canonical Correlation Analysis Algorithm.","authors":"Qianqian Wu, Zhihui Ma, Feng Wang","doi":"10.1007/s12010-025-05297-y","DOIUrl":null,"url":null,"abstract":"<p><p>Alzheimer's disease (AD) is an irreversible neurodegenerative disorder, and early diagnosis is crucial for effective clinical intervention. Traditional diagnostic methods involve detecting living brain tissue across the blood-brain barrier, but these invasive procedures cause unavoidable damage to patients. Genetic biomarkers in peripheral blood may provide valuable insights into brain lesions, potentially offering a non-invasive method for early AD diagnosis. The aim of this study is to propose an improved joint multi-task sparse canonical correlation analysis (MTSCCA) algorithm to identify significant genetic biomarkers in peripheral blood that correlate with brain markers of AD, such as cerebrospinal fluid (CSF) markers. This approach aims to accurately predict AD and assess disease progression. The study employs a multi-task sparse canonical correlation analysis (MTSCCA) approach with separate analyses for AD and healthy controls. Both tasks are constrained with class-consistent and class-specific conditions to identify significant features for each diagnostic group. To enhance robustness, the Laplacian matrix constraints were incorporated into the MTSCCA-LR algorithm to reduce noise in genetic data. The proposed algorithm identifies key differentially expressed genes (DEGs) that are involved in pathways closely linked to AD pathogenesis. These genes have specific diagnostic significance. Validation of these genes for predicting CSF markers was conducted using two regression models, showing good predictive accuracy. Furthermore, a Support Vector Machine (SVM) classifier was used to classify the two diagnostic groups, demonstrating high classification accuracy. The Top 20 genes identified using the proposed algorithm were used to construct an AD diagnostic model, which exhibited strong potential for non-invasive AD diagnosis, with significant implications for clinical practice. The code and example data of the proposed algorithm have been made publicly available on GitHub ( https://github.com/Zoe491/Improved-MTSCCA1 ).</p>","PeriodicalId":465,"journal":{"name":"Applied Biochemistry and Biotechnology","volume":" ","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Biochemistry and Biotechnology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s12010-025-05297-y","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Alzheimer's disease (AD) is an irreversible neurodegenerative disorder, and early diagnosis is crucial for effective clinical intervention. Traditional diagnostic methods involve detecting living brain tissue across the blood-brain barrier, but these invasive procedures cause unavoidable damage to patients. Genetic biomarkers in peripheral blood may provide valuable insights into brain lesions, potentially offering a non-invasive method for early AD diagnosis. The aim of this study is to propose an improved joint multi-task sparse canonical correlation analysis (MTSCCA) algorithm to identify significant genetic biomarkers in peripheral blood that correlate with brain markers of AD, such as cerebrospinal fluid (CSF) markers. This approach aims to accurately predict AD and assess disease progression. The study employs a multi-task sparse canonical correlation analysis (MTSCCA) approach with separate analyses for AD and healthy controls. Both tasks are constrained with class-consistent and class-specific conditions to identify significant features for each diagnostic group. To enhance robustness, the Laplacian matrix constraints were incorporated into the MTSCCA-LR algorithm to reduce noise in genetic data. The proposed algorithm identifies key differentially expressed genes (DEGs) that are involved in pathways closely linked to AD pathogenesis. These genes have specific diagnostic significance. Validation of these genes for predicting CSF markers was conducted using two regression models, showing good predictive accuracy. Furthermore, a Support Vector Machine (SVM) classifier was used to classify the two diagnostic groups, demonstrating high classification accuracy. The Top 20 genes identified using the proposed algorithm were used to construct an AD diagnostic model, which exhibited strong potential for non-invasive AD diagnosis, with significant implications for clinical practice. The code and example data of the proposed algorithm have been made publicly available on GitHub ( https://github.com/Zoe491/Improved-MTSCCA1 ).

使用改进的联合多任务稀疏典型相关分析算法鉴定阿尔茨海默病外周血基因表达与脑脊液生物标志物之间的关联
阿尔茨海默病(AD)是一种不可逆的神经退行性疾病,早期诊断对有效的临床干预至关重要。传统的诊断方法包括通过血脑屏障检测活的脑组织,但这些侵入性程序会对患者造成不可避免的损害。外周血中的遗传生物标志物可能为大脑病变提供有价值的见解,可能为早期AD诊断提供一种非侵入性方法。本研究的目的是提出一种改进的联合多任务稀疏典型相关分析(MTSCCA)算法,以识别外周血中与AD脑标志物(如脑脊液(CSF)标志物)相关的重要遗传生物标志物。该方法旨在准确预测AD并评估疾病进展。该研究采用多任务稀疏典型相关分析(MTSCCA)方法,对AD和健康对照进行单独分析。这两项任务都受到类别一致和类别特定条件的约束,以确定每个诊断组的重要特征。为了增强鲁棒性,将拉普拉斯矩阵约束引入MTSCCA-LR算法中,以降低遗传数据中的噪声。该算法确定了与AD发病机制密切相关的关键差异表达基因(DEGs)。这些基因具有特定的诊断意义。使用两种回归模型验证这些基因预测脑脊液标记物,显示出良好的预测准确性。此外,使用支持向量机分类器对两组诊断进行分类,显示出较高的分类准确率。利用该算法鉴定出的前20个基因构建了AD诊断模型,该模型显示出非侵入性AD诊断的强大潜力,对临床实践具有重要意义。所提出算法的代码和示例数据已在GitHub (https://github.com/Zoe491/Improved-MTSCCA1)上公开提供。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Applied Biochemistry and Biotechnology
Applied Biochemistry and Biotechnology 工程技术-生化与分子生物学
CiteScore
5.70
自引率
6.70%
发文量
460
审稿时长
5.3 months
期刊介绍: This journal is devoted to publishing the highest quality innovative papers in the fields of biochemistry and biotechnology. The typical focus of the journal is to report applications of novel scientific and technological breakthroughs, as well as technological subjects that are still in the proof-of-concept stage. Applied Biochemistry and Biotechnology provides a forum for case studies and practical concepts of biotechnology, utilization, including controls, statistical data analysis, problem descriptions unique to a particular application, and bioprocess economic analyses. The journal publishes reviews deemed of interest to readers, as well as book reviews, meeting and symposia notices, and news items relating to biotechnology in both the industrial and academic communities. In addition, Applied Biochemistry and Biotechnology often publishes lists of patents and publications of special interest to readers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信