{"title":"使用改进的联合多任务稀疏典型相关分析算法鉴定阿尔茨海默病外周血基因表达与脑脊液生物标志物之间的关联","authors":"Qianqian Wu, Zhihui Ma, Feng Wang","doi":"10.1007/s12010-025-05297-y","DOIUrl":null,"url":null,"abstract":"<p><p>Alzheimer's disease (AD) is an irreversible neurodegenerative disorder, and early diagnosis is crucial for effective clinical intervention. Traditional diagnostic methods involve detecting living brain tissue across the blood-brain barrier, but these invasive procedures cause unavoidable damage to patients. Genetic biomarkers in peripheral blood may provide valuable insights into brain lesions, potentially offering a non-invasive method for early AD diagnosis. The aim of this study is to propose an improved joint multi-task sparse canonical correlation analysis (MTSCCA) algorithm to identify significant genetic biomarkers in peripheral blood that correlate with brain markers of AD, such as cerebrospinal fluid (CSF) markers. This approach aims to accurately predict AD and assess disease progression. The study employs a multi-task sparse canonical correlation analysis (MTSCCA) approach with separate analyses for AD and healthy controls. Both tasks are constrained with class-consistent and class-specific conditions to identify significant features for each diagnostic group. To enhance robustness, the Laplacian matrix constraints were incorporated into the MTSCCA-LR algorithm to reduce noise in genetic data. The proposed algorithm identifies key differentially expressed genes (DEGs) that are involved in pathways closely linked to AD pathogenesis. These genes have specific diagnostic significance. Validation of these genes for predicting CSF markers was conducted using two regression models, showing good predictive accuracy. Furthermore, a Support Vector Machine (SVM) classifier was used to classify the two diagnostic groups, demonstrating high classification accuracy. The Top 20 genes identified using the proposed algorithm were used to construct an AD diagnostic model, which exhibited strong potential for non-invasive AD diagnosis, with significant implications for clinical practice. The code and example data of the proposed algorithm have been made publicly available on GitHub ( https://github.com/Zoe491/Improved-MTSCCA1 ).</p>","PeriodicalId":465,"journal":{"name":"Applied Biochemistry and Biotechnology","volume":" ","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identification of Associations Between Peripheral Blood Gene Expression and Cerebrospinal Fluid Biomarkers in Alzheimer's Disease Using an Improved Joint Multi-Task Sparse Canonical Correlation Analysis Algorithm.\",\"authors\":\"Qianqian Wu, Zhihui Ma, Feng Wang\",\"doi\":\"10.1007/s12010-025-05297-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Alzheimer's disease (AD) is an irreversible neurodegenerative disorder, and early diagnosis is crucial for effective clinical intervention. Traditional diagnostic methods involve detecting living brain tissue across the blood-brain barrier, but these invasive procedures cause unavoidable damage to patients. Genetic biomarkers in peripheral blood may provide valuable insights into brain lesions, potentially offering a non-invasive method for early AD diagnosis. The aim of this study is to propose an improved joint multi-task sparse canonical correlation analysis (MTSCCA) algorithm to identify significant genetic biomarkers in peripheral blood that correlate with brain markers of AD, such as cerebrospinal fluid (CSF) markers. This approach aims to accurately predict AD and assess disease progression. The study employs a multi-task sparse canonical correlation analysis (MTSCCA) approach with separate analyses for AD and healthy controls. Both tasks are constrained with class-consistent and class-specific conditions to identify significant features for each diagnostic group. To enhance robustness, the Laplacian matrix constraints were incorporated into the MTSCCA-LR algorithm to reduce noise in genetic data. The proposed algorithm identifies key differentially expressed genes (DEGs) that are involved in pathways closely linked to AD pathogenesis. These genes have specific diagnostic significance. Validation of these genes for predicting CSF markers was conducted using two regression models, showing good predictive accuracy. Furthermore, a Support Vector Machine (SVM) classifier was used to classify the two diagnostic groups, demonstrating high classification accuracy. The Top 20 genes identified using the proposed algorithm were used to construct an AD diagnostic model, which exhibited strong potential for non-invasive AD diagnosis, with significant implications for clinical practice. The code and example data of the proposed algorithm have been made publicly available on GitHub ( https://github.com/Zoe491/Improved-MTSCCA1 ).</p>\",\"PeriodicalId\":465,\"journal\":{\"name\":\"Applied Biochemistry and Biotechnology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Biochemistry and Biotechnology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s12010-025-05297-y\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Biochemistry and Biotechnology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s12010-025-05297-y","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
Identification of Associations Between Peripheral Blood Gene Expression and Cerebrospinal Fluid Biomarkers in Alzheimer's Disease Using an Improved Joint Multi-Task Sparse Canonical Correlation Analysis Algorithm.
Alzheimer's disease (AD) is an irreversible neurodegenerative disorder, and early diagnosis is crucial for effective clinical intervention. Traditional diagnostic methods involve detecting living brain tissue across the blood-brain barrier, but these invasive procedures cause unavoidable damage to patients. Genetic biomarkers in peripheral blood may provide valuable insights into brain lesions, potentially offering a non-invasive method for early AD diagnosis. The aim of this study is to propose an improved joint multi-task sparse canonical correlation analysis (MTSCCA) algorithm to identify significant genetic biomarkers in peripheral blood that correlate with brain markers of AD, such as cerebrospinal fluid (CSF) markers. This approach aims to accurately predict AD and assess disease progression. The study employs a multi-task sparse canonical correlation analysis (MTSCCA) approach with separate analyses for AD and healthy controls. Both tasks are constrained with class-consistent and class-specific conditions to identify significant features for each diagnostic group. To enhance robustness, the Laplacian matrix constraints were incorporated into the MTSCCA-LR algorithm to reduce noise in genetic data. The proposed algorithm identifies key differentially expressed genes (DEGs) that are involved in pathways closely linked to AD pathogenesis. These genes have specific diagnostic significance. Validation of these genes for predicting CSF markers was conducted using two regression models, showing good predictive accuracy. Furthermore, a Support Vector Machine (SVM) classifier was used to classify the two diagnostic groups, demonstrating high classification accuracy. The Top 20 genes identified using the proposed algorithm were used to construct an AD diagnostic model, which exhibited strong potential for non-invasive AD diagnosis, with significant implications for clinical practice. The code and example data of the proposed algorithm have been made publicly available on GitHub ( https://github.com/Zoe491/Improved-MTSCCA1 ).
期刊介绍:
This journal is devoted to publishing the highest quality innovative papers in the fields of biochemistry and biotechnology. The typical focus of the journal is to report applications of novel scientific and technological breakthroughs, as well as technological subjects that are still in the proof-of-concept stage. Applied Biochemistry and Biotechnology provides a forum for case studies and practical concepts of biotechnology, utilization, including controls, statistical data analysis, problem descriptions unique to a particular application, and bioprocess economic analyses. The journal publishes reviews deemed of interest to readers, as well as book reviews, meeting and symposia notices, and news items relating to biotechnology in both the industrial and academic communities.
In addition, Applied Biochemistry and Biotechnology often publishes lists of patents and publications of special interest to readers.