{"title":"基于新型非线性相关分析算法的阿尔茨海默病成像遗传标记物探索","authors":"Renbo Yang, Wei Kong, Kun Liu, Gen Wen, Yaling Yu","doi":"10.1007/s12031-024-02190-x","DOIUrl":null,"url":null,"abstract":"<div><p>Alzheimer’s disease (AD) is an irreversible neurological disorder characterized by insidious onset. Identifying potential markers in its emergence and progression is crucial for early diagnosis and treatment. Imaging genetics typically merges genetic variables with multiple imaging parameters, employing various association analysis algorithms to investigate the links between pathological phenotypes and genetic variations, and to unearth molecular-level insights from brain images. However, most existing imaging genetics algorithms based on sparse learning assume a linear relationship between genetic factors and brain functions, limiting their ability to discern complex nonlinear correlation patterns and resulting in reduced accuracy. To address these issues, we propose a novel nonlinear imaging genetic association analysis method, Deep Self-Reconstruction-based Adaptive Sparse Multi-view Deep Generalized Canonical Correlation Analysis (DSR-AdaSMDGCCA). This approach facilitates joint learning of the nonlinear relationships between pathological phenotypes and genetic variations by integrating three different types of data: structural magnetic resonance imaging (sMRI), single-nucleotide polymorphism (SNP), and gene expression data. By incorporating nonlinear transformations in DGCCA, our model effectively uncovers nonlinear associations across multiple data types. Additionally, the DSR algorithm clusters samples with identical labels, incorporating label information into the nonlinear feature extraction process and thus enhancing the performance of association analysis. The application of the DSR-AdaSMDGCCA algorithm on real data sets identified several AD risk regions (such as the hippocampus, parahippocampus, and fusiform gyrus) and risk genes (including VSIG4, NEDD4L, and PINK1), achieving maximum classification accuracy with the fewest selected features compared to baseline algorithms. Molecular biology enrichment analysis revealed that the pathways enriched by these top genes are intimately linked to AD progression, affirming that our algorithm not only improves correlation analysis performance but also identifies biologically significant markers.</p></div>","PeriodicalId":652,"journal":{"name":"Journal of Molecular Neuroscience","volume":"74 2","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring Imaging Genetic Markers of Alzheimer’s Disease Based on a Novel Nonlinear Correlation Analysis Algorithm\",\"authors\":\"Renbo Yang, Wei Kong, Kun Liu, Gen Wen, Yaling Yu\",\"doi\":\"10.1007/s12031-024-02190-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Alzheimer’s disease (AD) is an irreversible neurological disorder characterized by insidious onset. Identifying potential markers in its emergence and progression is crucial for early diagnosis and treatment. Imaging genetics typically merges genetic variables with multiple imaging parameters, employing various association analysis algorithms to investigate the links between pathological phenotypes and genetic variations, and to unearth molecular-level insights from brain images. However, most existing imaging genetics algorithms based on sparse learning assume a linear relationship between genetic factors and brain functions, limiting their ability to discern complex nonlinear correlation patterns and resulting in reduced accuracy. To address these issues, we propose a novel nonlinear imaging genetic association analysis method, Deep Self-Reconstruction-based Adaptive Sparse Multi-view Deep Generalized Canonical Correlation Analysis (DSR-AdaSMDGCCA). This approach facilitates joint learning of the nonlinear relationships between pathological phenotypes and genetic variations by integrating three different types of data: structural magnetic resonance imaging (sMRI), single-nucleotide polymorphism (SNP), and gene expression data. By incorporating nonlinear transformations in DGCCA, our model effectively uncovers nonlinear associations across multiple data types. Additionally, the DSR algorithm clusters samples with identical labels, incorporating label information into the nonlinear feature extraction process and thus enhancing the performance of association analysis. The application of the DSR-AdaSMDGCCA algorithm on real data sets identified several AD risk regions (such as the hippocampus, parahippocampus, and fusiform gyrus) and risk genes (including VSIG4, NEDD4L, and PINK1), achieving maximum classification accuracy with the fewest selected features compared to baseline algorithms. Molecular biology enrichment analysis revealed that the pathways enriched by these top genes are intimately linked to AD progression, affirming that our algorithm not only improves correlation analysis performance but also identifies biologically significant markers.</p></div>\",\"PeriodicalId\":652,\"journal\":{\"name\":\"Journal of Molecular Neuroscience\",\"volume\":\"74 2\",\"pages\":\"\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2024-04-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Molecular Neuroscience\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s12031-024-02190-x\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://link.springer.com/article/10.1007/s12031-024-02190-x","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
阿尔茨海默病(AD)是一种不可逆的神经系统疾病,其特点是起病隐匿。识别阿尔茨海默病发生和发展的潜在标志物对于早期诊断和治疗至关重要。成像遗传学通常将遗传变异与多种成像参数相结合,采用各种关联分析算法来研究病理表型与遗传变异之间的联系,并从大脑图像中发现分子层面的见解。然而,现有的成像遗传学算法大多基于稀疏学习,假定遗传因素与大脑功能之间存在线性关系,从而限制了其辨别复杂的非线性相关模式的能力,导致准确性降低。为了解决这些问题,我们提出了一种新型非线性成像遗传关联分析方法--基于深度自重构的自适应稀疏多视角深度广义卡农关联分析(DSR-AdaSMDGCCA)。这种方法通过整合三种不同类型的数据:结构磁共振成像(sMRI)、单核苷酸多态性(SNP)和基因表达数据,促进了病理表型与遗传变异之间非线性关系的联合学习。通过在 DGCCA 中加入非线性变换,我们的模型可以有效地发现多种数据类型之间的非线性关联。此外,DSR 算法对具有相同标签的样本进行聚类,将标签信息纳入非线性特征提取过程,从而提高了关联分析的性能。DSR-AdaSMDGCCA 算法在真实数据集上的应用确定了几个 AD 风险区域(如海马、海马旁和纺锤形回)和风险基因(包括 VSIG4、NEDD4L 和 PINK1),与基线算法相比,以最少的特征选择达到了最高的分类准确率。分子生物学富集分析表明,这些顶级基因所富集的通路与AD进展密切相关,这证明我们的算法不仅提高了相关性分析性能,而且还能识别具有生物学意义的标记。
Exploring Imaging Genetic Markers of Alzheimer’s Disease Based on a Novel Nonlinear Correlation Analysis Algorithm
Alzheimer’s disease (AD) is an irreversible neurological disorder characterized by insidious onset. Identifying potential markers in its emergence and progression is crucial for early diagnosis and treatment. Imaging genetics typically merges genetic variables with multiple imaging parameters, employing various association analysis algorithms to investigate the links between pathological phenotypes and genetic variations, and to unearth molecular-level insights from brain images. However, most existing imaging genetics algorithms based on sparse learning assume a linear relationship between genetic factors and brain functions, limiting their ability to discern complex nonlinear correlation patterns and resulting in reduced accuracy. To address these issues, we propose a novel nonlinear imaging genetic association analysis method, Deep Self-Reconstruction-based Adaptive Sparse Multi-view Deep Generalized Canonical Correlation Analysis (DSR-AdaSMDGCCA). This approach facilitates joint learning of the nonlinear relationships between pathological phenotypes and genetic variations by integrating three different types of data: structural magnetic resonance imaging (sMRI), single-nucleotide polymorphism (SNP), and gene expression data. By incorporating nonlinear transformations in DGCCA, our model effectively uncovers nonlinear associations across multiple data types. Additionally, the DSR algorithm clusters samples with identical labels, incorporating label information into the nonlinear feature extraction process and thus enhancing the performance of association analysis. The application of the DSR-AdaSMDGCCA algorithm on real data sets identified several AD risk regions (such as the hippocampus, parahippocampus, and fusiform gyrus) and risk genes (including VSIG4, NEDD4L, and PINK1), achieving maximum classification accuracy with the fewest selected features compared to baseline algorithms. Molecular biology enrichment analysis revealed that the pathways enriched by these top genes are intimately linked to AD progression, affirming that our algorithm not only improves correlation analysis performance but also identifies biologically significant markers.
期刊介绍:
The Journal of Molecular Neuroscience is committed to the rapid publication of original findings that increase our understanding of the molecular structure, function, and development of the nervous system. The criteria for acceptance of manuscripts will be scientific excellence, originality, and relevance to the field of molecular neuroscience. Manuscripts with clinical relevance are especially encouraged since the journal seeks to provide a means for accelerating the progression of basic research findings toward clinical utilization. All experiments described in the Journal of Molecular Neuroscience that involve the use of animal or human subjects must have been approved by the appropriate institutional review committee and conform to accepted ethical standards.