Discovering Novel Biomarkers and Potential Therapeutic Targets of Amyotrophic Lateral Sclerosis Through Integrated Machine Learning and Gene Expression Profiling

IF 2.8 4区 医学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Farah Anjum, Abdulaziz Alsharif, Maha Bakhuraysah, Alaa Shafie, Md.Imtaiyaz Hassan, Taj Mohammad
{"title":"Discovering Novel Biomarkers and Potential Therapeutic Targets of Amyotrophic Lateral Sclerosis Through Integrated Machine Learning and Gene Expression Profiling","authors":"Farah Anjum,&nbsp;Abdulaziz Alsharif,&nbsp;Maha Bakhuraysah,&nbsp;Alaa Shafie,&nbsp;Md.Imtaiyaz Hassan,&nbsp;Taj Mohammad","doi":"10.1007/s12031-025-02340-9","DOIUrl":null,"url":null,"abstract":"<div><p>Amyotrophic lateral sclerosis (ALS) is a progressive neurodegenerative disorder that has multiple factors that make its molecular pathogenesis difficult to understand and its diagnosis and treatment during the early stages difficult to determine. Discovering novel biomarkers in ALS for diagnostic and therapeutic potential has become important. Consequently, bioinformatics and machine learning algorithms are useful for identifying differentially expressed genes (DEGs) and potential biomarkers, as well as understanding the molecular mechanisms and intricacies of diseases such as ALS. To achieve the aim of the present study, six datasets obtained from the Gene Expression Omnibus (GEO) were utilized and analyzed using an integrative bioinformatics and machine learning approach. Log transformation was done during data preprocessing, RMA normalization was performed, and the batch effect was corrected. Differential expression analysis identified 206 DEGs that were significantly associated with different biological processes, including muscle function, energy metabolism, and mitochondrial membrane activity. Functional enrichment analysis highlighted pathways, including those related to prion disease, Parkinson’s disease, and ATP synthesis via chemiosmotic coupling. We employed a multi-step machine learning framework incorporating random forest, LASSO regression, and SVM-RFE to identify robust biomarkers. This approach identified three key genes, <i>CHRNA1</i>, <i>DLG5</i>, and <i>PLA2G4C</i>, which could be explored as promising biomarkers for ALS after further validation. The internal validation, including principal component analysis (PCA) and ROC-AUC analysis, demonstrated strong diagnostic potential of these hub genes, achieving an AUC of 0.96. This work highlights the utility of bioinformatics and machine learning in identifying key genes as biomarkers for diagnostic and therapeutic potential in ALS.</p></div>","PeriodicalId":652,"journal":{"name":"Journal of Molecular Neuroscience","volume":"75 2","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://link.springer.com/article/10.1007/s12031-025-02340-9","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Amyotrophic lateral sclerosis (ALS) is a progressive neurodegenerative disorder that has multiple factors that make its molecular pathogenesis difficult to understand and its diagnosis and treatment during the early stages difficult to determine. Discovering novel biomarkers in ALS for diagnostic and therapeutic potential has become important. Consequently, bioinformatics and machine learning algorithms are useful for identifying differentially expressed genes (DEGs) and potential biomarkers, as well as understanding the molecular mechanisms and intricacies of diseases such as ALS. To achieve the aim of the present study, six datasets obtained from the Gene Expression Omnibus (GEO) were utilized and analyzed using an integrative bioinformatics and machine learning approach. Log transformation was done during data preprocessing, RMA normalization was performed, and the batch effect was corrected. Differential expression analysis identified 206 DEGs that were significantly associated with different biological processes, including muscle function, energy metabolism, and mitochondrial membrane activity. Functional enrichment analysis highlighted pathways, including those related to prion disease, Parkinson’s disease, and ATP synthesis via chemiosmotic coupling. We employed a multi-step machine learning framework incorporating random forest, LASSO regression, and SVM-RFE to identify robust biomarkers. This approach identified three key genes, CHRNA1, DLG5, and PLA2G4C, which could be explored as promising biomarkers for ALS after further validation. The internal validation, including principal component analysis (PCA) and ROC-AUC analysis, demonstrated strong diagnostic potential of these hub genes, achieving an AUC of 0.96. This work highlights the utility of bioinformatics and machine learning in identifying key genes as biomarkers for diagnostic and therapeutic potential in ALS.

通过集成机器学习和基因表达谱发现肌萎缩性侧索硬化症的新生物标志物和潜在治疗靶点
肌萎缩性侧索硬化症(ALS)是一种进行性神经退行性疾病,有多种因素,其分子发病机制难以理解,早期诊断和治疗难以确定。在渐冻症中发现新的生物标志物用于诊断和治疗已经变得非常重要。因此,生物信息学和机器学习算法对于识别差异表达基因(deg)和潜在的生物标志物,以及理解ALS等疾病的分子机制和复杂性非常有用。为了实现本研究的目的,利用生物信息学和机器学习的综合方法,利用从基因表达综合(GEO)中获得的六个数据集进行分析。在数据预处理过程中进行对数变换,进行RMA归一化,并对批处理效果进行校正。差异表达分析鉴定出206个与不同生物过程显著相关的deg,包括肌肉功能、能量代谢和线粒体膜活性。功能富集分析强调了包括朊病毒病、帕金森病和通过化学渗透偶联合成ATP相关的途径。我们采用了多步机器学习框架,结合随机森林、LASSO回归和SVM-RFE来识别稳健的生物标志物。该方法确定了三个关键基因,CHRNA1, DLG5和PLA2G4C,在进一步验证后可以作为ALS的有希望的生物标志物进行探索。内部验证,包括主成分分析(PCA)和ROC-AUC分析,显示这些枢纽基因具有很强的诊断潜力,AUC为0.96。这项工作强调了生物信息学和机器学习在识别关键基因作为ALS诊断和治疗潜力的生物标志物方面的效用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Molecular Neuroscience
Journal of Molecular Neuroscience 医学-神经科学
CiteScore
6.60
自引率
3.20%
发文量
142
审稿时长
1 months
期刊介绍: The Journal of Molecular Neuroscience is committed to the rapid publication of original findings that increase our understanding of the molecular structure, function, and development of the nervous system. The criteria for acceptance of manuscripts will be scientific excellence, originality, and relevance to the field of molecular neuroscience. Manuscripts with clinical relevance are especially encouraged since the journal seeks to provide a means for accelerating the progression of basic research findings toward clinical utilization. All experiments described in the Journal of Molecular Neuroscience that involve the use of animal or human subjects must have been approved by the appropriate institutional review committee and conform to accepted ethical standards.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信