Identification of Mitochondrial Dysfunction Genes as Diagnostic Biomarkers for Ischemic Stroke by Integrated Bioinformatics Analysis and Machine Learning

IF 2.7 4区 医学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Dandan Wu, Xiaolan Huang, Jie Li, Dingmin Mo, Weiwei Lan, Zihan Song, Li Su, Jianxiong Long, Jialei Yang
{"title":"Identification of Mitochondrial Dysfunction Genes as Diagnostic Biomarkers for Ischemic Stroke by Integrated Bioinformatics Analysis and Machine Learning","authors":"Dandan Wu,&nbsp;Xiaolan Huang,&nbsp;Jie Li,&nbsp;Dingmin Mo,&nbsp;Weiwei Lan,&nbsp;Zihan Song,&nbsp;Li Su,&nbsp;Jianxiong Long,&nbsp;Jialei Yang","doi":"10.1007/s12031-026-02533-w","DOIUrl":null,"url":null,"abstract":"<div><p> Current diagnostics for ischemic stroke (IS) lack timeliness and accessibility, highlighting the need for novel molecular diagnostic models. Three gene expression datasets (GSE16561, GSE22255 and GSE58294), encompassing both IS patients and healthy control subjects, were retrieved from a public database. The mitochondrial dysfunction genes retrieve from the intersection of the GeneCards and MitoCarta3.0 databases. The limma and WGCNA package were used to obtain the genes related to IS. Feature genes were screened using LASSO, RF, SVM, and diagnostic models were constructed using NeighborMethod, NeuralNet, and BayesMethod. 3548 differentially expressed genes (DEGs) (1538 upregulated, 2010 downregulated) were identified in IS patients when compared to controls. WGCNA analysis yielded 10 IS-related modules containing 1643 genes. The intersection of DEGs, module genes, and mitochondrial dysfunction genes yielded 100 mitochondrial dysfunction genes associated with IS. These genes collectively regulate biological processes like mitochondrial ATP synthesis coupled electron transport and respiratory electron transport chain, and participate in IS-associated signaling pathways such as reactive oxygen species and oxidative phosphorylation. Further machine learning methods identified 4 feature genes, including <i>MCL1</i>, <i>MRPL46</i>, <i>MTX3</i> and <i>RNASEH1</i>. These four genes exhibited robust diagnostic potential in the merged dataset (all AUC &gt; 0.7). The machine learning models achieved AUC values of 0.814 (NeighborMethod), 0.852 (NeuralNet), and 0.842 (BayesMethod). External validation using an independent cohort confirmed that all models maintained high diagnostic accuracy (AUC range: 0.730–0.783). This study established a multi-gene diagnostic model for IS, identifying novel molecular biomarkers to improve the timeliness and accessibility of IS diagnosis.</p></div>","PeriodicalId":652,"journal":{"name":"Journal of Molecular Neuroscience","volume":"76 2","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2026-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Molecular Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://link.springer.com/article/10.1007/s12031-026-02533-w","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Current diagnostics for ischemic stroke (IS) lack timeliness and accessibility, highlighting the need for novel molecular diagnostic models. Three gene expression datasets (GSE16561, GSE22255 and GSE58294), encompassing both IS patients and healthy control subjects, were retrieved from a public database. The mitochondrial dysfunction genes retrieve from the intersection of the GeneCards and MitoCarta3.0 databases. The limma and WGCNA package were used to obtain the genes related to IS. Feature genes were screened using LASSO, RF, SVM, and diagnostic models were constructed using NeighborMethod, NeuralNet, and BayesMethod. 3548 differentially expressed genes (DEGs) (1538 upregulated, 2010 downregulated) were identified in IS patients when compared to controls. WGCNA analysis yielded 10 IS-related modules containing 1643 genes. The intersection of DEGs, module genes, and mitochondrial dysfunction genes yielded 100 mitochondrial dysfunction genes associated with IS. These genes collectively regulate biological processes like mitochondrial ATP synthesis coupled electron transport and respiratory electron transport chain, and participate in IS-associated signaling pathways such as reactive oxygen species and oxidative phosphorylation. Further machine learning methods identified 4 feature genes, including MCL1, MRPL46, MTX3 and RNASEH1. These four genes exhibited robust diagnostic potential in the merged dataset (all AUC > 0.7). The machine learning models achieved AUC values of 0.814 (NeighborMethod), 0.852 (NeuralNet), and 0.842 (BayesMethod). External validation using an independent cohort confirmed that all models maintained high diagnostic accuracy (AUC range: 0.730–0.783). This study established a multi-gene diagnostic model for IS, identifying novel molecular biomarkers to improve the timeliness and accessibility of IS diagnosis.

Abstract Image

通过综合生物信息学分析和机器学习鉴定作为缺血性卒中诊断生物标志物的线粒体功能障碍基因。
目前缺血性脑卒中(IS)的诊断缺乏及时性和可及性,突出了对新型分子诊断模型的需求。三个基因表达数据集(GSE16561, GSE22255和GSE58294),包括IS患者和健康对照者,从公共数据库中检索。从GeneCards和MitoCarta3.0数据库的交集中检索线粒体功能障碍基因。利用limma和WGCNA包获得IS相关基因。使用LASSO、RF、SVM筛选特征基因,并使用NeighborMethod、NeuralNet和BayesMethod构建诊断模型。与对照组相比,在IS患者中鉴定出3548个差异表达基因(DEGs)(1538个上调,2010个下调)。WGCNA分析得到10个is相关模块,包含1643个基因。deg、模块基因和线粒体功能障碍基因的交集产生了100个与IS相关的线粒体功能障碍基因。这些基因共同调控线粒体ATP合成耦合电子传递和呼吸电子传递链等生物过程,并参与活性氧和氧化磷酸化等is相关信号通路。进一步的机器学习方法确定了4个特征基因,包括MCL1、MRPL46、MTX3和RNASEH1。这四个基因在合并的数据集中显示出强大的诊断潜力(所有AUC均为>.7)。机器学习模型的AUC值分别为0.814 (NeighborMethod)、0.852 (NeuralNet)和0.842 (BayesMethod)。使用独立队列的外部验证证实,所有模型都保持了较高的诊断准确性(AUC范围:0.730-0.783)。本研究建立IS多基因诊断模型,寻找新的分子生物标志物,提高IS诊断的时效性和可及性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Molecular Neuroscience
Journal of Molecular Neuroscience 医学-神经科学
CiteScore
6.60
自引率
3.20%
发文量
142
审稿时长
1 months
期刊介绍: The Journal of Molecular Neuroscience is committed to the rapid publication of original findings that increase our understanding of the molecular structure, function, and development of the nervous system. The criteria for acceptance of manuscripts will be scientific excellence, originality, and relevance to the field of molecular neuroscience. Manuscripts with clinical relevance are especially encouraged since the journal seeks to provide a means for accelerating the progression of basic research findings toward clinical utilization. All experiments described in the Journal of Molecular Neuroscience that involve the use of animal or human subjects must have been approved by the appropriate institutional review committee and conform to accepted ethical standards.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书