Exploring the potential biomarkers between stroke and obstructive sleep apnea by WGCNA and machine learning.

IF 2
Lin Zhou, Pengfan Ye, Yiming Wang
{"title":"Exploring the potential biomarkers between stroke and obstructive sleep apnea by WGCNA and machine learning.","authors":"Lin Zhou, Pengfan Ye, Yiming Wang","doi":"10.1007/s11325-025-03369-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Ischemic stroke (IS) and obstructive sleep apnea (OSA) are highly prevalent disorders with significant societal and individual burdens. OSA exacerbates stroke outcomes, elevates recurrent stroke risk, and impedes functional recovery. Identifying shared biomarkers and elucidating the molecular mechanisms linking IS and OSA have been critical for developing targeted therapies and improving patient prognosis.</p><p><strong>Methods: </strong>Transcriptomic data for IS and OSA were obtained from the GEO database (GSE58294, GSE135917, GSE38792, and GSE22255). After batch-effect correction, weighted gene co-expression network analysis (WGCNA) and differential expression analysis were performed to identify disease-associated genes. Functional enrichment analysis and a protein-protein interaction network construction were conducted. Advanced machine learning algorithms-Least Absolute Shrinkage and Selection Operator (LASSO) regression and random forests-were applied to screen hub genes, followed by validation of their diagnostic performance. Patients were stratified into high-and low-expression groups based on hub genes levels, and gene set enrichment analysis (GSEA) was performed to characterize pathway activity.</p><p><strong>Results: </strong>Integration of WGCNA and differential expression analysis revealed 112 shared differentially expressed genes (DEGs) significantly associated with IS and OSA. Enrichment analysis implicated these DEGs in critical processes, including protein ubiquitination, fatty acid metabolism, cell proliferation and apoptosis, autophagy, cyclooxygenase pathway, and chromatin remodeling. Machine learning identified DUSP1 as a central hub gene, with significantly elevated expression in both IS and OSA. Diagnostic validation demonstrated robust performance for DUSP1 (AUCs: 1.000 in GSE58294, 0.885 in GSE135917, 0.718 in GSE22255), though variability was observed in GSE38792 (AUC: 0.487). GSEA highlighted distinct pathway signatures: high DUSP1 expression correlated with activation of ribosome, spliceosome, and nucleocytoplasmic transport pathway, while suppressing Ras/Rap1 signaling, platelet activation, PI3K-AKT signaling, IL-17 signaling, and immune-related pathways (e.g., Fc gamma R-mediated phagocytosis, cytokine-cytokine receptor interaction, and B cell receptor signaling pathway).</p><p><strong>Conclusion: </strong>Through integrative bioinformatics and machine learning, this study identifies DUSP1 as a novel hub gene linking IS and OSA. Functional annotation reveal its involvement in shared biological pathways, offering mechanistic insights into disease pathogenesis and highlighting DUSP1 as a potential therapeutic target.</p>","PeriodicalId":520777,"journal":{"name":"Sleep & breathing = Schlaf & Atmung","volume":"29 4","pages":"219"},"PeriodicalIF":2.0000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sleep & breathing = Schlaf & Atmung","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s11325-025-03369-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Ischemic stroke (IS) and obstructive sleep apnea (OSA) are highly prevalent disorders with significant societal and individual burdens. OSA exacerbates stroke outcomes, elevates recurrent stroke risk, and impedes functional recovery. Identifying shared biomarkers and elucidating the molecular mechanisms linking IS and OSA have been critical for developing targeted therapies and improving patient prognosis.

Methods: Transcriptomic data for IS and OSA were obtained from the GEO database (GSE58294, GSE135917, GSE38792, and GSE22255). After batch-effect correction, weighted gene co-expression network analysis (WGCNA) and differential expression analysis were performed to identify disease-associated genes. Functional enrichment analysis and a protein-protein interaction network construction were conducted. Advanced machine learning algorithms-Least Absolute Shrinkage and Selection Operator (LASSO) regression and random forests-were applied to screen hub genes, followed by validation of their diagnostic performance. Patients were stratified into high-and low-expression groups based on hub genes levels, and gene set enrichment analysis (GSEA) was performed to characterize pathway activity.

Results: Integration of WGCNA and differential expression analysis revealed 112 shared differentially expressed genes (DEGs) significantly associated with IS and OSA. Enrichment analysis implicated these DEGs in critical processes, including protein ubiquitination, fatty acid metabolism, cell proliferation and apoptosis, autophagy, cyclooxygenase pathway, and chromatin remodeling. Machine learning identified DUSP1 as a central hub gene, with significantly elevated expression in both IS and OSA. Diagnostic validation demonstrated robust performance for DUSP1 (AUCs: 1.000 in GSE58294, 0.885 in GSE135917, 0.718 in GSE22255), though variability was observed in GSE38792 (AUC: 0.487). GSEA highlighted distinct pathway signatures: high DUSP1 expression correlated with activation of ribosome, spliceosome, and nucleocytoplasmic transport pathway, while suppressing Ras/Rap1 signaling, platelet activation, PI3K-AKT signaling, IL-17 signaling, and immune-related pathways (e.g., Fc gamma R-mediated phagocytosis, cytokine-cytokine receptor interaction, and B cell receptor signaling pathway).

Conclusion: Through integrative bioinformatics and machine learning, this study identifies DUSP1 as a novel hub gene linking IS and OSA. Functional annotation reveal its involvement in shared biological pathways, offering mechanistic insights into disease pathogenesis and highlighting DUSP1 as a potential therapeutic target.

通过WGCNA和机器学习探索卒中和阻塞性睡眠呼吸暂停之间的潜在生物标志物。
背景:缺血性卒中(IS)和阻塞性睡眠呼吸暂停(OSA)是非常普遍的疾病,具有显著的社会和个人负担。阻塞性睡眠呼吸暂停会加重卒中结局,增加卒中复发风险,并阻碍功能恢复。确定IS和OSA之间的共同生物标志物和分子机制对于开发靶向治疗和改善患者预后至关重要。方法:从GEO数据库(GSE58294、GSE135917、GSE38792和GSE22255)中获取IS和OSA的转录组学数据。经批效应校正后,进行加权基因共表达网络分析(WGCNA)和差异表达分析,鉴定疾病相关基因。进行了功能富集分析和蛋白质相互作用网络构建。先进的机器学习算法-最小绝对收缩和选择算子(LASSO)回归和随机森林-应用于筛选中心基因,然后验证其诊断性能。根据枢纽基因水平将患者分为高表达组和低表达组,并进行基因集富集分析(GSEA)来表征途径活性。结果:整合WGCNA和差异表达分析,发现112个共享差异表达基因(DEGs)与IS和OSA显著相关。富集分析表明这些deg参与关键过程,包括蛋白质泛素化、脂肪酸代谢、细胞增殖和凋亡、自噬、环氧化酶途径和染色质重塑。机器学习发现DUSP1是中心枢纽基因,在IS和OSA中表达显著升高。诊断验证显示DUSP1具有良好的性能(AUC: GSE58294为1.000,GSE135917为0.885,GSE22255为0.718),尽管在GSE38792中观察到变异性(AUC: 0.487)。GSEA突出了不同的途径特征:DUSP1的高表达与核糖体、剪接体和核质转运途径的激活相关,同时抑制Ras/Rap1信号传导、血小板激活、PI3K-AKT信号传导、IL-17信号传导和免疫相关途径(如Fc γ r介导的吞噬、细胞因子-细胞因子受体相互作用和B细胞受体信号传导途径)。结论:通过综合生物信息学和机器学习,本研究确定了DUSP1是连接IS和OSA的新型枢纽基因。功能注释揭示了其参与共享的生物学途径,为疾病发病机制提供了机制见解,并突出了DUSP1作为潜在的治疗靶点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信