Identification of hub gene and immune infiltration in Lyme disease revealed by weighted gene co-expression network analysis and machine learning.

IF 3.4 3区 医学 Q2 MEDICINE, RESEARCH & EXPERIMENTAL
Yan Dong, Meng Liu, Yanshuang Luo, Yantong Chen, Xuesong Chen, Xiaorong Liu, Xingbo Cai, Fusong Yang, Chao Song, Guozhong Zhou
{"title":"Identification of hub gene and immune infiltration in Lyme disease revealed by weighted gene co-expression network analysis and machine learning.","authors":"Yan Dong, Meng Liu, Yanshuang Luo, Yantong Chen, Xuesong Chen, Xiaorong Liu, Xingbo Cai, Fusong Yang, Chao Song, Guozhong Zhou","doi":"10.1186/s40001-025-03108-y","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Lyme disease (LD), caused by the spirochete Borrelia burgdorferi (Bb), is a multisystem disorder with early symptoms such as erythema migrans and late manifestations including arthritis and neuroborreliosis. The molecular mechanisms driving tissue damage and inflammatory dysregulation in LD remain incompletely characterized. Given the central role of peripheral blood mononuclear cells (PBMCs) in orchestrating immune responses, we aimed to identify optimal feature genes (OFGs) within PBMCs associated with LD pathogenesis and delineate their immune infiltration patterns using integrated bioinformatics.</p><p><strong>Methods: </strong>Transcriptomic datasets (GSE42606, GSE68765, GSE103481) were retrieved from GEO. Differential expression analysis identified LD-related genes. Weighted Gene Co-expression Network Analysis (WGCNA) screened disease-associated modules. Feature selection was performed via SVM-Recursive Feature Elimination (SVM-RFE), Least absolute shrinkage and selection operator (LASSO) regression, and random forest (RF) to pinpoint OFGs. Immune cell infiltration was quantified using CIBERSORT, followed by correlation analysis between OFGs and immune subsets. The Single-gene gene set enrichment analysis (GSEA) was performed to explore the functional associations of OFGs. Biological pathways linked to OFGs were inferred by single-sample GSEA (ssGSEA). Diagnostic utility was assessed via ROC curves and nomogram modeling. Finally, we used RT-qPCR to confirm the bioinformatics results.</p><p><strong>Results: </strong>Our study identified 174 DEGs among the LD patients, with 156 genes located within the \"turquoise\" module by WGCNA, exhibiting the most robust correlation with clinical characteristics. Among these, KIAA1199 turned out to be the unique OFG, selected via three distinct machine learning methodologies, possessing exceptional diagnostic potential. The Single-gene gene set enrichment analysis showed KIAA1199 was strongly correlated with multiple immune-related pathways. Furthermore, RT-qPCR validated candidate gene expression within a THP-1 cellular model.</p><p><strong>Conclusion: </strong>In conclusion, this study integrated WGCNA and machine learning methodologies to identify one core gene associated with LD from PBMC gene expression data: KIAA1199. The predictive model constructed using these genes demonstrated robust diagnostic accuracy, providing a basis for further research on host immune responses and the development of new diagnostic methods.</p>","PeriodicalId":11949,"journal":{"name":"European Journal of Medical Research","volume":"30 1","pages":"860"},"PeriodicalIF":3.4000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12465215/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Medical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40001-025-03108-y","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Lyme disease (LD), caused by the spirochete Borrelia burgdorferi (Bb), is a multisystem disorder with early symptoms such as erythema migrans and late manifestations including arthritis and neuroborreliosis. The molecular mechanisms driving tissue damage and inflammatory dysregulation in LD remain incompletely characterized. Given the central role of peripheral blood mononuclear cells (PBMCs) in orchestrating immune responses, we aimed to identify optimal feature genes (OFGs) within PBMCs associated with LD pathogenesis and delineate their immune infiltration patterns using integrated bioinformatics.

Methods: Transcriptomic datasets (GSE42606, GSE68765, GSE103481) were retrieved from GEO. Differential expression analysis identified LD-related genes. Weighted Gene Co-expression Network Analysis (WGCNA) screened disease-associated modules. Feature selection was performed via SVM-Recursive Feature Elimination (SVM-RFE), Least absolute shrinkage and selection operator (LASSO) regression, and random forest (RF) to pinpoint OFGs. Immune cell infiltration was quantified using CIBERSORT, followed by correlation analysis between OFGs and immune subsets. The Single-gene gene set enrichment analysis (GSEA) was performed to explore the functional associations of OFGs. Biological pathways linked to OFGs were inferred by single-sample GSEA (ssGSEA). Diagnostic utility was assessed via ROC curves and nomogram modeling. Finally, we used RT-qPCR to confirm the bioinformatics results.

Results: Our study identified 174 DEGs among the LD patients, with 156 genes located within the "turquoise" module by WGCNA, exhibiting the most robust correlation with clinical characteristics. Among these, KIAA1199 turned out to be the unique OFG, selected via three distinct machine learning methodologies, possessing exceptional diagnostic potential. The Single-gene gene set enrichment analysis showed KIAA1199 was strongly correlated with multiple immune-related pathways. Furthermore, RT-qPCR validated candidate gene expression within a THP-1 cellular model.

Conclusion: In conclusion, this study integrated WGCNA and machine learning methodologies to identify one core gene associated with LD from PBMC gene expression data: KIAA1199. The predictive model constructed using these genes demonstrated robust diagnostic accuracy, providing a basis for further research on host immune responses and the development of new diagnostic methods.

基于加权基因共表达网络分析和机器学习的莱姆病中枢基因鉴定与免疫浸润
莱姆病(LD)是由伯氏疏螺旋体(Bb)引起的一种多系统疾病,早期症状为迁移性红斑,晚期表现为关节炎和神经疏螺旋体病。LD中驱动组织损伤和炎症失调的分子机制仍不完全清楚。考虑到外周血单个核细胞(PBMCs)在协调免疫应答中的核心作用,我们旨在确定与LD发病机制相关的PBMCs中的最佳特征基因(OFGs),并利用综合生物信息学描述其免疫浸润模式。方法:从GEO检索转录组数据集(GSE42606, GSE68765, GSE103481)。差异表达分析鉴定出ld相关基因。加权基因共表达网络分析(WGCNA)筛选疾病相关模块。通过svm -递归特征消除(SVM-RFE)、最小绝对收缩和选择算子(LASSO)回归以及随机森林(RF)进行特征选择,以确定ofg。采用CIBERSORT定量免疫细胞浸润,分析ofg与免疫亚群的相关性。采用单基因基因集富集分析(GSEA)探讨了OFGs的功能关联。通过单样本GSEA (ssGSEA)推断与OFGs相关的生物学途径。通过ROC曲线和nomogram建模来评估诊断效用。最后,我们使用RT-qPCR对生物信息学结果进行验证。结果:我们的研究在LD患者中鉴定出174个deg,其中156个基因被WGCNA定位在“绿松石”模块中,与临床特征的相关性最强。其中,KIAA1199被证明是独特的OFG,通过三种不同的机器学习方法选择,具有非凡的诊断潜力。单基因基因集富集分析显示KIAA1199与多种免疫相关通路密切相关。此外,RT-qPCR验证了候选基因在THP-1细胞模型中的表达。结论:本研究结合WGCNA和机器学习方法,从PBMC基因表达数据中鉴定出一个与LD相关的核心基因KIAA1199。利用这些基因构建的预测模型具有较强的诊断准确性,为进一步研究宿主免疫应答和开发新的诊断方法提供了基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
European Journal of Medical Research
European Journal of Medical Research 医学-医学:研究与实验
CiteScore
3.20
自引率
0.00%
发文量
247
审稿时长
>12 weeks
期刊介绍: European Journal of Medical Research publishes translational and clinical research of international interest across all medical disciplines, enabling clinicians and other researchers to learn about developments and innovations within these disciplines and across the boundaries between disciplines. The journal publishes high quality research and reviews and aims to ensure that the results of all well-conducted research are published, regardless of their outcome.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信